Monarch geneset OGS2.0

DPOGS207408
Transcript: DPOGS207408-TA, 1587 bp
Protein: DPOGS207408-PA, 528 aa
Genomic position: DPSCF300087 - 262063-267705
RNAseq coverage: 1032x (Rank: top 12%)
Annotation
Heliconius: HMEL014864 (E-value 0.0, identity 88.94%)
Bombyx: BGIBMGA009376-TA (E-value 0.0, identity 79.64%)
Drosophila: CG4797-PB (E-value 4e-58, identity 31.06%)
EBI UniRef50: UniRef50_UPI00020608CB (E-value 4e-97, identity 42.42%), UPI00020608CB related cluster n=1 Tax=unknown RepID=UPI00020608CB
NCBI RefSeq: XP_001944504.1 (E-value 2e-97, identity 42.76%), PREDICTED: similar to AGAP007667-PA [Acyrthosiphon pisum]
NCBI nr blastp: gi|380021871 (E-value 2e-99, identity 40.77%), PREDICTED: facilitated trehalose transporter Tret1-like [Apis florea]
NCBI nr blastx: gi|328701837 (E-value 8e-99, identity 42.61%), PREDICTED: facilitated trehalose transporter Tret1-like [Acyrthosiphon pisum]
Group
Gene Ontology:
GO:0055085, 1.1e-56, transmembrane transport
GO:0016021, 1.1e-56, integral to membrane
GO:0022857, 1.1e-56, transmembrane transporter activity
GO:0016020, 3.3e-05, membrane
GO:0022891, 3.3e-05, substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain:
[1-479] IPR016196, 1.6e-57, Major facilitator superfamily domain, general substrate transporter
[73-473] IPR005828, 1.1e-56, General substrate transporter
Orthology group: MCL18350 Insect specific

Nucleotide sequence:

>DPOGS207408-TA
ATGACCTCAAATGAACTCAATGTGTTTATGAAGTGGAGACTTGTTCGTTTAGTTGCTTGTCGACTACACCGTAGAATGCCAACTTCCACAACAAACGATATCCATTTAGAATCTGTTGGAGTCAACATCGCTGTGCTGGCGGCCTTGGCGGCGCAGTCGATAAACATATCGGTCGGCTTCTGCCAGGGTTTCTCCGCCGTACTGCTGCCGCAGTACACCAGAGACCACCCTGGCATATCGTCAGAACAGACCTCATGGATCGCAAGCCTTGGTGTTATATCGAACCCCATAGGAGCTCTCTTGGGCGGTATGATGGTGGATGCCGTCGGTCGAAGGTTACTCCTTCAGTCAATAGTTCTTCCGAACCTGATCGGCTGGTTGGTCATAGCTTTATCGGATACATATGTCTTCTTGTGTGTCGGACGATTCATCACTGGCTTCACTATCGGAATGTCCACGGCATCTTATATCTACGTAGCTGAGATTACGACTCCCGAAAAAAGGGGAGTACTAAGTGCTCTGGGTCCGGGGTTGGTTTCCACCGGTATATTTATAGTGTACTCTTTGGGCGCCTTCATACATTGGAGGACGGTTGCAGCGATATGTGCTGCAGTGTCTTTATTGACGCCGTTCTTGATGTACTTCGTGCCTGAATCACCGCTGTGGCTGGCCTCCAAGGGGCAAATGAAGGAAGCCTACGACGCAATGTTCTGGCTGAGACAGAACAATAACACAGCCCAGCAGGAGCTCATGGAGTTCACCAAGGACCGAAAACAGAACGAGTCGATGACTTTCAAACAAAAGCTTGGGTTGTTTAAGAGGAGGAGCGTTCTGAAACCGTTCGCTTTGCTGATCATATTCTTCATGTTCCAAGAGATGTCTGGAATTTACGTTATTTTATATTATGCTGTGGACTTTTTTAAATCTGTTGGGACGAGTGTTAACGAATTTACAGCTTCCATAATTGTAGGAGGAGTGAGGGTTTTTATGGGGGCCGTAGGAGCTTGTCTCATCAATAGTTTTAGAAGAAAAACTTTAGCTGCTGCTTCGGGTCTCCTTTTGGGAGTTGCAATGTTGGGAGCCGCTGTTTGCGACAGTTTGAACGGCCCACCGTCTATTAAATTGGGTTGTATTCTTCTACACGTTTCTTTCAGCATGGTTGGCTTTTTACAATTACCATGGATTATGTCTGGTGAACTGTATCCTCAGGATATTAGAGGCATTATGTCTGGAGCGACCTCATGCTGTGCTTATGTCCTCATCTTCTTCAATATTAAAACATATCCACAGTTGGAGTCTCTGGTGACCAGTAATGGAACGCTATATATTTTCGCTATTTGTGCTATACTCGGAGCAACCTATTGTTACTTGTTCTTGCCGGAGACGAAAGGAAAGACCTTGACGGAGATCATGAGGCAGTTCGACGAAGAGAAGAAAGAGAACGATCCAGAAATAGGATACATGAAACATGAAAGTGGAGAAGTAAAATCGATACAAAGAAGACACAGCGCGGGAGCAGCTGTCTCATTGGAGAAGAGTAAGGAATTGGTGAAGCAGTGGACGCAGAATAATAACGACAAGAAATAA

Protein sequence:

>DPOGS207408-PA
MTSNELNVFMKWRLVRLVACRLHRRMPTSTTNDIHLESVGVNIAVLAALAAQSINISVGFCQGFSAVLLPQYTRDHPGISSEQTSWIASLGVISNPIGALLGGMMVDAVGRRLLLQSIVLPNLIGWLVIALSDTYVFLCVGRFITGFTIGMSTASYIYVAEITTPEKRGVLSALGPGLVSTGIFIVYSLGAFIHWRTVAAICAAVSLLTPFLMYFVPESPLWLASKGQMKEAYDAMFWLRQNNNTAQQELMEFTKDRKQNESMTFKQKLGLFKRRSVLKPFALLIIFFMFQEMSGIYVILYYAVDFFKSVGTSVNEFTASIIVGGVRVFMGAVGACLINSFRRKTLAAASGLLLGVAMLGAAVCDSLNGPPSIKLGCILLHVSFSMVGFLQLPWIMSGELYPQDIRGIMSGATSCCAYVLIFFNIKTYPQLESLVTSNGTLYIFAICAILGATYCYLFLPETKGKTLTEIMRQFDEEKKENDPEIGYMKHESGEVKSIQRRHSAGAAVSLEKSKELVKQWTQNNNDKK-
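
The transcript and protein records above can be cross-checked against each other: 1587 bp is 529 codons, which is 528 residues plus a stop codon, and the protein record marks that stop with a trailing "-". The following is a minimal Python sketch of that check, assuming the standard genetic code applies to this nuclear gene. The function names `translate` and `expected_protein_length` are illustrative and not part of the database or of any library.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    used in the protein record above. Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS that includes its terminal stop codon."""
    return cds_length_bp // 3 - 1


# First six codons of DPOGS207408-TA, copied from the record above.
print(translate("ATGACCTCAAATGAACTC"))   # MTSNEL
# Last four codons, including the TAA stop.
print(translate("GACAAGAAATAA"))         # DKK-
print(expected_protein_length(1587))     # 528
```

Passing the full DPOGS207408-TA sequence to `translate` and comparing the result with the DPOGS207408-PA record would extend the same check to the whole gene.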