Monarch geneset OGS2.0

DPOGS209874
Transcript        DPOGS209874-TA   1359 bp
Protein           DPOGS209874-PA   452 aa
Genomic position  DPSCF300302 + 100075-107940
RNAseq coverage   239x (Rank: top 43%)
Annotation
Heliconius        HMEL007532         4e-178   71.07%
Bombyx            BGIBMGA004436-TA   0.0      79.42%
Drosophila        CG10960-PB         6e-50    27.83%
EBI UniRef50      UniRef50_B0WPL1    1e-65    33.63%   Sugar transporter n=5 Tax=Culicidae RepID=B0WPL1_CULQU
NCBI RefSeq       XP_001850645.1     2e-66    33.63%   sugar transporter [Culex quinquefasciatus]
NCBI nr blastp    gi|170046161       4e-65    33.63%   sugar transporter [Culex quinquefasciatus]
NCBI nr blastx    gi|170046161       2e-66    33.78%   sugar transporter [Culex quinquefasciatus]
Group
Gene Ontology     GO:0055085   6.6e-62   transmembrane transport
                  GO:0016021   6.6e-62   integral to membrane
                  GO:0022857   6.6e-62   transmembrane transporter activity
KEGG pathway
InterPro domain   [26-427] IPR005828   6.6e-62   General substrate transporter
                  [15-426] IPR016196   1.4e-50   Major facilitator superfamily domain, general substrate transporter
Orthology group   MCL25019   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209874-TA
ATGGGCTGGATATCACCGAATAAGAAACTCCTCATGGGGAAAGATTCTCCATCGAATCCACCATTGACGGAAGACGACATATCATGGATGGCTTCCATTATGTTTATATTCGCCCCCATAGCTGTCTTCATGTACGGAACAGCGGCTGATAGGTTTGGAAGAAAAAGAGCACTCCTCTGCGCTTCAATTCCAATATCGATCGGCTGGGCCATCAAGTTAATATCCGCACGTCCTGTAGCCCTGATAGCCGCACGCGCTCTAATCGGCTTCGGTTCAGGCGGCGGTTTTGTGGTTTGTCCGCTATATGTTAAAGAAATAAGTGAAGACAGCATCAGAGGCATGACAGGAACCTTCGTTATATTCTCACAGACGGTTGGGAATTTACTGATATTTGTTCTTGGTGATCTGCTGCCGTTTCACACGGTCCTTTGGATACTCCTTGCTGTGCCGTTAGTACACTTCTGCGTGCTCCTCAAGTTACCGGAGACGCCGTCCTATCTCATCAAGTGTGGCAAAAATGAGGAGACAGCAAAAGTATTAGGCTGGCTGCGATCACTTCCGCCCACAGATAAAACAATCACAGAAGAGGTTGATAGACTTAATATAGAGCAGACAAAATGTGAACCGAAGTTTTCTCCTAGATTATTATTTTCAGATAAGACCGCCTTGAAAGCCTTCTGGGTAGCGCTCATAGTAAACCTCACCAGAGAATTTTGTGGCTGCATTGCCGTTTTAGTTTACGCGAGCCACATCTTCACTGAAGCTGGGAAAGATCAAAATTCAAGCATCTCATTGTCACCCAACAAACAGTCGATTGTGCTCGCTGCTGTACAGATATTTGGGTCGTTTTTGGCGTGCCAACTCGTTGATAGGGCTGGAAGAAAGCCGCTTCTAGCGCTAACAAGCGCTCTCGCTGGCTTCAGTCTCTGTGTGCTAGGCGCATGGTTCTACCTTCAGAGTGTGGGTACAGCGCTGGCCGGCTGGTTACCAATCGCCGCTTTGTGTACTTGTATTTTTGCGGACGCTTTGGGATTACAGCCCTTGCCATTCGTTATAATGACCGAGATGTTCGGTTTTCAGTTGCGGGGCACAGTGGCAACACTAATCATGGCGGTATCCTTGGGCACTGATTTCGCGCTTTTAAAACTCTTCGCGCCATTAAATTCGTGGATCGGATACCACTACACCTTTTGGGGTTTTAGCTTTATATGTCTATCGAACGTGTTCTATCTCATATTCTGCGTGCCGGAAACTAAAATGAGATCCCTAGAAGATATTTATGCTGATTTAGAAGGCAGGAGTAAAACTAACGACAAAAAAGTCGTAAACGATACTATAGTGGAATCTCAACATGTATAG

Protein sequence:

>DPOGS209874-PA
MGWISPNKKLLMGKDSPSNPPLTEDDISWMASIMFIFAPIAVFMYGTAADRFGRKRALLCASIPISIGWAIKLISARPVALIAARALIGFGSGGGFVVCPLYVKEISEDSIRGMTGTFVIFSQTVGNLLIFVLGDLLPFHTVLWILLAVPLVHFCVLLKLPETPSYLIKCGKNEETAKVLGWLRSLPPTDKTITEEVDRLNIEQTKCEPKFSPRLLFSDKTALKAFWVALIVNLTREFCGCIAVLVYASHIFTEAGKDQNSSISLSPNKQSIVLAAVQIFGSFLACQLVDRAGRKPLLALTSALAGFSLCVLGAWFYLQSVGTALAGWLPIAALCTCIFADALGLQPLPFVIMTEMFGFQLRGTVATLIMAVSLGTDFALLKLFAPLNSWIGYHYTFWGFSFICLSNVFYLIFCVPETKMRSLEDIYADLEGRSKTNDKKVVNDTIVESQHV-
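
Consistency check (illustrative sketch, not part of the original record):

The transcript length (1359 bp) and protein length (452 aa) agree if the coding sequence is 452 codons plus one stop codon, since 3 x (452 + 1) = 1359. The Python sketch below shows one way to confirm this kind of relationship by translating a coding sequence with the standard genetic code. The function name `translate` and the `stop_symbol` parameter are hypothetical names chosen for this example. It assumes the standard codon table applies and that the record writes the stop codon as "-", as the protein sequence above appears to do. Only the first and last few codons of DPOGS209874-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# by base T, C, A, G at each position. "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the trailing character of the protein sequence above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last four codons of DPOGS209874-TA.
first_codons = "ATGGGCTGGATATCACCGAATAAG"
last_codons = "CAACATGTATAG"

print(translate(first_codons))  # MGWISPNK
print(translate(last_codons))   # QHV-

# Length bookkeeping: 452 residues plus one stop codon.
print(3 * (452 + 1))            # 1359
```

Running the same function on the full nucleotide sequence and comparing the result with DPOGS209874-PA would extend the check to the whole record.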