Monarch geneset OGS2.0

DPOGS213924
TranscriptDPOGS213924-TA1410 bp
ProteinDPOGS213924-PA469 aa
Genomic positionDPSCF300218 + 274417-279348
RNAseq coverage121x (Rank: top 57%)
Annotation
HeliconiusHMEL0060660.078.86% 
BombyxBGIBMGA004629-TA0.071.46% 
DrosophilaTret1-2-PA1e-4227.16% 
EBI UniRef50UniRef50_D2A5A61e-7735.87%Putative uncharacterized protein GLEAN_15501 n=2 Tax=Tribolium castaneum RepID=D2A5A6_TRICA
NCBI RefSeqXP_973763.12e-7835.87%PREDICTED: similar to sugar transporter [Tribolium castaneum]
NCBI nr blastpgi|910845694e-7735.87%PREDICTED: similar to sugar transporter [Tribolium castaneum]
NCBI nr blastxgi|1571168484e-8034.60%sugar transporter [Aedes aegypti]
Group
Gene OntologyGO:00550853.3e-51transmembrane transport
GO:00160213.3e-51integral to membrane
GO:00228573.3e-51transmembrane transporter activity
GO:00160202.5e-08membrane
GO:00228912.5e-08substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain[43-443] IPR0058283.3e-51General substrate transporter
[1-460] IPR0161968.9e-48Major facilitator superfamily domain, general substrate transporter
[97-116] IPR0036632.5e-08Sugar/inositol transporter
Orthology groupMCL25042 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213924-TA
ATGTTAGCAGTAAGCGGTGTGGTTATGTACATGATAGGCACCGGCGCCAACATCGCTTTCCCCGGAGTACTGCTGCAACAGTTGCGACAACCTGGATCAGTGTTGAAACTTACCCTGGAACATGAGTCCTGGATTGCCTCGATCCTCGGCTTGGCGCTCATTAGTGGTATCTTAGTGGCACCATTCATGATGCAGAGGCTGGGTCGTCGTCTTAGCAACATGCTGAGTACTCTGCCCTCGCTGGCTGGTTGGGCTCTCATGGTAGCGGCTAAGGATCCTACCGCACTGCTCATCAGTCGTTCCTTGCAGGGTTTCGGTATGGGAGTTCAGGCTGCAGCAGCTCCTATATCCATCGCGGAGTATTCTGCTCCAAGGCATAGAGGTGCGTTCCTCGCTACGATCGCGTTCTCCTTCGCCACAGGGATGTTGATCGCTCATATCTTCGGCACCATTCTATTCTGGCGTCAGGCTGCCCTGGCCTGTGGCAGTTTCTACGTTCTATCGCTGATCCTCATATCATTGTCGCCGGAAACTCCACCGTATTTGGCATCCGTCGGCAAGTTTGAGGATTGTCGCAAAACATTCAGGTGGCTCAGAGGGAGTGACGACGAATCGGAAAAAGAATTAGAAGTAATGCTCAACAGCCAGAAGAAGAAGACTATCGTGTCACCGGAAGTATCGAAGATCAAATACTACATGAACATTGTTATGTCGCCGGGTTTTTACAAACCGACCGTTATAATGATGTTCATGTTCGTGTTGTTTCAAATCTCCGGCATGACGGTGGTGCCATCGTACACTGTGCCAATGATGAACGAGGTCAGCGGCGGCCACATAGAGTCCTACACCAGCATGCTGATGGTAGATATAGTGAGGTTCGCTACAGCCGTTCTATCTTGTGTGGTCGTCAATAAATTTAATCGACGAACCGTATTGTTTTTTGGCATATATGTCAGTGTGGTTTCATTATTATTGACGTCGATTCTATTGTACGTGAGAGACTTCGGATATTTGCCCGAAAAGTATAAATGGATTCCTGTGATACCCACTCTGGTGTACATATTTGGTAAGACCATAGGTATTCTTCCTATCCCCTGGGCCATAGCCGGCGAGATCTTCCCGTTGGCTTATAGATCTCTTGGCAGTGGCATATCTGGCATGTTTCTCTCGCTCATGTTCTTTGTCGTCGTTAAAACAGCACCAACCTCCTTCAGGCAGATCGGCGTCAAGGGTACGTTCTGTCTTTATGGCCTCTGTATTGCTTTATGTGGCGCATTTCTCTACTACCTCTTGCCAGAAACGAAGGGCAAGACTTTGTATGAGATCGAATGTCACTTCAAAGGTGTCAAGGACACAAAAGGTAATGTCGAAGAAAAAGACAGAATGTTAGAGAAGGTGGAAGAAGGTTGA

Protein sequence:

>DPOGS213924-PA
MLAVSGVVMYMIGTGANIAFPGVLLQQLRQPGSVLKLTLEHESWIASILGLALISGILVAPFMMQRLGRRLSNMLSTLPSLAGWALMVAAKDPTALLISRSLQGFGMGVQAAAAPISIAEYSAPRHRGAFLATIAFSFATGMLIAHIFGTILFWRQAALACGSFYVLSLILISLSPETPPYLASVGKFEDCRKTFRWLRGSDDESEKELEVMLNSQKKKTIVSPEVSKIKYYMNIVMSPGFYKPTVIMMFMFVLFQISGMTVVPSYTVPMMNEVSGGHIESYTSMLMVDIVRFATAVLSCVVVNKFNRRTVLFFGIYVSVVSLLLTSILLYVRDFGYLPEKYKWIPVIPTLVYIFGKTIGILPIPWAIAGEIFPLAYRSLGSGISGMFLSLMFFVVVKTAPTSFRQIGVKGTFCLYGLCIALCGAFLYYLLPETKGKTLYEIECHFKGVKDTKGNVEEKDRMLEKVEEG-