Monarch geneset OGS2.0

DPOGS213915
TranscriptDPOGS213915-TA1344 bp
ProteinDPOGS213915-PA447 aa
Genomic positionDPSCF300218 + 59861-62740
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0155610.074.00% 
BombyxBGIBMGA004623-TA0.074.22% 
DrosophilaCG4797-PB2e-3226.68% 
EBI UniRef50UniRef50_E0X9062e-6632.61%Sugar transporter protein 3 n=2 Tax=Obtectomera RepID=E0X906_BOMMO
NCBI RefSeqNP_001182631.13e-6732.61%sugar transporter protein 3 [Bombyx mori]
NCBI nr blastpgi|3076119296e-6632.61%sugar transporter protein 3 [Bombyx mori]
NCBI nr blastxgi|3076119291e-6732.40%sugar transporter protein 3 [Bombyx mori]
Group
Gene OntologyGO:00550855.9e-32transmembrane transport
GO:00160215.9e-32integral to membrane
GO:00228575.9e-32transmembrane transporter activity
GO:00160201.6e-07membrane
GO:00228911.6e-07substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain[1-429] IPR0161964.5e-47Major facilitator superfamily domain, general substrate transporter
[45-423] IPR0058285.9e-32General substrate transporter
[104-123] IPR0036631.6e-07Sugar/inositol transporter
Orthology groupMCL25041 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213915-TA
ATGAAGCCTTTCATCAAACAGGCATTCGTAGTATCAGGCGCGGCCTTGAACATAGTTGGCCACGGATGCGCCCACGGCTATCCCGCGGTGCTATTCTCTCAGATCAAGAGTGATGGAGGTCCAGTCACCCTCACGGATCACGACATGTCCTGGATTGCATCAGCGGTTGGTGTGATGGGCATATTAGGTAATTTCATATCACCGATCTTTATGACGAGGTACGGCAGACAGAAGGCTCACCTCATTTGTACGGTGCCGGCCTTGCTTGGCTGGGTGGTCTTCGTCCTAGGCAATTCCGTGCCTTTGTTCCTATTCGCTAGAATCCTACACGGACTTGCCTTAGGTCTCCGGACCCCGTTAGCAGCTATACTGGTAGCGGAATACACGGAACCAAGATATAGAGGGGCTTTTCTAGGCACATTCGCTATATCTCTAGGTTTGGGTATTCTTCTTGCACATCTATGGGGGTCTTATATGTCGTGGAAAATGACGGCAGTCGTGTGTTCAGTGTTCCCTATAATAGCTATGGCGATCATAAGTCTATCACCGGAATCCCCGAGTTGGAAAAAAATTAAATTGAACACTCTATTCGGGAGCGTCTTCGATTCTATAAAAGCAGCCATGAGAGTGTTCAAAAAGAAGGAGTTTTACAAGCCGTTAGTCATAGCTATATGTATGCTGATAGTGTTTGAGTTCGGTGGGGCGCATATGGTGCCGGCCTATGGGAATTTGATATTACAGTCAGTATTAGACAAAGACGATCCGAAAGATGTAGCTTGGCAGTTCACCGTCATGGACTTCCTGAGGACTATCTGCGCTCTGCTAGCTATATTTCTATTAAAGAACGTTAAACGCAGAACCATTCTATTCACCAGTGGAGTTTTCACCGTGTTATCTCTAACATTGATATCAGTTTTTATATATTTAAGGAAATATGAGATTCTCACCCACAGTTGGTTATTGGACACTGTTCCTATGATTTTGATGATATTTTACACTGTGTCCTTCTGTTTGGGACTTGTTCCGTTGAATTGGGTGATATGCGGGGAAGTATTCCCGTTGACGTATAGGAGTCTAGGTTCGACATTGTCCACTTCCTTTTTGACGCCAGCTTTCGTGGTGTCAATGAAGACAGCTCCTCATTTCTATTCATCTATAGGAGTTGAAGGAGCTTTTCTGGTTTATTCGGGAACTTTGACGGTGTGTTTGTTGATAATGTATGCGATTTTACCGGAAACAAAGGATAGGACACTGCAAGATATAGAAGATAGCTTTAAGGGTAGAAAGCAATTGGATGTGGAAGTCCAGCTTAGTTTGATGGGGAAAGATAATATAGTGAAGTAA

Protein sequence:

>DPOGS213915-PA
MKPFIKQAFVVSGAALNIVGHGCAHGYPAVLFSQIKSDGGPVTLTDHDMSWIASAVGVMGILGNFISPIFMTRYGRQKAHLICTVPALLGWVVFVLGNSVPLFLFARILHGLALGLRTPLAAILVAEYTEPRYRGAFLGTFAISLGLGILLAHLWGSYMSWKMTAVVCSVFPIIAMAIISLSPESPSWKKIKLNTLFGSVFDSIKAAMRVFKKKEFYKPLVIAICMLIVFEFGGAHMVPAYGNLILQSVLDKDDPKDVAWQFTVMDFLRTICALLAIFLLKNVKRRTILFTSGVFTVLSLTLISVFIYLRKYEILTHSWLLDTVPMILMIFYTVSFCLGLVPLNWVICGEVFPLTYRSLGSTLSTSFLTPAFVVSMKTAPHFYSSIGVEGAFLVYSGTLTVCLLIMYAILPETKDRTLQDIEDSFKGRKQLDVEVQLSLMGKDNIVK-