Monarch geneset OGS2.0

DPOGS209580
Transcript         DPOGS209580-TA  1521 bp
Protein            DPOGS209580-PA  506 aa
Genomic position   DPSCF300015 - 915434-920624
RNAseq coverage    316x (Rank: top 36%)
Annotation
Heliconius         HMEL017037        0.0     70.39%
Bombyx             BGIBMGA006642-TA  1e-159  57.68%
Drosophila         CG7882-PA         2e-110  41.61%
EBI UniRef50       UniRef50_D6X199   4e-124  49.07%  Putative uncharacterized protein n=4 Tax=Tribolium castaneum RepID=D6X199_TRICA
NCBI RefSeq        XP_972450.2       1e-124  49.07%  PREDICTED: similar to glucose transporter (sugar transporter [Tribolium castaneum]
NCBI nr blastp     gi|270014149      1e-123  49.07%  hypothetical protein TcasGA2_TC012857 [Tribolium castaneum]
NCBI nr blastx     gi|189241434      4e-128  49.27%  PREDICTED: similar to glucose transporter (sugar transporter [Tribolium castaneum]
Group
Gene Ontology      GO:0055085  5.5e-95  transmembrane transport
                   GO:0016021  5.5e-95  integral to membrane
                   GO:0022857  5.5e-95  transmembrane transporter activity
                   GO:0016020  1.7e-89  membrane
                   GO:0022891  1.7e-89  substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain    [38-487]  IPR005828  5.5e-95  General substrate transporter
                   [75-481]  IPR003663  1.7e-89  Sugar/inositol transporter
                   [88-481]  IPR016196  1.2e-46  Major facilitator superfamily domain, general substrate transporter
Orthology group    MCL16474  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209580-TA
ATGAAGGTCACGGATATCTTCAATAAACTATTCGTAACCAACGAGCACCAGCACTTGACAAGTTCTAACGAGTACAAAGCGGGATGGTCGTTTTACTTGGTCCTGGCCGGAGTGGTGACCACTCTCGGCTCCTCCCTGCCAGTCGGCTACAACATCGGAGTCGTCAACACGCCGGCTGAGATATTAAAGAATTTCTGCAATGAAAGTTTCATCAGCCGCTATGACCTGCCGCTGGACAATAGATGGCTGAACGTGCTGTGGTCGGCCGTCGTGTCTACGTTCATCGTGGGCGGCTGCATCGGCTCGTTACTCGGCTCGGTGCTGGCTGACAAACTCGGCCGTAAGAAGGCGACAATATCAACCAATATCCTGTTCATCTCCGGCGGCGTTCTGTTTGTGTCGTGTCGAGCCGCCGACTCGGTTGAGATGATGATTGCCGGGAGATTCCTGGTGGGGCTGGCGGGAGGTTTAACAACTAGTATAGTTCCTATGTACCTGGCGGAGCTCGCGCCTCCCTCGCTCACCGGCGCTATGGGGGTCGCCTGCCCCATGGGCGTCAACGTTGGGGTTCTGGTGGGGCAGGTCATGGGGCTGGACTCTCTGCTAGGCGGTCCGGACAACTGGCCATACCTGCTGTCGTGTTTCATGCTGCTAGTCGTTTTTAGTTTACCCGTGCTGTGCGTATTGCCGGAGAGTCCCAAGTACCTGTTTGTGGTGAAGAGAGATGAGGTCGCCGCTCTTAATGAGTTGAGTCGCGTGCGTGGTGAGAGTGTGTCTCGTGTGTCGTGCGAGCTGGAGTCTCTCAGGTTGGAGTCGTCGTCCAGGGCGGGGTCCCTCTTGGCCGCCCTGAGAGACCCGGCCCTCCGCCTGCCGCTGCTGCTGGTGTGTGCAGCACAGGCCGGACAGCAGACCAGTGGCATTAACGCAGTGTTCTACTACTCGCAGAGTATCTTCCGCCAGGCCGGGCTGTCCGAGCACAGTTCGCAGCTGGCGAGTATCTCGTGTGGCGCCATCAACGTGTGTACCGCGGTGGTGGTGTCCCGTGTGGTGAGTCACTCCGGCCGCCGTCCCCTACTGCTGGGATCCACACTTGCCGCCACTCTATCTCTGGCAGCGCTCGGATTCGCCAAAATCTATAGCGATGCTGTATCTTGGATGCCGTTCGTGTGCATGTCTGCAGTTCTCGTGTATGTTCTGGTGTACGGCCTCGGTCTCGGTCCCATACCGTATTTCATAGCTTCAGAGTTGTTCGAGGTGTCGTCTCGGTCGGCGGGCATGTCGTGGGGTTCCCTGGCCAATTGGGGCGGGAACTTCGTGGTGGGGATGACCTTCCCAACCATGCGGGACACCCTCGGCGCCTACTGCTTCCTCGTGTTCGCCGGCATTACCGGAACACTCTTCGTTTTCCTTAAATTATACTTCCCGGAGACGCGAGGCAAAACGCCGGCTCAGGTGGCAGAGCTGTGTAGTCACGGGTTCAAGTCCCGGCCGGTGAGGGCGTCGCGACTTGACATGTCTTGA

Protein sequence:

>DPOGS209580-PA
MKVTDIFNKLFVTNEHQHLTSSNEYKAGWSFYLVLAGVVTTLGSSLPVGYNIGVVNTPAEILKNFCNESFISRYDLPLDNRWLNVLWSAVVSTFIVGGCIGSLLGSVLADKLGRKKATISTNILFISGGVLFVSCRAADSVEMMIAGRFLVGLAGGLTTSIVPMYLAELAPPSLTGAMGVACPMGVNVGVLVGQVMGLDSLLGGPDNWPYLLSCFMLLVVFSLPVLCVLPESPKYLFVVKRDEVAALNELSRVRGESVSRVSCELESLRLESSSRAGSLLAALRDPALRLPLLLVCAAQAGQQTSGINAVFYYSQSIFRQAGLSEHSSQLASISCGAINVCTAVVVSRVVSHSGRRPLLLGSTLAATLSLAALGFAKIYSDAVSWMPFVCMSAVLVYVLVYGLGLGPIPYFIASELFEVSSRSAGMSWGSLANWGGNFVVGMTFPTMRDTLGAYCFLVFAGITGTLFVFLKLYFPETRGKTPAQVAELCSHGFKSRPVRASRLDMS-