Monarch geneset OGS2.0

DPOGS206262
Transcript: DPOGS206262-TA  2232 bp
Protein: DPOGS206262-PA  743 aa
Genomic position: DPSCF300290 - 341237-360793
RNAseq coverage: 1588x (Rank: top 8%)
Annotation
  Heliconius: HMEL013123  1e-171  68.23%
  Bombyx: BGIBMGA010742-TA  2e-171  60.80%
  Drosophila: CG1213-PC  4e-99  41.75%
  EBI UniRef50: UniRef50_UPI00015B44CF  1e-110  44.35%  UPI00015B44CF related cluster n=1 Tax=unknown RepID=UPI00015B44CF
  NCBI RefSeq: XP_309669.1  4e-116  46.09%  AGAP003493-PB [Anopheles gambiae str. PEST]
  NCBI nr blastp: gi|31201439  8e-115  46.09%  AGAP003493-PC [Anopheles gambiae str. PEST]
  NCBI nr blastx: gi|157674461  3e-115  45.34%  putative sugar transporter [Lutzomyia longipalpis]
Group:
Gene Ontology:
  GO:0055085  4.6e-83  transmembrane transport
  GO:0016021  4.6e-83  integral to membrane
  GO:0022857  4.6e-83  transmembrane transporter activity
  GO:0016020  2.7e-79  membrane
  GO:0022891  2.7e-79  substrate-specific transmembrane transporter activity
KEGG pathway:
InterPro domain:
  [45-468]  IPR005828  4.6e-83  General substrate transporter
  [26-465]  IPR003663  2.7e-79  Sugar/inositol transporter
  [1-477]  IPR016196  1.8e-58  Major facilitator superfamily domain, general substrate transporter
Orthology group: MCL10224  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206262-TA
ATGGATAGAAGTTATTCGTACGACCCGGTGTCTACCGGCGCAGTCAACCAGCAGGCGCACAGGATGGAGGCAGGGCAGCTGTGGCGGCAGTACATCATCGCGGGCATCGCGAACATAGCGATCGCCTCCACGGGTTATTCCATGGGATGGACGTCACCAATCAATGGGAAGTTATCGGACAATACAACAAATATCCTTGACAAACCGGCAACAGCTGACGAGCTAGCGTGGATGGGTTCGGTGCTCAACATTGGAGCTATATTAGGTCCATTCGTGGGTGGTTACTTGGCTGGCAGGATTGGCAGGAAGTGGGGTCTCCTGAGTTCAGCCGTACCTTTACTTCTCGGATGGATTCTCGTGGCAACTGTGGAGAACATGGCGTTCCTGTACGCAGCCAGAATCTTTTGGGGCGTCGGTGTGGGGATGCTCTTTACCATATCGCCGATGTATTGCGCTGAAATCGCTACGAACGAATCTCGAGGTGCCCTGGGATCGTTTCTCCAGTTGTTCATAACCCTTGGTTACATCCTGGTCTACGGTATCGGACCCTCCACAACGTACATGAATGTAGCCTACGTCGGTATAGCCTTCGTGGCAGTATTCGCTGTAGGATTCTTCTTCATGCCGGAAACGCCAACATACCACCTCCTTAAAGGCGATCGCGAGGCGGCTGCGTCGTGTCTCAGCACCATCCGCGGCCGTTCCCGAGCTGGTGTCGAAGCCGAGCTCAGTCTCATCGAGACTGACGTTAAGGCTTCAATGGAAAAAACAGCGACGGTAATGGACGTGTTCCAAGGCAGTAACTTCAAGGCGTTCTATATATCCTGCGCTTTGGTGTTCTTCCAACAGTTCAGTGGCATCAACGCTGTTCTGTTCTACATGACGGACATCTTCGAGTCCTCTGGCAGCGACCTCCAGCCTGCCATCGCAACCATTATAATCGGAGCCGTTCAGGTGGTAGCGTCTTGTATCACTCCCGTGGTGGTAGACCGTCTCGGCCGCCGTCTGCTGTTAATGGTGTCAGCCTGCGGTACAGCGATCGGAGCCATTCTGCTCGGAATGTTTTTCCTGCTGAAGCACAATGAGAGCGAAGTGGTAGCGTCGATCAGCTTTCTTCCTATATTGTCTCTGGTGCTGTTCATTGTGACGTACTGTTGGGGTCTCGGTCCCCTGCCCTGGGCGGTGATGAGCGAGTTATTCCCAATAGAAGTTAAGGCTGCAGCCTCACCGATAGCTACAGCGTTTTGCTGGCTATTGTCCTTCCTGATTACCAAATTCTTCCCGTCCCTGGACCGTCACGTTGGCTTCCTCGTGTTCGGTGGGTGTTGTGTCGTATCTTTAGTCTTCTCACTGCTAGTCATTCCAGAGACCAAAGGAAAGAGCTTCTCTGAGATACAAATGATGCTGTCCGGGAAGAAAAAGGAAGAAAAGACAAAAGATAATGCTATGAAGCGGAACTTCAGGGATAAGGCGAGGTTAAAGGCACTCACTCAAAATATCTACACAATTGTAATTCACAACTTGGGCTTGAAATATCTTGCCAACTTCCACGCCTCCTGTTGTCACCTGGTGAAACGAGACAAACACATATTTTACAATACTAAAGTCACCAATATCGACCTTAGATTTACTACCCTGAAGTGGAATTGGGCCGGTTATAACCCCAGATATTTAGGTGAAGAGGAGCCAGGAGACTGCGTTTTGGTCAACGTAGTTCAAGATGTCTTGCAGACTTATTCGATCGCGAATTATGGAGGCTGGGAAATTATGGCGACAATATATCATCGCGGGCATAAATTCTTCCCGTCCCTGGACCGTCACGTCGGCTTCCTCATGTTCGGTGGGTGTTGTGTCGCGGCGTTCGCCTTCTCGCTGCTGGTTGTTCCAGAGACCAAAGGAAAGAGCTTCTCTGAAATACAAGACATGCTGGCAGGGAACTTCAGGGATAAGGCGAGGTTAAAGGCACTCACTCAAAATATCTACACAATTGTAATTCACAA
CTTGGGCTTGAAATATCTTGCCAACTTCCACGCCTCCTGTTGTCACCTGGTGAAACGAGACAAACACATATTTTACAATACTAAAGTCACCAATATCGACCTTAGATTTACTACCCTGAAGTGGAATTGGGCCGGTTATAACCCCAGATATTTAGGTGAAGAGGAGCCAGGAGACTGCGTTTTGGTCAACGTAGTTCAAGATGTCTTGCAGTACGATGAATCGAGCCGTTGA

Protein sequence:

>DPOGS206262-PA
MDRSYSYDPVSTGAVNQQAHRMEAGQLWRQYIIAGIANIAIASTGYSMGWTSPINGKLSDNTTNILDKPATADELAWMGSVLNIGAILGPFVGGYLAGRIGRKWGLLSSAVPLLLGWILVATVENMAFLYAARIFWGVGVGMLFTISPMYCAEIATNESRGALGSFLQLFITLGYILVYGIGPSTTYMNVAYVGIAFVAVFAVGFFFMPETPTYHLLKGDREAAASCLSTIRGRSRAGVEAELSLIETDVKASMEKTATVMDVFQGSNFKAFYISCALVFFQQFSGINAVLFYMTDIFESSGSDLQPAIATIIIGAVQVVASCITPVVVDRLGRRLLLMVSACGTAIGAILLGMFFLLKHNESEVVASISFLPILSLVLFIVTYCWGLGPLPWAVMSELFPIEVKAAASPIATAFCWLLSFLITKFFPSLDRHVGFLVFGGCCVVSLVFSLLVIPETKGKSFSEIQMMLSGKKKEEKTKDNAMKRNFRDKARLKALTQNIYTIVIHNLGLKYLANFHASCCHLVKRDKHIFYNTKVTNIDLRFTTLKWNWAGYNPRYLGEEEPGDCVLVNVVQDVLQTYSIANYGGWEIMATIYHRGHKFFPSLDRHVGFLMFGGCCVAAFAFSLLVVPETKGKSFSEIQDMLAGNFRDKARLKALTQNIYTIVIHNLGLKYLANFHASCCHLVKRDKHIFYNTKVTNIDLRFTTLKWNWAGYNPRYLGEEEPGDCVLVNVVQDVLQYDESSR-
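
The stated lengths (2232 bp transcript, 743 aa protein) are consistent with each other: 743 codons plus one stop codon is 744 codons, or 2232 nucleotides. The sketch below shows one way to check this and to translate a stretch of the transcript. It assumes the standard genetic code and writes the stop codon as "-", matching the protein record above. The function names `translate` and `expected_transcript_length` are illustrative and are not part of any MonarchBase tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: this transcript uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str, stop: str = "-") -> str:
    """Translate a nucleotide string in frame 0; trailing partial codons are ignored."""
    nt = nt.upper().replace("\n", "")
    out = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")  # X marks an unrecognized codon
        out.append(stop if aa == "*" else aa)
    return "".join(out)


def expected_transcript_length(protein_aa: int) -> int:
    """Coding length in nucleotides for a protein of the given length, plus one stop codon."""
    return (protein_aa + 1) * 3


# First nine codons and last eight codons of DPOGS206262-TA, copied from the record.
print(translate("ATGGATAGAAGTTATTCGTACGACCCG"))   # MDRSYSYDP
print(translate("CAGTACGATGAATCGAGCCGTTGA"))      # QYDESSR-
print(expected_transcript_length(743))            # 2232
```

Both translated fragments match the start and end of DPOGS206262-PA as listed, which suggests the transcript is given in frame from the start codon to the stop codon.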