Monarch geneset OGS2.0

DPOGS213409
TranscriptDPOGS213409-TA1419 bp
ProteinDPOGS213409-PA472 aa
Genomic positionDPSCF300492 - 25345-27894
RNAseq coverage1x (Rank: top 95%)
Annotation
HeliconiusHMEL0110021e-16558.94% 
BombyxBGIBMGA013651-TA5e-11347.02% 
DrosophilaCG33281-PA5e-4525.00% 
EBI UniRef50UniRef50_D4AHW93e-5228.63%Sugar transporter 4 n=1 Tax=Nilaparvata lugens RepID=D4AHW9_NILLU
NCBI RefSeqXP_001607393.11e-5227.93%PREDICTED: similar to ENSANGP00000023240 [Nasonia vitripennis]
NCBI nr blastpgi|3838566253e-5430.72%PREDICTED: facilitated trehalose transporter Tret1-like [Megachile rotundata]
NCBI nr blastxgi|3838566255e-5730.79%PREDICTED: facilitated trehalose transporter Tret1-like [Megachile rotundata]
Group
Gene OntologyGO:00550859.3e-62transmembrane transport
GO:00160219.3e-62integral to membrane
GO:00228579.3e-62transmembrane transporter activity
GO:00160202.3e-10membrane
GO:00228912.3e-10substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain[20-460] IPR0058289.3e-62General substrate transporter
[1-458] IPR0161968.4e-48Major facilitator superfamily domain, general substrate transporter
[27-37] IPR0036632.3e-10Sugar/inositol transporter
Orthology groupMCL19889 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213409-TA
ATGGTGAGCCTAGTTAAGGAAACAAAAATAGGTAACACTTACCGGCAATGGATTTTCGCCTTATTAGCAAATTTTACACTCTTTACATATGGAATGGACTGTGGATGGATTTCGCCAATGACTAAGATATTACAATCAAACGAATCTCCAACAGGACAAGCAATTACTGACAATGATTTATCTTGGATAGCCAGTTCTTTGAGTATAGCAGCAATTTTTGGAGTATCCGTCTATACGTTTATTTCGGATTATTTTGGAAGAAAGATAAGCGTTATATGTATTGCTGTTCCGCAAGCAATTTCATGGACCATAAGACTGTGCTATCCGACCACCATTACTTTGATTCTATCAAGAGTTTTATCAGGGTTATCGGCTGGTGGATGTTTTATTATAGTTCCCATGTACGTAAAAGAAATAAGTCAAGATGATATAAGAGGAGTTCTAGGAACATTGGTTATATTATTGCAAACAACAGGTTTACTTTTCATGTACATCATTGGGACATACTTGAGTTATTACACAGTCACAGTGATTACCTTAACCATCTCTATTGCTGTAACATTATTGGTACTTAAGGCTCCCGAATCACCAGCTTTTCTTGTGAAGCTAGAGAAATATGAAGAAGCAGCAGAAACGGTCGCATATCTTAGGGGTCTGGATAAAAACGACAAAATTGTCCAAAACGTAACGGATTGCATGAAATCTGAAGAAACCTTATTTAAATCATTGCCAAATATTTCGTTGGCTAGTATTTTCAAGAACAGATCCTGGCGTCGCGGGTTGTTTCTTATTACTGCAACCTTCGTGTTTCACGGCTTGAATGGATCCTATGTCATTGTAACCTACGCTTCGACTATTCTAATTTCTACTGGAGTTAAATTCGAAATCAGTCCAGAAATACAGACTTTCAGTTTTCCCATCTTTATGATTGTGGGTTCGTTGTCACTTGCAGCTATCGTAGAAAAAGTCGGGAGAAAGCCTTTGCTAATAGGTTCATTTTTAGTAACAGCTATTTGTATGGCTCTTATAGTAATAATGATGATATTACAAGAACGAGGTGTGAGTATACCATCGTGGTTGCCAGTTTTAGCCATCATTTTGGCGGTCTCAATGTATGGTGCTGGCGTATCTCCTATACCATATATCATAATGACAGAGATGTTTAGTTTTCAAATTCGAGCGAAAGCAATGGGCATGGTTGTCACTTTCGCCTGGTCGTTGACTTCTCTATTAGTTATATCATATACACCATTGAATAATTACATAGCGCCATACGCACCATTCATATTATATGCGGTTATTAATTTCTTGGGTTCCATTTTCACCTTACTGTTCATCCCAGAGACGAGAGCAAAGACCGAAGAACAAATCAATGCTATTTTAGAAACTGGAATAATTTCCAAAAATAGTCATAAATGA

Protein sequence:

>DPOGS213409-PA
MVSLVKETKIGNTYRQWIFALLANFTLFTYGMDCGWISPMTKILQSNESPTGQAITDNDLSWIASSLSIAAIFGVSVYTFISDYFGRKISVICIAVPQAISWTIRLCYPTTITLILSRVLSGLSAGGCFIIVPMYVKEISQDDIRGVLGTLVILLQTTGLLFMYIIGTYLSYYTVTVITLTISIAVTLLVLKAPESPAFLVKLEKYEEAAETVAYLRGLDKNDKIVQNVTDCMKSEETLFKSLPNISLASIFKNRSWRRGLFLITATFVFHGLNGSYVIVTYASTILISTGVKFEISPEIQTFSFPIFMIVGSLSLAAIVEKVGRKPLLIGSFLVTAICMALIVIMMILQERGVSIPSWLPVLAIILAVSMYGAGVSPIPYIIMTEMFSFQIRAKAMGMVVTFAWSLTSLLVISYTPLNNYIAPYAPFILYAVINFLGSIFTLLFIPETRAKTEEQINAILETGIISKNSHK-