Monarch geneset OGS2.0

DPOGS212674
TranscriptDPOGS212674-TA1296 bp
ProteinDPOGS212674-PA431 aa
Genomic positionDPSCF300198 + 185560-192039
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0075103e-17168.28% 
BombyxBGIBMGA014054-TA3e-11950.47% 
DrosophilaCG1208-PC2e-2926.94% 
EBI UniRef50UniRef50_Q173J53e-3828.21%Sugar transporter n=4 Tax=Culicidae RepID=Q173J5_AEDAE
NCBI RefSeqXP_001850640.11e-4127.98%sugar transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700461502e-4027.98%sugar transporter [Culex quinquefasciatus]
NCBI nr blastxgi|1700461503e-4228.31%sugar transporter [Culex quinquefasciatus]
Group
Gene OntologyGO:00550852.5e-48transmembrane transport
GO:00160212.5e-48integral to membrane
GO:00228572.5e-48transmembrane transporter activity
KEGG pathway 
InterPro domain[2-419] IPR0058282.5e-48General substrate transporter
[1-417] IPR0161969.1e-48Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL26167 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212674-TA
ATGTCCCTGTTGGGAAGTATCGTCAATATTGGTGGACTTATAGCGACACCGCTTTGTGGATATGCTGTTGATAAAATAGGAAGGAAGTACTCGGCTATGCTTTTTGGATTGCCCTTTGTGATATCGTGGGCCCTTATTGCTGCAACTACATCATTCTACACTGTGTTGTTTGCTGTTGGTTTATCTGGGTTTGGCGCCGCAGGTCAAGCTGTGTCTACCGTTTATATATCAGAAATAGCTCAAGATTCAATAAGAGGAGCTTTAACTTCTTCAACAGTCTATGGATTTTTCTTTGGCCTCTTGATGTCCTATACTTTTGGGGGATATATGTCTTATTATAGTGTTTTGTACACACATCTAGCTCTCTCAGTTTTGTACATCTTAATGTTGATACCTTTAAAGGAATCTCCGTGCTATCTTCTCATGCTTGAGAAGGAGAAGGAGGCAGCGGAGTCAATAGCTTTTTATCAAAGAGTGGACGTGTCGTCTAAAGAAGTTGAATTAGAAATTCAGAAGATTAAACTCCAATTAGGTTCTAAAGAAGATAAAATACTCAAATCTGACGCAGATACTCAAGAAGCAGAAGATCTTTTAAAAAAGACTTCGGTTGAATCAAATGAAAAGAAAGAGACAGCGTGGCAGTTTTTAAAGAGATCGCGATCATCCCAAAGGGCTTTAATTGCTGTCTTTACAGTGATGTCTTTGACAATACTGATGGGTTCAATAGTCCTCCAAGTTTATGCTGAACCGTTATTTAAAGAGGCAGTTCCCACTATGCATCCAAATACTTGCTCCATTTTGATGGCAGTCACTTACTTGACAGCTGCCCTATTATGTGCAAGTATGTTAGACAAATTTGGAAGAAAGGCACTTTTAACGGTGACAAGTATATTAACAGGGATATCAAATATAATACTTGGGACCCAACTGAATTTACACTGGGCGCCACATTGGTTCACCGCCTTCATTATCTATGGCTCAAGTTTTGTGTACAATCTTGGTGCCGCCATTGTACCTTTTGTGCTAACTGCAGAAGTGTTTCTGCCGCAGGTACGCGGCCTTGGAAATAGCGTAGCGATGGCAACAATGTGGATTATGAATTGGGTTACTCTCATCATTTTTAACCCTATAGTGGAGTGGTGGGGTCTTGGTTCTGCGTTTTATTTCTTCTCTTTTATGTGTTTCCTCTCAGCAGCCTACGGCCAATTCTGCTTACCAGAAACCAAGGGTTTGTCAGCTGATGAGATACAGTTATTGTTTTTGAAAGAAAAAAGAAATGACACACAAAAAGTGTAG

Protein sequence:

>DPOGS212674-PA
MSLLGSIVNIGGLIATPLCGYAVDKIGRKYSAMLFGLPFVISWALIAATTSFYTVLFAVGLSGFGAAGQAVSTVYISEIAQDSIRGALTSSTVYGFFFGLLMSYTFGGYMSYYSVLYTHLALSVLYILMLIPLKESPCYLLMLEKEKEAAESIAFYQRVDVSSKEVELEIQKIKLQLGSKEDKILKSDADTQEAEDLLKKTSVESNEKKETAWQFLKRSRSSQRALIAVFTVMSLTILMGSIVLQVYAEPLFKEAVPTMHPNTCSILMAVTYLTAALLCASMLDKFGRKALLTVTSILTGISNIILGTQLNLHWAPHWFTAFIIYGSSFVYNLGAAIVPFVLTAEVFLPQVRGLGNSVAMATMWIMNWVTLIIFNPIVEWWGLGSAFYFFSFMCFLSAAYGQFCLPETKGLSADEIQLLFLKEKRNDTQKV-