Monarch geneset OGS2.0

DPOGS211981
Transcript: DPOGS211981-TA, 1092 bp
Protein: DPOGS211981-PA, 363 aa
Genomic position: DPSCF300011 + 1372375-1378069
RNAseq coverage: 13x (Rank: top 83%)
Annotation:
  Heliconius      HMEL007510        8e-60  40.00%
  Bombyx          BGIBMGA013654-TA  5e-79  40.62%
  Drosophila      CG10960-PB        1e-25  24.94%
  EBI UniRef50    UniRef50_E2B5H9   8e-27  26.11%  Sugar transporter ERD6-like 7 n=4 Tax=Formicidae RepID=E2B5H9_HARSA
  NCBI RefSeq     XP_001866055.1    6e-30  25.37%  solute carrier family 2 [Culex quinquefasciatus]
  NCBI nr blastp  gi|291461583      2e-29  27.70%  sugar transporter 12 [Nilaparvata lugens]
  NCBI nr blastx  gi|157138241      5e-33  26.41%  sugar transporter [Aedes aegypti]
Group:
Gene Ontology:
  GO:0055085  1e-34  transmembrane transport
  GO:0016021  1e-34  integral to membrane
  GO:0022857  1e-34  transmembrane transporter activity
KEGG pathway:
InterPro domain:
  [2-359]    IPR016196  1.4e-35  Major facilitator superfamily domain, general substrate transporter
  [108-354]  IPR005828  1e-34    General substrate transporter
Orthology group:

Nucleotide sequence:

>DPOGS211981-TA
ATGGAAATATCACTTCTTGGAAGTTTATCAAGCATTGCAGCCGTAGTAGGAACTCCTCTCACTGGGGTCTTATTGGATCTATTAGGAAGAAAATACTGCTGTATTTTATTTTCTCTGGCTCCAGTGATATCGTGGGCGATCCTGTCGTTCACACGTCGAGTGGAGGTCGTCCTGCTGTCGGTGTTCCTGTCAGGCCTGGTCGGAACGTCGTTCATGATAGTCCCCAACTTCATCAATGAGATCACTCAGGACAGCATACGGGGCTCCCTCACTTCCACAGAGGCAGTCGAAGCAATATGTTTTTATAGATCTGAAAAACCGAACTCTAAAATAGTGCTGCAAGAAATTGATAATCTAAGAAAAATTTTAAGTCCACAGTTCAACAACACCCAACCCGAAGAAGAGACGTTGAAACCCGAGAGTGCAAAAAAGCTCACAAAATGGCAATTCATTCGAAAACATAAACCCACGGTCCGTGGATTTTTCATCGCTCTTACGTTGCTGAGCCTTTCAATATTTCAAGGAGTAATCGTAGTACAAGTCTACGCACAGCTTTTATTCGAGGATACGTTACCTAATATGTCAGCTACTTGGAGCAGTATTGCGTTCGCTCTCGTAAATGTACTCTCTGGATTGGTCGCAGCATACCTGCTCGACGTTTTCGGCAGACGACCTATCATGATCTATTCGTCGTTGTTGACGTGTTTGTTCTGTACCTTGCTGGGTTCTCACCTCCAACTCCACTGGGCCCCGTCCTGGCTCGCCGCAGCACTCATTTTTCTATATTGCATGACTGTCACCTCAGGAGCCAATATCGTGCCTTTCGTTATTGTCGCTGAGATCTTCCTGCCAGAGGTGCGCGGCGTAATGAGCATGTTGGTCATCGAGTCTGGATGGATTTGGAATTTTTTAATGTTAGTCATCTTTAACCCTTTCATGGCTGCGGTCGGAATGGGCCCCGTATTTTATATTTTTGCGTTGATTAGTTTATTGACTGCTATATTTACGTCGATTTATCTTCCTGAGACGAAGGGCCTGACAGTGCAGGACATACAAACACTACTAGAAAGAAGACAACGGAGCTTACACTAA

Protein sequence:

>DPOGS211981-PA
MEISLLGSLSSIAAVVGTPLTGVLLDLLGRKYCCILFSLAPVISWAILSFTRRVEVVLLSVFLSGLVGTSFMIVPNFINEITQDSIRGSLTSTEAVEAICFYRSEKPNSKIVLQEIDNLRKILSPQFNNTQPEEETLKPESAKKLTKWQFIRKHKPTVRGFFIALTLLSLSIFQGVIVVQVYAQLLFEDTLPNMSATWSSIAFALVNVLSGLVAAYLLDVFGRRPIMIYSSLLTCLFCTLLGSHLQLHWAPSWLAAALIFLYCMTVTSGANIVPFVIVAEIFLPEVRGVMSMLVIESGWIWNFLMLVIFNPFMAAVGMGPVFYIFALISLLTAIFTSIYLPETKGLTVQDIQTLLERRQRSLH-
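
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code applies and that the trailing "-" in the protein record marks the stop codon (the transcript ends in TAA). The names `CODON_TABLE` and `translate` are hypothetical helpers for illustration and are not part of the database.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(nt, stop_symbol="-"):
    """Translate a coding sequence in frame 0, writing stops as stop_symbol."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(nt), 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 21 and last 15 nucleotides of DPOGS211981-TA, copied from the record.
print(translate("ATGGAAATATCACTTCTTGGA"))  # MEISLLG
print(translate("CGGAGCTTACACTAA"))        # RSLH-

# Length check: 363 residues plus one stop codon account for 1092 bp.
print(3 * (363 + 1) == 1092)               # True
```

Passing the full nucleotide sequence to `translate` in place of the excerpts gives a string that can be compared directly with the DPOGS211981-PA record.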