Monarch geneset OGS2.0

DPOGS203833
TranscriptDPOGS203833-TA1692 bp
ProteinDPOGS203833-PA563 aa
Genomic positionDPSCF300010 + 2559027-2564208
RNAseq coverage91x (Rank: top 63%)
Annotation
HeliconiusHMEL0069540.087.62% 
BombyxBGIBMGA003740-TA0.072.44% 
DrosophilaTret1-2-PA3e-15663.72% 
EBI UniRef50UniRef50_UPI00021A66EB3e-15462.00%UPI00021A66EB related cluster n=2 Tax=unknown RepID=UPI00021A66EB
NCBI RefSeqXP_001846280.14e-16064.83%sugar transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700368627e-15964.83%sugar transporter [Culex quinquefasciatus]
NCBI nr blastxgi|1954363021e-15864.92%GK22112 [Drosophila willistoni]
Group
Gene OntologyGO:00550852.2e-88transmembrane transport
GO:00160212.2e-88integral to membrane
GO:00228572.2e-88transmembrane transporter activity
GO:00160201.3e-86membrane
GO:00228911.3e-86substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain[145-538] IPR0058282.2e-88General substrate transporter
[146-536] IPR0036631.3e-86Sugar/inositol transporter
[145-537] IPR0161961.8e-57Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL10651 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203833-TA
ATGAAGATTTTAATGAGAGCCGATACTCATTACAGCATAGTTGTAAGTGGCAGTGAATATGTTAAGCCGAAATATACATTTTCTCAGGTGTTAGCTGCCGTCGCAGTGTCTATGGGTTCCATGGTCGTTGGTTATTCCACTGCCTACACGTCACCTGCCCTCGTCACCATGGAAAATAGCACAACTATATCCGTCACTGAGGAACAAGCAAGTTGGGTTGGTGGATTAATGCCATTAGCGGCTTTAGCCGGTGGTGTGCTTGGCGGTCCATTGGTTGACTATATTGGCCGGCGAAAAACTATACTACTCACAGCTGTACCCTTCTTCGTCGGTTGGATTTTAATAGCTACTGCAAGAATTGTTCATTTGGTACTTATAGGACGTGCTATATGTGGATTATGTGTCGGTATTGGATCACTAGCCTTTCCGGCAAGTTGGGTTGGTGGATTAATGCCATTAGCGGCTTTAGCCGGTGGTGTGCTTGGCGGTCCATTGGTTGACTATATTGGCCGGCGAAAAACTATACTACTCACAGCTGTACCCTTCTTCGTCGGTTGGATTTTAATAGCTACTGCAAGAATTGTTCATTTGGTACTTATAGGACGTGCTATATGTGGATTATGTGTCGGTATTGGATCACTAGCCTTTCCGGTATACCTCGGAGAAACAATTCAACCCGAAGTGAGAGGCACTTTAGGTCTGTTTCCTACAGCAATTGGCAATATTGGTATTTTAATTTGCTACATTGCTGGAAAATACCTTGATTGGTCACAATTAGCATACCTAGGGGCGTCGCTGCCAATTCCATTCCTTATTCTTATGTTTATGATCCCAGAAACTCCGCGATGGTACATGTTACGAGGAAGAAATGAAGAAGCTCGTAAAGCTCTTCAATGGTTGAGGGGCAAAAATACTAAAATAGATAATGAAATGCGTGATATAGCTCTTTCAGACGCTGAAGTTGATAGCGATTTGAAATTTAAAGACATTTTAAAAATGAAATATTTGAAATCTATATTGATAGCTCTGGGTCTCATGCTTTTCCAACAGCTCTCCGGGATTAACGCCGTGATATTCTACACAGTCAAAATATTCAACATGTCAGGCAGTTCCGTAGATGGTAATTTATCAACAATTATTGTCGGATTAGTCAATTTCATCTCAACATTCGTAGCAACTGCTCTCATAGACAGAACAGGACGCAAAATACTACTTTACATTTCTTCGGTAACAATGACCGTGACGCTCATAGTGCTAGGAACGTTCTTTTACGTTCGAGACACATTACATATGAATGTCACCAACTTAGGTTGGCTTCCGCTGACAAGTGTGATGTTTTATTTACTTGGATTTTCTTTGGCTTTCGGGCCGATACCTTGGTTAATGATGGGGGAAATTTTACCAGCAAAAATTAGAGGTGGAGCCGCGTCTATGATTACTGCTTTTAATTGGTTATGCACTTTTGCTGTTACAAAAACATTCCACAATATCTTAGTAGCCATCGGGCCAGCTGGTACGTTTTGGTTATTTGGTTGCATTTGTTTTGTTGGACTATTCTTTGTTATAGTATTTGTGCCCGAGACCAGGGGTAAAAGCCTTGAACAAATAGAGAATAAAATGACAGGAACCAAAGCGAGGTCACGTAGGATGAGTTCAATTGCGAACATAAAACCCCTACCAAATGGATGCTAG

Protein sequence:

>DPOGS203833-PA
MKILMRADTHYSIVVSGSEYVKPKYTFSQVLAAVAVSMGSMVVGYSTAYTSPALVTMENSTTISVTEEQASWVGGLMPLAALAGGVLGGPLVDYIGRRKTILLTAVPFFVGWILIATARIVHLVLIGRAICGLCVGIGSLAFPASWVGGLMPLAALAGGVLGGPLVDYIGRRKTILLTAVPFFVGWILIATARIVHLVLIGRAICGLCVGIGSLAFPVYLGETIQPEVRGTLGLFPTAIGNIGILICYIAGKYLDWSQLAYLGASLPIPFLILMFMIPETPRWYMLRGRNEEARKALQWLRGKNTKIDNEMRDIALSDAEVDSDLKFKDILKMKYLKSILIALGLMLFQQLSGINAVIFYTVKIFNMSGSSVDGNLSTIIVGLVNFISTFVATALIDRTGRKILLYISSVTMTVTLIVLGTFFYVRDTLHMNVTNLGWLPLTSVMFYLLGFSLAFGPIPWLMMGEILPAKIRGGAASMITAFNWLCTFAVTKTFHNILVAIGPAGTFWLFGCICFVGLFFVIVFVPETRGKSLEQIENKMTGTKARSRRMSSIANIKPLPNGC-