Monarch geneset OGS2.0

DPOGS206261
TranscriptDPOGS206261-TA1503 bp
ProteinDPOGS206261-PA500 aa
Genomic positionDPSCF300290 - 375676-381131
RNAseq coverage1162x (Rank: top 11%)
Annotation
HeliconiusHMEL0131240.085.29% 
BombyxBGIBMGA010740-TA2e-14369.74% 
DrosophilaCG1208-PC3e-11045.90% 
EBI UniRef50UniRef50_Q7PCM58e-13251.74%AGAP003492-PA n=3 Tax=Anopheles RepID=Q7PCM5_ANOGA
NCBI RefSeqXP_309674.44e-13352.15%AGAP003492-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479699858e-13252.15%AGAP003492-PB [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479699856e-13652.15%AGAP003492-PB [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550859.4e-97transmembrane transport
GO:00160219.4e-97integral to membrane
GO:00228579.4e-97transmembrane transporter activity
GO:00160205.4e-95membrane
GO:00228915.4e-95substrate-specific transmembrane transporter activity
KEGG pathway 
InterPro domain[78-494] IPR0058289.4e-97General substrate transporter
[79-491] IPR0036635.4e-95Sugar/inositol transporter
[17-492] IPR0161961.8e-62Major facilitator superfamily domain, general substrate transporter
Orthology groupMCL14819 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206261-TA
ATGGATAAGCCGACGATGCACATGCTCTCCGTCCAGGAGAAGGGCGGAGAAACTCGAAAACTAAACCAGTACATAGCCGCTGTAGCAGCTTCTATCGGCGCAGTGGCCGCTGGTACGATCTTGGCGTGGTCATCACCAGCCCTCCCACAGCTCCAACCGCCGAAGAATACTTCCACGAATTTAACAGAGATCGACATATTCTTTCTCAATGAAACATCGAAAGCCAATGTCTCTGACGTATTAATCAACGCTTTGGGCCAACCAGCCGATTTTTTGTTGAACACCAAAGATAGTTCATTAGTGTCGTCTATCCTTGCAATTGGGGCAGCGATCAGCGCTTTACCCGTCGGTTTTTCTGCCGAGCGCTTCGGAAGACGACCCACGATTCTGATGCTGTCTCTGCCGTTTCTGATCAACTGGTTGCTCACAATATTCGCTAATGGATCGGGGATGCTTATTGCTGCTAGATTTTTTGCTGGTTTGGGAACAGGTGGTATATGCGTGTGCGCGCCAATGTACATAGGGGAGGTAGCGGAGACGTCTATCCGCGGCTCGCTTGGCTCTTTTTTCCAGCTCTTCCTGACTGTAGGAATCCTCTTCACCTTCGTGGTCGGCGGTTGGACCCATTGGAGGACGCTCTCCATTATATCTGCTGTGTTTCCCGTCTTACTGATCGCCGTTTTCTGGTGGATGCCGGAGACGCCGCAGTACCTTCTAGGTAAAAACCGTCGTCGCGACGCCGAGAGGTCTCTCCGCTGGCTCAGAGGACCCCTGGCCGACCTCAGCGGAGAACTGGAAGAGATGCAGAAAGATGTCGACACAGCCTCCCGTCAAAGCGCCGGCATCCTCTCCATGGTGACGCAGCGAGCTCCTCTGATGGCGCTGATCTGTTCATTGGGCCTCATGTTCTTCCAGCAGTTCAGCGGAATCAACGCCGTCATCTTCTACACCAACAACATCTTCCAGTCCGCCGGCTCCAACATCCCGCCGGTCATAGCCACCATCATAGTGGGCGTGGTGCAGACCATAGCTACTTACATTTCGTCATTGCTTATAGAAAAAGCCGGCAGGAGGATCCTCTTGCTCCAGAGCTGTATCATTATGGGCATCTGTTTGATAGTGTTGGGCACATACTTCAAGTTGCAGGAGAGCGGCGCCAATGTTGGCACGTTTGGTTGGCTCCCATTGGTCTGCTTGGTCCTCTTCATCGTTTCCTTTTCCTTGGGCTTCGGGCCGATACCCTGGATGATGATGTCCGAGCTGTTCGCTATCGAGTTCCGAGGAACAGCTACGGGTATAGCCGTTATAACCAACTGGTGTCTAGTCTTCATAGTGACGCTATGCTTCCCGTTGCTGAAGGACATGATAGGTATTTACAGCTGCTTCTGGGTCTTCAGCGGCTTCATGATAGTTTGCGTGTTCTTCGTTTTCTTCCTGATACCAGAGACCAAAGGGAAGACCGTCTCTCAGATCCAAACCATACTTGGCGGGAAACGCGCGTGA

Protein sequence:

>DPOGS206261-PA
MDKPTMHMLSVQEKGGETRKLNQYIAAVAASIGAVAAGTILAWSSPALPQLQPPKNTSTNLTEIDIFFLNETSKANVSDVLINALGQPADFLLNTKDSSLVSSILAIGAAISALPVGFSAERFGRRPTILMLSLPFLINWLLTIFANGSGMLIAARFFAGLGTGGICVCAPMYIGEVAETSIRGSLGSFFQLFLTVGILFTFVVGGWTHWRTLSIISAVFPVLLIAVFWWMPETPQYLLGKNRRRDAERSLRWLRGPLADLSGELEEMQKDVDTASRQSAGILSMVTQRAPLMALICSLGLMFFQQFSGINAVIFYTNNIFQSAGSNIPPVIATIIVGVVQTIATYISSLLIEKAGRRILLLQSCIIMGICLIVLGTYFKLQESGANVGTFGWLPLVCLVLFIVSFSLGFGPIPWMMMSELFAIEFRGTATGIAVITNWCLVFIVTLCFPLLKDMIGIYSCFWVFSGFMIVCVFFVFFLIPETKGKTVSQIQTILGGKRA-