Monarch geneset OGS2.0

DPOGS214822
TranscriptDPOGS214822-TA2262 bp
ProteinDPOGS214822-PA753 aa
Genomic positionDPSCF300375 - 98523-110211
RNAseq coverage18x (Rank: top 80%)
Annotation
HeliconiusHMEL0035410.078.39% 
BombyxBGIBMGA013928-TA0.096.11% 
DrosophilaTret1-2-PA2e-7534.59% 
EBI UniRef50UniRef50_Q7PVH73e-16863.45%AGAP009274-PA n=5 Tax=Neoptera RepID=Q7PVH7_ANOGA
NCBI RefSeqXP_320068.46e-16963.45%AGAP009274-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1583000681e-16763.45%AGAP009274-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|2420156266e-16764.62%sugar transporter, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00550853.9e-54transmembrane transport
GO:00160213.9e-54integral to membrane
GO:00228573.9e-54transmembrane transporter activity
KEGG pathway 
InterPro domain[186-742] IPR0161968.7e-55Major facilitator superfamily domain, general substrate transporter
[297-743] IPR0058283.9e-54General substrate transporter
Orthology groupMCL16337 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214822-TA
ATGGCGGGTTCAAGAGAGAGATTGCTATACGATCGTCACGTGGAATTTGAATTAGAATATGACAGAAGACGCCGACAGGAAACCAGGTTTGAACCCCAGAGGTCACACACGAGCCGAGCTCCGGAAAGATACCAAGAAACACCTCAGAGGAAAAATGGCTACGTCTTCGAAGACATTTATTCAGATTTCGACGAAGCCCAAGGCCGCCTGTCACTATTCCGACCCATCAGACCCGAATATTTAAATCTGCAAACAACATATGACAACAAATACAACTCGCTGCCTCGGACGTATTACAATGACGAGAGACGGCGCTATAAAAAAGATTATTACGACTTCAGAATCGATGAAAGGAAAATACCGAAGCAATACTATGAAAAATTTCAATTGGACGAAAGAATCGAAAGGAAAAGCAATGAGATCAGGACAAAGGAGAGGCGATCCAACTCTAACGAACCGAGAGTTGATAGGAAGCCTAAGAATATTGCCGTCTGCAAACCCCTTCAGCTGCCTGAGAGGAACGCCAGATACTGGACAACGGATCCACCGGGAAAAGATAAAAAACCAAAACGAGAGGAGAATAAGAAAATTGTGAAGTTCTCAGAAGTTTCTGGGTGTAGGAAGATGGTTATAGATGACCCTGTGTGCGGCAGAAAAGTCGTAGCCGTCCGTGAACTTGAGGTTTCTTTGGACCAAATGATACAGAACGGTTACTTCGAGAGACACAACATACCCATACGACTACCGGAAAAAAAGGAAGAGATAACTAGCGCGCTGCCTAACGGGAAGTGTGTCAGCGATACGCCGACGAGAAAGTCTAGACATTCCAGGCATCTACCGCAGGTGTTGGCAGCCCTTGCGGTATCACTAGCTCCGTTTTCGGCAGGTCTTGGTAAGGGTTACAGTTCCCCAGCTATAGCTTCGCTTCAAGGACCTGGAAACGCAACCAGGAGGGATTTCCAGCTCACAGATCAACAGGCGTCCTGGCTTGCTTCGTTATCTTTTCTCGGTGCGCTGTTCGGTGGTATGGCTGGAGGCGCTGCTATGCGTCACGGACGTCGCCGAGTACTGTCCCTGGCCGCAGCACCTTGCAGCCTCTCCTGGCTATTGACGGTGCTGGCCACATCAGTACGCATGATGTGCATCACAGCTTTTCTGGGAGGCTTCTGCTGCTCTATACTGACAATGTTATCACAGGTTTACATCAGTGAAATATCAGTACCAGACATCAGGGGCTGTCTGAGCGCTGTTCTCAAAATTGTAGGTCACCTTGGAGTGCTGTTTAGTTTCACCATCGGAGCCTATCTCGACTGGCAGCAACTAGCTCTTTGTATTTCCGCTGCCCCCCTACTTCTCTTCTGCACAGTGCTTTATATACCGGAGACCCCAAGCTACCTCGTCCTTATTGGAAAAGACGATGAAGCATACAAAAGCTTGCTATGGCTTCGTGGACCAAACTCTGACGTTGCTCAAGAATTAGCTACAATCAGAACTAATGTACTCGCCAGCAAAAATTTTAGTCAGCGACAAAGTCAAATGTCCAGCAGTCAACTGATAAGTTCGTTAGACGTTAGGACGATGAACCGCTTGCTTGGACCAATTTTAGTGACTTGCGGACTGATGATGTTTCAGCGGTTCTCTGGTGCTCACGCATTCTCTTTCTACGCTGTTCCGATATTCAGGAAGACTTTTGGAGGAATGAATCCACACGGAGCAGCTATTGCTGTCTCATTCGTGCAATTACTAGCATCTTGTCTCTCTGGTTTATTGATCGACACTGTTGGGCGACTGCCTTTGCTTATAGTGAGCTCGGTGTTAATGTCTATGGCTTTAGCTGGCTTCGGAAGTTATGCTTACTACGAAGAAGTCCACAGGAATCAGAGGATACAGAACGTAATGTTCCATCAAACAGTGGGACAAAATGATTGGATTCCATTACTATGTGTTTTGGTATTTACAATTGCGTTTTCTTTGGGCATGAGTCCCATATCGTGGCTGCTTATAGGTGAGCTCTTCCCTCTCGAATACAGGGCGTTTGGCAGCGCGATGGCAACTGCGTTTAGTTATCTCTGTGCTTTCGTCGGTGTGAAAACTTTTGTCGATTTCCAACAAGCGTTGGGTTTGCACGGCGCATTTTGGTTGTACGCCTCCATAAGTGTCGGGGGACTTTGTTTTGTTGTCTGCTGTGTGCCAGAAACCAAAGGCAGGGATCTTGATGAAATGGATCCAAACTATGTCCAAAGCCTCTCCCCCAAAAGGTAA

Protein sequence:

>DPOGS214822-PA
MAGSRERLLYDRHVEFELEYDRRRRQETRFEPQRSHTSRAPERYQETPQRKNGYVFEDIYSDFDEAQGRLSLFRPIRPEYLNLQTTYDNKYNSLPRTYYNDERRRYKKDYYDFRIDERKIPKQYYEKFQLDERIERKSNEIRTKERRSNSNEPRVDRKPKNIAVCKPLQLPERNARYWTTDPPGKDKKPKREENKKIVKFSEVSGCRKMVIDDPVCGRKVVAVRELEVSLDQMIQNGYFERHNIPIRLPEKKEEITSALPNGKCVSDTPTRKSRHSRHLPQVLAALAVSLAPFSAGLGKGYSSPAIASLQGPGNATRRDFQLTDQQASWLASLSFLGALFGGMAGGAAMRHGRRRVLSLAAAPCSLSWLLTVLATSVRMMCITAFLGGFCCSILTMLSQVYISEISVPDIRGCLSAVLKIVGHLGVLFSFTIGAYLDWQQLALCISAAPLLLFCTVLYIPETPSYLVLIGKDDEAYKSLLWLRGPNSDVAQELATIRTNVLASKNFSQRQSQMSSSQLISSLDVRTMNRLLGPILVTCGLMMFQRFSGAHAFSFYAVPIFRKTFGGMNPHGAAIAVSFVQLLASCLSGLLIDTVGRLPLLIVSSVLMSMALAGFGSYAYYEEVHRNQRIQNVMFHQTVGQNDWIPLLCVLVFTIAFSLGMSPISWLLIGELFPLEYRAFGSAMATAFSYLCAFVGVKTFVDFQQALGLHGAFWLYASISVGGLCFVVCCVPETKGRDLDEMDPNYVQSLSPKR-