Monarch geneset OGS2.0

DPOGS213734
TranscriptDPOGS213734-TA1932 bp
ProteinDPOGS213734-PA643 aa
Genomic positionDPSCF300278 + 167386-182844
RNAseq coverage204x (Rank: top 47%)
Annotation
HeliconiusHMEL0025690.086.09% 
BombyxBGIBMGA011489-TA0.088.45% 
DrosophilaCG10019-PB7e-8941.38% 
EBI UniRef50UniRef50_UPI00022CA0D40.065.66%UPI00022CA0D4 related cluster n=1 Tax=unknown RepID=UPI00022CA0D4
NCBI RefSeqXP_392186.20.065.66%PREDICTED: similar to CG10019-PA isoform 1 [Apis mellifera]
NCBI nr blastpgi|3504194960.065.66%PREDICTED: hypothetical protein LOC100748465 [Bombus impatiens]
NCBI nr blastxgi|3504194960.065.78%PREDICTED: hypothetical protein LOC100748465 [Bombus impatiens]
Group
Gene OntologyGO:00550851.6e-15transmembrane transport
GO:00160211.6e-15integral to membrane
KEGG pathway 
InterPro domain[1-471] IPR0161961.8e-36Major facilitator superfamily domain, general substrate transporter
[77-387] IPR0117011.6e-15Major facilitator superfamily
Orthology groupMCL17321 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213734-TA
ATGGAAGAGTCATGGTCATCAGGCGGCGCGCCGCTTCTACGTACACGTTCGTTACCCCTGGTGTTAGAGGCGAGCGGTCCACCTGCGCGGCAAACCAAGCTAGATCCGCGACAACTGCTCACCCTCACCAGACATTATTACCCCGAGGGGGGCTGGGGCTGGGGGGTCGCGGCCGCAGCGACCATCGCCCAGCTCTTGGCCCATGGATTGCACCAAGCCGCCGCTGTATTGGCCGTCGAGATTGTAAGACGATTCGGTCCCGAGGTTCGGATGCAAGCAGGTTGCCTAGGCGCCATTTCTGCGGGAGTAGCTCTCGCGCTGTCACCAGTGACAGTGGCGTTATGCGTTCGGAAATCTACAAGAGTGACGGCGGTGGTTGGGGGCCTGGTTGCAGCACTCGGATGTCTCTTCACCTCATTCGCTACGCAGTTTCATCAGTTATTCTTCAGTTACGGGACAGTAGTAGGAGTAGGGGTCGGTTTGACAAGGGATTGCTCAACGCTGATGGTGGCGCAGTACTTCAAGAGGCGACGGGAGCTGGTAGAGATATTCATCGTCAGCGGAAGTGGTCTTGGCATAGCTGTCATGTCCACATTTATAAAAGGGGCCATCAGAGCTATTGGTTGGCGTCTAGGACTTCAGGCAGTGACGGGAGTGGTATTCGTGACATTTATTCTGGGGACGTTTTATCGTTCTGCTTCTCTGTACCATCCCCAAAGACGGGCGATACTCCATCTCAAGAACCAAAACAAAATCAAGCGGAAAATGAAGGACAGGAATAAGGCTGACGACAGGCAGCCTTTCTTTGACTTCTCCACTCTAAAGTCTAAAACCGTTAGGATCCTGCTCATGTCCACTGGGATATCGGCGTTTGGCATCAATACGCCGATATTTTATTTAGCCTATCACGCTGAAGAAGAAGGTCTCGGTGATACAGCAGAATTGTTGCAAGCGTATTTAGGTTTAGCGTGGGCGGTAGGATGCGCTGCGTTCGGATTGTTGGTGCGTCAGAACAGCGCGGAGTGCCGTATTGCTCGTCAGTATCTTACCCAAGCGGCAGTGTTCGGATGCGCGCTCGCCACTATGGCCTTAACTGCGGTTGAGGGGTCTTACAGAGGATACGTTATGTTCGCGTGGGTTTATGGTATTTTCTGCGGAGGTTACCACTATTCACTCAAGATGTACACCTACGAGCGCGTGAGGTCTCGCAATTTCGCTCGCACGTGGGGTTTCGTACAGTGCTCGCAGGCTGTACCTATTGCGATCGGTGTACCTCTATCAGGTTACATCAACGACGGTTGCGGCGGCAAGGCTGGTTACTACTTCAGTTCGACTTGCTCCATCATTGGTTCACTCTCATTGTTCTGCATCGATTTGCATCGTCGCAGCGTTGCGCATAAACACACCAAAGAAAATGGCGGCAAGTCGTGCGAATCGACATGCCCGCCACGAGGCCGACCGCGGTCCGAGCAACGCGTCGCCGGCGCCACCGCATTAAGTGCTGAACTAGTGACGCCGGGGAGCCGTCGAGATATTCTGCTGGATATCGGGCCTGGTAGCTTAGGATCCCCTCCCCCCAACCTTCCCCCTGAATTGACCTGCATCAGCGAGGAAGGCGGGCTAGACTTAGACCTAGACCTAGATATACCCGAACACCTGCTAGAAGATCTAGACTGCGGGGGAGATTGTATTACTAGTTGTAATAAGGTCGAAAACTATTTAATGTTAAGCGAATACGAAAACAATTTGATAGCGGAGCTGCCCAATTTGAACGAGCGTCGTGGTCGGCGTTGGTCTATCGTAGTCTCGAACACCAACTCGCCGCAACCGGAAAATCAATCCCCCGAACACAGGCGGAACTCGATCAAGTTTAAGAAGAAGTGCCATACGAATAACAGATTGATAACAGTGATAAACGAAGCGTCGCTGTAG

Protein sequence:

>DPOGS213734-PA
MEESWSSGGAPLLRTRSLPLVLEASGPPARQTKLDPRQLLTLTRHYYPEGGWGWGVAAAATIAQLLAHGLHQAAAVLAVEIVRRFGPEVRMQAGCLGAISAGVALALSPVTVALCVRKSTRVTAVVGGLVAALGCLFTSFATQFHQLFFSYGTVVGVGVGLTRDCSTLMVAQYFKRRRELVEIFIVSGSGLGIAVMSTFIKGAIRAIGWRLGLQAVTGVVFVTFILGTFYRSASLYHPQRRAILHLKNQNKIKRKMKDRNKADDRQPFFDFSTLKSKTVRILLMSTGISAFGINTPIFYLAYHAEEEGLGDTAELLQAYLGLAWAVGCAAFGLLVRQNSAECRIARQYLTQAAVFGCALATMALTAVEGSYRGYVMFAWVYGIFCGGYHYSLKMYTYERVRSRNFARTWGFVQCSQAVPIAIGVPLSGYINDGCGGKAGYYFSSTCSIIGSLSLFCIDLHRRSVAHKHTKENGGKSCESTCPPRGRPRSEQRVAGATALSAELVTPGSRRDILLDIGPGSLGSPPPNLPPELTCISEEGGLDLDLDLDIPEHLLEDLDCGGDCITSCNKVENYLMLSEYENNLIAELPNLNERRGRRWSIVVSNTNSPQPENQSPEHRRNSIKFKKKCHTNNRLITVINEASL-