Monarch geneset OGS2.0

DPOGS203137
TranscriptDPOGS203137-TA1533 bp
ProteinDPOGS203137-PA510 aa
Genomic positionDPSCF300035 - 1290530-1293225
RNAseq coverage94x (Rank: top 62%)
Annotation
HeliconiusHMEL0032461e-14667.78% 
BombyxBGIBMGA009185-TA4e-16665.27% 
DrosophilaCG7458-PA8e-8433.33% 
EBI UniRef50UniRef50_E2A7K49e-8236.78%Solute carrier family 22 member 21 n=7 Tax=Formicidae RepID=E2A7K4_CAMFO
NCBI RefSeqNP_649374.12e-8233.33%CG7458 [Drosophila melanogaster]
NCBI nr blastpgi|3838630037e-8638.43%PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
NCBI nr blastxgi|3838630035e-8538.43%PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
Group
Gene OntologyGO:00550852.9e-25transmembrane transport
GO:00160212.9e-25integral to membrane
GO:00228572.9e-25transmembrane transporter activity
KEGG pathway 
InterPro domain[97-471] IPR0161961.1e-38Major facilitator superfamily domain, general substrate transporter
[95-457] IPR0058282.9e-25General substrate transporter
Orthology groupMCL25584 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203137-TA
ATGTTGTCAGCAATGTATTCCCTCAATTATGTCTTTGTCGCAGATCAAGTAGCTTTTAGGTGTTTAGTGAAAGAATGTGAAGGACAAGGCGGTTATTTTGCGAACAAAACAATACAAGATCTATTACCAACTCCAAATGAAACCTGCCACCGCTATCAGGCAGTACAGGCCGACCAAATTAGCTGTGACATAAAAGACTATTATTTAAATAAAACGAAAAAATGTAATGACTTTGTCTATGAAACAATGAATACTATTTACGCCGAGTTTTCCATGGCTTGCAAAGAATGGCAGCGTACCCTCGTTGGTACTATCCGAAACAGTGCTCTACCTTTAGCTCTTGTCCTCACTGGATATATTTCCGATAGATATGGACGGCGAACCGCATTTTGTATTTTCGCTGCATGTGCTGGTGTGCTGGGTATTATAAAATCTTTCATGCCAAATTATTCGGCATACCTAACAATGGAGTTTTTAGAAGCTGCTCTAGGTTACGGTTTCAATAGTGCCGTCTATGTCATGATTGTTGAGTTAGCTCGTCCTTCATTACGAGTTGCATTCGCCTGTGTGACTGGTATAGGATACGGGCTGGGTGGCATGCTATTCGCGCTGATTGCTAGCCAAGTCCCTTACTGGCGAAACCTGCTAAGAGCTATACACACACCAGCTTTATTTCTGCCCCTATATTGGTTTCTTCTAGACGAGAGCGCTAGATGGTTGCATGCAACCAATAAGAAGCATGAAAGTATCAGGGTTATTAAAAAAGCTGCAAGATGGAATAAGGTAGTAGTAGATGAAGATCTCATAAATAGTATAAAAGGCGAAACAGGAGCTGATGAAGTAAAAAATAAAAACAATCCATGGTTGAATTTATTAAAATCTAAAATACTTATGCTCCGGTTTTGCATTTGTTGCTGGTGTTGGATATCGGTCACCTTTGTTTATTATGGACTCACAATTAATTCGGTGTCCATGTCCGGTGACAAATATGTGAACTTCGCCCTTAGCATGTTTATGGAGATCGTGGCGTCTTTGTTAATCATGATGGCGTTGGAACGCTTCGGACGTAAACAGAGCATTTTCGTTGCCTTCTTGGTTTGCGGAATCACTTGTGTTACGCCTTTCTTTATATCACATTCCAATACGAAAACAGCGTTATTCTTTGTCGGGAAACTGTCAATCACATTTGCCTTCAATTCGCTGTACGTTTTCACGGCTGAGTTATTCCCAACTGAGGTTCGATCTTCGGCAATGGCGGCTGTGTCTCTTATTGGTCGCGTAGGATCTCTTGTGGCCCCACAGACGCCACTTTTGAGCGAATTTATTCAAGCACTACTATATGGGATCAGCTCAATCTCAGCAGCTCTGTTAGTGTTACTCGCGCCAGAAACACGTCGTGCGGCACTTCCACAACACGTGCAGCATGCGGAGCAAATGCATGCGCTTGCGCATCCACCTAACAGGTCGTTTGGCATACGAGGAAGCTTCAACGATCCCAGCCGTTATAGCTTACCAACTACTTCGCAGCTTTAA

Protein sequence:

>DPOGS203137-PA
MLSAMYSLNYVFVADQVAFRCLVKECEGQGGYFANKTIQDLLPTPNETCHRYQAVQADQISCDIKDYYLNKTKKCNDFVYETMNTIYAEFSMACKEWQRTLVGTIRNSALPLALVLTGYISDRYGRRTAFCIFAACAGVLGIIKSFMPNYSAYLTMEFLEAALGYGFNSAVYVMIVELARPSLRVAFACVTGIGYGLGGMLFALIASQVPYWRNLLRAIHTPALFLPLYWFLLDESARWLHATNKKHESIRVIKKAARWNKVVVDEDLINSIKGETGADEVKNKNNPWLNLLKSKILMLRFCICCWCWISVTFVYYGLTINSVSMSGDKYVNFALSMFMEIVASLLIMMALERFGRKQSIFVAFLVCGITCVTPFFISHSNTKTALFFVGKLSITFAFNSLYVFTAELFPTEVRSSAMAAVSLIGRVGSLVAPQTPLLSEFIQALLYGISSISAALLVLLAPETRRAALPQHVQHAEQMHALAHPPNRSFGIRGSFNDPSRYSLPTTSQL-