Monarch geneset OGS2.0

DPOGS209696
TranscriptDPOGS209696-TA1626 bp
ProteinDPOGS209696-PA541 aa
Genomic positionDPSCF300309 + 24461-37369
RNAseq coverage101x (Rank: top 61%)
Annotation
HeliconiusHMEL0098410.072.15% 
BombyxBGIBMGA001599-TA3e-17972.08% 
DrosophilaCG7442-PA3e-11138.09% 
EBI UniRef50UniRef50_E2A7K41e-11541.03%Solute carrier family 22 member 21 n=7 Tax=Formicidae RepID=E2A7K4_CAMFO
NCBI RefSeqXP_391853.21e-12242.06%PREDICTED: similar to CG7442-PA [Apis mellifera]
NCBI nr blastpgi|3838630032e-12342.49%PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
NCBI nr blastxgi|3287826982e-12242.00%PREDICTED: solute carrier family 22 member 21-like [Apis mellifera]
Group
Gene OntologyGO:00550851.3e-27transmembrane transport
GO:00160211.3e-27integral to membrane
GO:00228571.3e-27transmembrane transporter activity
KEGG pathway 
InterPro domain[8-528] IPR0161966.5e-58Major facilitator superfamily domain, general substrate transporter
[109-519] IPR0058281.3e-27General substrate transporter
Orthology groupMCL15564 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209696-TA
ATGGCCAAAGACATAGGATTGGACTCTATCCTCCAAGAATTGGGTGCTTTTGGTAAATATAATATAGTGAACTATGCAATTTTGCTGTTTCCAGTCTTCTTGGCTGGAATGTATGGTTCAATTTATATTTTCGAAGCCCCCGATATAGACTACAGATGCAAGATAACGCAATGCGAAGGTCTGAACAACACGTCCTGGCTCTCAAACGCCATACCAAAGGAGAAGGACGTTCTGTCGAGATGCATGAAGTACAAACATATTGACAACTACAACTACAGCTGCAGCGTCGATAGTTTTAATACATCCATTACAGAAGTCTGCGAGGAATACGAGTATAGCAACGAGAATTCAGCTGTTAAGGATTTTGACCTTGGTTGCCAAAATTGGAAGAGGACTCTTATAGGGACTGTTCACAATGCAGGTCTCTTCCTGTCGCTGCCGCTCACAGGGTACATATCAGATAGATTCGGCAGGAAATTTGCGCTATCTATTGCATCCCTCATGAATGGTACCTTTGGTTTCCTAAGATCATTTTCAACGAATTACGTTATGATGCTCGCGTTTGAGTTCCTCGAGGCAGGTTTGGGAGCTGGTGCGTACAGCACAGCGTTTGTTCTTGCTATGGAATTGGTAGGTCCAAAGGGTCGCGTTTTCGGAAATACCATCATAAATGCTGTTTATGTGTCGGGGCTGATGACGTTGGCAGCTTTATCATGGTGGCTTCAGAGTTGGAGGCATCTGTTAAGAATAATTTACATTCCAGCTGTGTTTGTTATTTCATACATTTGGATTTTGAATGAGAGTATACGATGGTTGCTCAGCAAAGGACGTACCGAGGAAGCGATCGATATTTTAAAAAAAGCTGCTAAAATGAACAAAGTACAATTATCAGAGGAAGCTCTCACTCCCCTATATGAATTAAAAAAACTAAATGGAGACCATGAAAGCGAGAAACAAAAAGACGATATAACAAATAAAGTAGAAGAAAAAACTTCAAATTTTGTTAAAGTAATAAGGTCCAGTGTTATGAGAAAGAGATTGGCTATATGTTCGTTTCTCTGGATAACATGTACTTTTGTGTACTACGGTCTGTCCATTAACTCCGTGTCATTGGGTGGCAATAAATATATTAACTTCATGCTGGTCGCATTTGCTGAGATTCCAGCGAACATTGTGTGTTTCTTGGTGTTGGACAGATTTGGAAGAAAAAAAGTGCTAATAATAACTTACGTGTTGAGTGCTTGTTTGTGCATCGGATTGTCTTTTCTGCCCAAAGATCAGAAATGGTTGTCTCTCGTACTGTATTTGTCTGGTAAGTTTTCCATTACGGTGTCATACAGCTCCGTGTACATCTACGTGTCGGAGGTGTTCCCCACTAGCGTGAGACAATCACTGCTAGCTGTGTGCTCTTCGCTTGGACGGGTCGGATCGACATTAGCGCCGCTAACGCCTTTACTAACGTTATACTATCATAATCTGCCAGCGATATTCTTCGGATCCATGGCATTAGCTGCTAGTTTGTTAGTGTTCACACTACCAGAGACAATAAACGTCGCTTTACCAGACACCATAGAAGAAGCCGAGATGATATCAAAGAGGAAACATGGGAGTGATAACGAAAGATAA

Protein sequence:

>DPOGS209696-PA
MAKDIGLDSILQELGAFGKYNIVNYAILLFPVFLAGMYGSIYIFEAPDIDYRCKITQCEGLNNTSWLSNAIPKEKDVLSRCMKYKHIDNYNYSCSVDSFNTSITEVCEEYEYSNENSAVKDFDLGCQNWKRTLIGTVHNAGLFLSLPLTGYISDRFGRKFALSIASLMNGTFGFLRSFSTNYVMMLAFEFLEAGLGAGAYSTAFVLAMELVGPKGRVFGNTIINAVYVSGLMTLAALSWWLQSWRHLLRIIYIPAVFVISYIWILNESIRWLLSKGRTEEAIDILKKAAKMNKVQLSEEALTPLYELKKLNGDHESEKQKDDITNKVEEKTSNFVKVIRSSVMRKRLAICSFLWITCTFVYYGLSINSVSLGGNKYINFMLVAFAEIPANIVCFLVLDRFGRKKVLIITYVLSACLCIGLSFLPKDQKWLSLVLYLSGKFSITVSYSSVYIYVSEVFPTSVRQSLLAVCSSLGRVGSTLAPLTPLLTLYYHNLPAIFFGSMALAASLLVFTLPETINVALPDTIEEAEMISKRKHGSDNER-