Monarch geneset OGS2.0

DPOGS215175
TranscriptDPOGS215175-TA1398 bp
ProteinDPOGS215175-PA465 aa
Genomic positionDPSCF300143 - 470072-472579
RNAseq coverage421x (Rank: top 29%)
Annotation
HeliconiusHMEL0044180.064.36% 
BombyxBGIBMGA011016-TA9e-11141.99% 
DrosophilaCG8654-PB2e-8738.60% 
EBI UniRef50UniRef50_D6WKJ17e-10041.47%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WKJ1_TRICA
NCBI RefSeqXP_973659.11e-10040.09%PREDICTED: similar to organic cation transporter [Tribolium castaneum]
NCBI nr blastpgi|910825352e-9940.09%PREDICTED: similar to organic cation transporter [Tribolium castaneum]
NCBI nr blastxgi|910825377e-10141.47%PREDICTED: similar to organic cation transporter [Tribolium castaneum]
Group
Gene OntologyGO:00550851.6e-36transmembrane transport
GO:00160211.6e-36integral to membrane
GO:00228571.6e-36transmembrane transporter activity
KEGG pathway 
InterPro domain[80-462] IPR0161961.4e-52Major facilitator superfamily domain, general substrate transporter
[14-455] IPR0058281.6e-36General substrate transporter
Orthology groupMCL35050 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215175-TA
ATGATATGTCTGACGATACCCATTATAAAGTTCTCGTCGGGCTGGGTACAAATGGCGATAGTATTCCTCACACCAAAAACTACTTTTTGGTGCACAGACTTCGGAGACAATTCCACGAGAACAGGAGACAACATGACCTGCTACGATGAATGTGCGAAATACAGCTACGACGCCTCGCCCTTCAATAATACTATCATATCAGAATGGGACCTGGTTTGTGAGAGAAGCTGGCTGACGAGCCTTACTCAGATGATGCTGCAGTTGGGTATCCTGCTGGGGAGCATACTTTTTGGATTTTTCTCTGACAGATATGGCAGAAGAAAAACTTTGTTGTTGTCCGTGGTCGGGTTGATAGTGTTCGGATTCGCCGTGTCCTTTTCTCCAGACTACATCACGTTCACAACCTTAAGGTTCCTCCTCGGCGTGGCAACCGCTGGAACTATGGTCGTCTCTTTTGTACTCATCATGGAGACGATAGGACCGAAGTATCGTGAAGTGTGTGGCTGTCTCTTTCAACTGCCTTTCATCATCGGTCACGCGACGATGCCGATATTTGCGTTTTATAATAGAAGCTGGGACTCCTACAGTCTTGCCATGGCAGTTCCTCCGCTGATCTACTTAGTATTCTTCTTCACAGCACCGGAGTCTCCAAGATGGCTTATCTCAATGGGAAAAACAGATCAAGCAAGTCGCGTCGTCACTAAAGTAGCTGAAATGAATAAACTTCCGACAACCAAAGTAGAGGAGACCATAAAAAGCTTGTCCGAAGAAATCCGCTCAAAGGCGACCACTGTGAAGCCGCACTACGGAGATCTATTTCGAGGTTCTCTGATGATAAAGACGATAAGTTCCTGCGTCATATGGATGATCACCGGTCTGACCTACTACGGCTTCAACCAGTACGTCAGCCAGACCAGCCCCAACCCCTTCATCACTGTAGCAGCTGCGGGACTCATTCAGATTCCATCTATATTCATATCAATAGTCCTGCTCAAGTATTTCGGTCGCAAGACGACGATCGTTACCTTTTTTGTTTTGGGAGGTCTCTTCGTCCTAGTGTTGGGTTTAGTGTCAGGTAGTTTCTGGACGAATTTAACACTAGCATGTGTCGGAATAAGCTGTGTGTCCGTAGTTTGCACGTGTGTGTATATTTACACATCAGAATTGTTCCCAACCGTGGTCAGGAATATGAGTATGGGGGCTTGTTCCACGTGTATGAGGATCGGCTCGATGATAGCTCCTTTCATCTCCAACTTGTCGGAAACTGTGCCCTGGATGCCGACCGTTATTTTTGGATTCGCCCCACTTTTAGGAGCTCTAATCTGTCTCATGCTACCAGAAACTAAAGGGACAACTCTACAGGACGTAATAGATAGTAAAGAAGAACAAAAAACGTGA

Protein sequence:

>DPOGS215175-PA
MICLTIPIIKFSSGWVQMAIVFLTPKTTFWCTDFGDNSTRTGDNMTCYDECAKYSYDASPFNNTIISEWDLVCERSWLTSLTQMMLQLGILLGSILFGFFSDRYGRRKTLLLSVVGLIVFGFAVSFSPDYITFTTLRFLLGVATAGTMVVSFVLIMETIGPKYREVCGCLFQLPFIIGHATMPIFAFYNRSWDSYSLAMAVPPLIYLVFFFTAPESPRWLISMGKTDQASRVVTKVAEMNKLPTTKVEETIKSLSEEIRSKATTVKPHYGDLFRGSLMIKTISSCVIWMITGLTYYGFNQYVSQTSPNPFITVAAAGLIQIPSIFISIVLLKYFGRKTTIVTFFVLGGLFVLVLGLVSGSFWTNLTLACVGISCVSVVCTCVYIYTSELFPTVVRNMSMGACSTCMRIGSMIAPFISNLSETVPWMPTVIFGFAPLLGALICLMLPETKGTTLQDVIDSKEEQKT-