Monarch geneset OGS2.0

DPOGS203206
TranscriptDPOGS203206-TA1320 bp
ProteinDPOGS203206-PA439 aa
Genomic positionDPSCF300035 + 653963-656589
RNAseq coverage232x (Rank: top 44%)
Annotation
HeliconiusHMEL0157480.063.03% 
BombyxBGIBMGA011090-TA4e-15761.82% 
DrosophilaCG8654-PB8e-9536.62% 
EBI UniRef50UniRef50_D6WKJ12e-10540.33%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WKJ1_TRICA
NCBI RefSeqXP_001649205.18e-10840.57%organic cation transporter [Aedes aegypti]
NCBI nr blastpgi|1571061822e-10640.57%organic cation transporter [Aedes aegypti]
NCBI nr blastxgi|910825359e-10441.61%PREDICTED: similar to organic cation transporter [Tribolium castaneum]
Group
Gene OntologyGO:00550851.8e-29transmembrane transport
GO:00160211.8e-29integral to membrane
GO:00228571.8e-29transmembrane transporter activity
KEGG pathway 
InterPro domain[83-423] IPR0161962.4e-46Major facilitator superfamily domain, general substrate transporter
[102-317] IPR0058281.8e-29General substrate transporter
Orthology groupMCL25810 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203206-TA
ATGGAGTTACAACGGCAAGGAGATGAGAATGGTTCGAAATCGAATCAAAATGAACCTAAGTTAGATCATATACAGAGTGCCATAGGATCATTTGGAAAATATCAATTTTACTTGTGCTTCCTGATATTTTTATCAAAATTCCCGGTTGCATTCCATCAGATGGCAATAATATTTTTGGCTCCAAAAGCAGAATTTACTTGTGAAGGAACACAAATAAAAGGAACTTGCCCTTGTGACAAACCAGTATATGACACTTCGATATTCACAAATACAATCATAACGCAATGGGATCTTATATGCAAAGATAAATGGTTGGCGAGCTTAACTCAAACCTTGTTTCAATTGGGGACCTTGATTGGCAGTCTTCTTTTCGGGATGGCTTCTGATAGATTCGGACGCAAAAAACCGATGTTATTTGCTGTCTTATTACAAGTTTCATCAGGAGTAGCCGCGGCCTTTGCACCTGACTACTGGTCTTTCAGCCTTCTTCGTTTTATCGTAGGGATGTCAGTAGGAGGGACAATGGTCGTAGGATTTGTTATTATTATGGAATATGTAGGCGCAAAATATCGTGATATTATTTCTGCTCTTTACCAAGCACCTTTCAATATGGGTCATATGTTGTTACCAGTTTTTGGATACTTCTTCAGAGATTATGTTAATTTCCAATTGGCAATATCCTTACCTGCAATCCTTCTCCTGTCTTACTTTTTTTTGTTACCAGAAACAGGAAGGTGGTTGATAGCAACACAACGTACAGAAGAGGCTATCCAGATTATAGAACGTGTTGCTACAATAAATAAACGACCAACGGAACATATTCGAAAAGATATAGAAACCCATCAAAAACAATTAGAGAACAATAAACTCAAGAAGGGAACTTTATTGGATCTTTTCCGCACTCCAAATCTCAGAAAGAATATATTGGCTATGTCTTTTAACTGGCTAACATGCAGTTATTGTTTCTACGGTGTATCTCAATACGTCGGACAATCATTTTCAGTATTTTTCGCGAGTATTGGAGTAGTGGCTAGTTTTATAGTTTTTGTTGTAGTTTATTTGTATTGTACCGAACTATTTCCTACCGTTGTCCGTAATGCTGCTATAGGATTTTCATCAATGATGGCGAGAATTGGTTCAATGATAGCACCATTCGTAATAGATCTGCGAGACACGGCTGTTTGGCTACCTCCCATAATTTTTGCAATATTCCCACTCGCAGCTGCTATGGTTTCTTTTCTCCTTCCTGAAACTAAAGGTCACGAGCTCATGACCACAATCGAGGAGGGAGAAAGATTCGGTAAAAAAGAACAAAAGTAG

Protein sequence:

>DPOGS203206-PA
MELQRQGDENGSKSNQNEPKLDHIQSAIGSFGKYQFYLCFLIFLSKFPVAFHQMAIIFLAPKAEFTCEGTQIKGTCPCDKPVYDTSIFTNTIITQWDLICKDKWLASLTQTLFQLGTLIGSLLFGMASDRFGRKKPMLFAVLLQVSSGVAAAFAPDYWSFSLLRFIVGMSVGGTMVVGFVIIMEYVGAKYRDIISALYQAPFNMGHMLLPVFGYFFRDYVNFQLAISLPAILLLSYFFLLPETGRWLIATQRTEEAIQIIERVATINKRPTEHIRKDIETHQKQLENNKLKKGTLLDLFRTPNLRKNILAMSFNWLTCSYCFYGVSQYVGQSFSVFFASIGVVASFIVFVVVYLYCTELFPTVVRNAAIGFSSMMARIGSMIAPFVIDLRDTAVWLPPIIFAIFPLAAAMVSFLLPETKGHELMTTIEEGERFGKKEQK-