Monarch geneset OGS2.0

DPOGS201106
TranscriptDPOGS201106-TA2085 bp
ProteinDPOGS201106-PA694 aa
Genomic positionDPSCF300137 - 403561-410806
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0211429e-12557.66% 
BombyxBGIBMGA013688-TA0.058.41% 
DrosophilaCG3790-PA1e-4834.38% 
EBI UniRef50UniRef50_E2BTA51e-7253.38%Organic cation transporter 1 n=8 Tax=Formicidae RepID=E2BTA5_HARSA
NCBI RefSeqXP_394472.32e-7644.80%PREDICTED: similar to CG9317-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3407132532e-7543.66%PREDICTED: organic cation transporter 1-like [Bombus terrestris]
NCBI nr blastxgi|3407132532e-7443.66%PREDICTED: organic cation transporter 1-like [Bombus terrestris]
Group
Gene OntologyGO:00550854.7e-21transmembrane transport
GO:00160214.7e-21integral to membrane
GO:00228574.7e-21transmembrane transporter activity
KEGG pathway 
InterPro domain[174-671] IPR0161961.3e-59Major facilitator superfamily domain, general substrate transporter
[175-327] IPR0058284.7e-21General substrate transporter
Orthology groupMCL26114 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201106-TA
ATGACCTGGACCCGGGAGTTGGATGAGGCTGCTACCCTGGCGACGAGACAAGGCTACTGGCACATCATTCTTTTCTATCTTCTATGTGGTCTCGCGGCGATACCTACTTCGTTTTTAGTTTTCTCCCAGGTGTTCACGAACGCCACACCAGTCCACTGGTGTGCTCCGATACCGGAAATTGATGAATTGGATCTTCCAGAGGACGTCGTACTGAACCTCACAGTGCCGGGGGTAGATGGAGTTTACGAGTCCTGTCTCACTTACGACATAGATATAAACGCATTAAATAAAACTCTTAAACAAATGACTAACAGATCCAATAACGTACCACAGGGATCAGTACAGAATGTACTTGAAGATAAACAGATAGAGTCTGTAGGAGACAGTATATTGATGTTGAGGAAGGGACGTGTGTCCTGTAAAAATGGTTGGCAATTTCAAAAAGATCTGTACACTAGGACTTTGGTCACGGAGTTTTCGCTGGTTTGCGACAGAGAGTGGTTACCAAGAACAAGCAACACGCTATTCTGGGTGGGATCTATTTTCGGAAACGTATTCTTCGGGTTATTATCTGACAGATACGGGCGCAGACCCACGATTCTACTCATGATAACCCTGGAGGTGCCGCTAGCTATCGCCGCTTCATTCCCAGGGAATTATTGGATGTTCATAGTGTTGAGGGTGGCTGGTGGACTGTTCTTCCCAGCGCTTTACCAGTTACCCTTCATATTAGCTTTGGAGCTGATGCCGCCGAAATGGAGAACTTATGCTGGTATCGTAGTTGGGATGTTGTTTGGAAGTGGTATGTGTCTGCTGGCAGTCCTGGCGTGGCTGTTGAGAGATTGGTTCTATCTATCGCTCGCCTCCAGTGTTCCCTTCGTCCTTTTGTTTGGATATTACTGGCTTATTCCTGAATCACCACGCTGGTTAGTGGGGAGAGGACGTATTGCTGATGCAGAGAAAGTTCTCAGAGAACTTGCTAGAAGAAATGGTATCCGTTTGGACAGTGGCTTTCTAATAGAATTACATAAAAAGGTGAAAAACGATGACGACGAAGCACAAATTACAATCTTACGTAACGGTCAGGTTTTGTCGCCACAAACGAAGGAATTGAAGCAAAGAAAGAATGAAATGATTGCGTATAAACCAACAATCATAATAAGTGAAGTACACGAGGAAGATGATAAAACTGACCACATAGCTGAAGAAGAAACAAGAAATATTAATATTATAAAACTTGCAGACACTAAAAGTGATACAGAAGAAGAGCTGAGCCCCAACGACAGAAATGAAATCCTACAACCAATTGACAAATCAAAAATACATACATTGAGACGAAAATCGATGCAATTGGTGAACAAGATCTTACTGAAAGAAGAGGAAGAGGATATCTTAGAAAGAAATCGGTTAGATGACCAAAACTCCGAAGGAGATTGCAAGGCTTCTCCCCTTGATATATTGAAATATCCTAATTTAAGAAGGAAGTTCTTTTTGCTCACTGTCAATTGGATAGCAATAGGCGTTGTGTACAACGGTCTGAGTTACAATACACCAAACTTGGGCGTGGACGATTATCTGGCATTCTTTATAGGAGGTTTAGTGGAACTGCCATCATATTTCATTGCCTGGAAGTCCATGGAACGGTTCGGGCGGCGGTGGGTGTTGTTCTGCTTTGTCAACGTTGGTGGAGTTGCTTGTTTATGTTGTGCGCTAGTGCCAGAAGCCTGGCCGTGGGTGACAGTGTTTCTGGCGATGCTGGGTCGTCTCTGTGCAGCGGCCTCTTTCTCTGTGTTCTATGTGCACATAGGTGAACTTCTGCCCACAGTTCTAAGAGCACAGGCTATGGCGTGCGCGTCTTTCATAGCCGGAATAGGACTGTTGGCGTGCCCTTATATTGTGTCTTTGGCCTCCTTGTCGAGGGACCTGCCCTTGGTCATCATGGGCGTGTTGAGTTTGTTTGCGGGAGTCATCACATTATTCCTCCCGGAAACTCTCAACCAGCCGTTACCCCAGACCCTGGGGGATGGAGAATTGTTCGGTCGAGATTTTAAAGTACTAAGCTGTGTTGAGGAAAACCTTGAATAA

Protein sequence:

>DPOGS201106-PA
MTWTRELDEAATLATRQGYWHIILFYLLCGLAAIPTSFLVFSQVFTNATPVHWCAPIPEIDELDLPEDVVLNLTVPGVDGVYESCLTYDIDINALNKTLKQMTNRSNNVPQGSVQNVLEDKQIESVGDSILMLRKGRVSCKNGWQFQKDLYTRTLVTEFSLVCDREWLPRTSNTLFWVGSIFGNVFFGLLSDRYGRRPTILLMITLEVPLAIAASFPGNYWMFIVLRVAGGLFFPALYQLPFILALELMPPKWRTYAGIVVGMLFGSGMCLLAVLAWLLRDWFYLSLASSVPFVLLFGYYWLIPESPRWLVGRGRIADAEKVLRELARRNGIRLDSGFLIELHKKVKNDDDEAQITILRNGQVLSPQTKELKQRKNEMIAYKPTIIISEVHEEDDKTDHIAEEETRNINIIKLADTKSDTEEELSPNDRNEILQPIDKSKIHTLRRKSMQLVNKILLKEEEEDILERNRLDDQNSEGDCKASPLDILKYPNLRRKFFLLTVNWIAIGVVYNGLSYNTPNLGVDDYLAFFIGGLVELPSYFIAWKSMERFGRRWVLFCFVNVGGVACLCCALVPEAWPWVTVFLAMLGRLCAAASFSVFYVHIGELLPTVLRAQAMACASFIAGIGLLACPYIVSLASLSRDLPLVIMGVLSLFAGVITLFLPETLNQPLPQTLGDGELFGRDFKVLSCVEENLE-