Monarch geneset OGS2.0

DPOGS211090
TranscriptDPOGS211090-TA1899 bp
ProteinDPOGS211090-PA632 aa
Genomic positionDPSCF300007 - 1132567-1136308
RNAseq coverage81x (Rank: top 64%)
Annotation
HeliconiusHMEL0124844e-11684.75% 
BombyxBGIBMGA002970-TA0.075.92% 
DrosophilaCG9317-PA0.058.63% 
EBI UniRef50UniRef50_Q9VIK20.058.63%CG9317, isoform A n=21 Tax=Endopterygota RepID=Q9VIK2_DROME
NCBI RefSeqXP_001663432.10.059.36%organic cation transporter [Aedes aegypti]
NCBI nr blastpgi|1571352140.059.36%organic cation transporter [Aedes aegypti]
NCBI nr blastxgi|583895710.061.95%AGAP008335-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00550851.1e-34transmembrane transport
GO:00160211.1e-34integral to membrane
GO:00228571.1e-34transmembrane transporter activity
KEGG pathway 
InterPro domain[166-553] IPR0161964.2e-51Major facilitator superfamily domain, general substrate transporter
[162-544] IPR0058281.1e-34General substrate transporter
Orthology groupMCL16879 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211090-TA
ATGCCTAAAACTACGAATCAAACCGACCCATATAATAAAGAAAAGTCACCGGTTAAAGAAAATAACGACGACACCATCGACTTGGATGATGTGCTCCCAAAAATTGGGGAATTCGGAATTTATCAGAAGTTTCTATTGTGGTTTGTGTGCTTGCCGGCCTGTTTGCCGTGTGGTTTCTGTGCATTTAACCAACTCTTCATGACAGACACTCCGGATCACTGGTGCCGGGTTCCTGAATTAAGTAATATGACTTTAGAAAATCGAAAAATGCTGTCCATTCCGAGGAAGGTAGATGGAGATAATAATACAGTCTTCGAGAACTGTGTCCGTTATTCTGTGAATTGGAAGACGATATTAAGTAATGGTGGAACAATAAAAGTGAATGAAAGTTGGCCTGTGGAGCCCTGTCTTGATGGCTATGAATATGATACATCTGAAGTGCAATCGTCCATTGTAATCGATTTTGATTTAGTTTGCGAGTACGATGTATATCCAACGCTTGGTTTAGTAGCACTAAATATTGGTGGACCAATAGGAGTTTACTCTTTCGGTATATTAAACGATAGAATAGGAAGAAAGAAATCCTTCTTTGCCTGTTTGTCAACACTTTTGCTTGGAAGCATGATGACAGCTTATGCACAAGCTTACTGGTTTTGGGTATTGGCTAGGGTCATCGTGGGTCTGACTATTCCAGCAGTTTATCAAATACCTTTTATAATCTCTTTGGAACTAGCAGGACCCAATTATAGATCCTTTGTCACTGTAATGACATGCATATTTTACACTCTGGGTCTCATTCTGCTCTCCGGCGTAACTTATATGTTGCGTGATTGGAAACAACTTGCACTAGCAACGTCAGTGCCATTCTTCTTTTACTATCTATATTGGTTCATATTACCCGAATCTCCGCGCTGGTTGCTTATGATGGAGAGATTAGAAGAAGCAAATAAAATATTACAAACAATAGCAAGGATTAATGGCAAAGAATTACCCGTGGAATACACGGAAAAGATTCAAAGACAAGTTTTAGATCAAAAAGAACATGGATTAAAAGAAACAAAGGCTCCTAGCGTCTTCGCTCTATGCAAAACGCCTAATTTAAGACTAAAAACTATTTTGATCACCCTAAATTGGTGTGCGTCTGAAATGGTTTATGTAGGTTTAAGTTATTATGGACCCTCAATGGGGAGCAACCAGTACATGAGTTTTTTTCTCTCATCCGCTGTAGAAATTCCTAGTTATATTATCTGCTGGATTTTAATGGATAAAGTTGGGCGACGATGGCCACTTTGTCTATCGATGGTCATCAGTGGAATTTTCTGTATGATTACAGTTCTCCTACCAAGCGACGCGGAGACCGAGACTCTTGTGCTCTATTTAATATCAAAATCCCTTATATCTGCGTCATTCTTGATTATATACCCGTATGCTGGTGAGCTGTACCCAACCGAACTGAGGGGTATAGGCATCGGGACGTCAGCTTACATCGGTGGCTTGGGGCTTATTATAATTCCATTTATAAACTATTTGGGTACCAGCAATCTAGTGTTGCCATTGTTTCTAATGGGTGCGTTGTCAGTGGTAGGTGGTATCACAGCTCTACGACTACCAGAAACGTTGCATTGTCCACTTCCGCAAACAGCGGAAGAAGGCGAGGAATTCGGGAAGGATTGGACCTACAAAGACTGTTGCACTTGTGGAGGACCAGGTGGCGTACCGGAGGCAGATTCGTATGAAAACTTGGATCAAATGGAACTAACACAGGCTCCGTTAATGGAAGACACGTTAAGCGATTTACCTATACGAAGGAGTTCAATGAAAAAACTCGTGAGACAGGCGAGCACGTTGGAAACACAGAGGGATATCGACGGAACTTTGAAAATGACCTACTGGTTTTGA

Protein sequence:

>DPOGS211090-PA
MPKTTNQTDPYNKEKSPVKENNDDTIDLDDVLPKIGEFGIYQKFLLWFVCLPACLPCGFCAFNQLFMTDTPDHWCRVPELSNMTLENRKMLSIPRKVDGDNNTVFENCVRYSVNWKTILSNGGTIKVNESWPVEPCLDGYEYDTSEVQSSIVIDFDLVCEYDVYPTLGLVALNIGGPIGVYSFGILNDRIGRKKSFFACLSTLLLGSMMTAYAQAYWFWVLARVIVGLTIPAVYQIPFIISLELAGPNYRSFVTVMTCIFYTLGLILLSGVTYMLRDWKQLALATSVPFFFYYLYWFILPESPRWLLMMERLEEANKILQTIARINGKELPVEYTEKIQRQVLDQKEHGLKETKAPSVFALCKTPNLRLKTILITLNWCASEMVYVGLSYYGPSMGSNQYMSFFLSSAVEIPSYIICWILMDKVGRRWPLCLSMVISGIFCMITVLLPSDAETETLVLYLISKSLISASFLIIYPYAGELYPTELRGIGIGTSAYIGGLGLIIIPFINYLGTSNLVLPLFLMGALSVVGGITALRLPETLHCPLPQTAEEGEEFGKDWTYKDCCTCGGPGGVPEADSYENLDQMELTQAPLMEDTLSDLPIRRSSMKKLVRQASTLETQRDIDGTLKMTYWF-