Monarch geneset OGS2.0

DPOGS209143
Transcript: DPOGS209143-TA  1497 bp
Protein: DPOGS209143-PA  498 aa
Genomic position: DPSCF300061 - 648477-658455
RNAseq coverage: 137x (Rank: top 55%)
Annotation
Heliconius: HMEL014792  0.0  73.58%
Bombyx: BGIBMGA009199-TA  8e-124  71.38%
Drosophila: CG7458-PA  3e-89  35.00%
EBI UniRef50: UniRef50_B0XDF0  2e-103  41.01%  Organic cation transporter n=5 Tax=Culicidae RepID=B0XDF0_CULQU
NCBI RefSeq: XP_001659801.1  3e-104  40.37%  organic cation transporter [Aedes aegypti]
NCBI nr blastp: gi|157120965  6e-103  40.37%  organic cation transporter [Aedes aegypti]
NCBI nr blastx: gi|157120965  3e-104  40.45%  organic cation transporter [Aedes aegypti]
Group
Gene Ontology: GO:0055085  3.2e-21  transmembrane transport
               GO:0016021  3.2e-21  integral to membrane
KEGG pathway:
InterPro domain: [131-498]  IPR016196  1.8e-41  Major facilitator superfamily domain, general substrate transporter
                 [132-474]  IPR011701  3.2e-21  Major facilitator superfamily
Orthology group: MCL25587  Lepidoptera specific
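
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the 1497 bp transcript is a coding sequence that includes its stop codon, and that the genomic coordinates are 1-based and inclusive (neither is stated in the record). The helper names `parse_position` and `cds_matches_protein` are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1497
PROTEIN_AA = 498
GENOMIC_POSITION = "DPSCF300061 - 648477-658455"


def parse_position(text):
    """Split 'SCAFFOLD - START-END' into (scaffold, start, end)."""
    scaffold, coords = text.split(" - ")
    start, end = (int(x) for x in coords.split("-"))
    return scaffold, start, end


def cds_matches_protein(cds_bp, protein_aa):
    """True if the CDS is exactly protein_aa codons plus one stop codon."""
    return cds_bp == 3 * (protein_aa + 1)


scaffold, start, end = parse_position(GENOMIC_POSITION)
span_bp = end - start + 1  # assumption: 1-based, inclusive coordinates

print(scaffold, span_bp, cds_matches_protein(TRANSCRIPT_BP, PROTEIN_AA))
# prints: DPSCF300061 9979 True
```

Under these assumptions the gene spans 9979 bp on the scaffold, several times the transcript length, which would be consistent with a multi-exon gene model.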
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209143-TA
ATGAAGAAAAATAAGTTGAATGTGGATGATATTTTAAAGGAAGTCGGTGATCTGGGAACCTTCCAGATAAGAAATTTTGTTTATATTTTGTTTGCTATTATATTTTGTGCCTTCTACAACACCCTGTATTTATTCACAGCATCTGCCTCTGTAGCCAGATGTCGAGTCCCGGAATGCGAGGCCGATCCACCCATTTTCGATACCAAGCATTGGGGAGCGTGGGCCCTCCCAGATAGTCGAGGGCGCTGCGAGAGATTCCTTCCCATTGGAAACACTTGCGCCGCGGGATCCTTTCATCCGACGGAGACAAAGAGATGCTCCAGCTGGATCTATGAGAATCATGATAGTATTGTCTCGACGTTCGACCTCGCCTGTGAGGAGTGGAAGCGGACCCTAGTGGGTACTATACACAGCGCAGGATTGTTCATAGCCCTTCCCCTGACCGCTTTTATATCTGACAATTTCGGACGTCGCATCGCTTTTATCGCGACGGCTGTGGCCCCGGCTCTTGTGGGTTTAGGACGATCCTTCACTCAAGACTACGTCTCATATGTGGCTCTCGAATTCCTGGATGCCGTCGTGGGGGCCGGAGTATACAGTTCCGGATTTATTCTTGCTCTAGAAATGATGGGTCTTCATCGGCGTGTTCTTGGAGGAAACATAATTTCCTGCACTTTTGCCATCGGACAAGCTATAGTCGCTTTAATAGCCTGGGCCATACCCGAGTGGAGAACTCTCACGAGAGTGCTATACGCGCCCTCCTTACTCTTTATATTTTATATATTTTTAATTGAAGAAAGTGTAAGATGGCTTCTGAGCAAGGGGAAGAAGAAGGAAGCAGCAAGAATAATCTTTAAAGTGGCTGCTACCAATAAAAGAAAACTGTCACCAGAAACTATCAAGCAACTGACAGATGAGTCGGCTGAACAGGAAGAGGAGAAACCATCGCTGGGAGACATTAATGATCAACCGTCTCTCGCGCTGCAAGTGCTCAAGTCGCGAGTGATCATGATAAGATTATGCATTTGTTCCTTTTGGTGGATAACTGTGACGTTTATTTATTACGGCCTATCTATAAACTCAGTGTCATTGGCGGGGAACAGCTATGTTAATTATATTTTAACAGCTTTGGTAGAAATCCCTGGTTATTGCATCAGTGTGTTGACTCTAGACAGATTCGGTAGAAAAAGTTCTATTATGACAGCGTTCTTTATTTGTGGAATCTCTTTAGTATGTCTGCCGTTCATACCAGGACATGTGCAATGGGTCCAAACGTGTCTCAACTTGTTGGGCAAATTGTGCATTAGCATGGCTTTTAGCAGTATATATATATACACTTCGGAACTGTATCCAACGATGATCAGACAGAGTTTGCTGTCTCTGTGTTCAGTCTGTGGAAGAATCGGACAGATTGTGGCCCCCCAAACGCCACTCTTGGTTTGTATATTATACTATATTTACTTACCCGTAGATAACAAAAGTGAATCAGTAAATTGA

Protein sequence:

>DPOGS209143-PA
MKKNKLNVDDILKEVGDLGTFQIRNFVYILFAIIFCAFYNTLYLFTASASVARCRVPECEADPPIFDTKHWGAWALPDSRGRCERFLPIGNTCAAGSFHPTETKRCSSWIYENHDSIVSTFDLACEEWKRTLVGTIHSAGLFIALPLTAFISDNFGRRIAFIATAVAPALVGLGRSFTQDYVSYVALEFLDAVVGAGVYSSGFILALEMMGLHRRVLGGNIISCTFAIGQAIVALIAWAIPEWRTLTRVLYAPSLLFIFYIFLIEESVRWLLSKGKKKEAARIIFKVAATNKRKLSPETIKQLTDESAEQEEEKPSLGDINDQPSLALQVLKSRVIMIRLCICSFWWITVTFIYYGLSINSVSLAGNSYVNYILTALVEIPGYCISVLTLDRFGRKSSIMTAFFICGISLVCLPFIPGHVQWVQTCLNLLGKLCISMAFSSIYIYTSELYPTMIRQSLLSLCSVCGRIGQIVAPQTPLLVCILYYIYLPVDNKSESVN-
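
As a quick consistency check between the two sequences above, the first and last codons of DPOGS209143-TA can be translated with the standard genetic code. This is a minimal sketch, not the pipeline used to build the geneset: the `translate` helper is hypothetical, only two short excerpts of the transcript are used, and stop codons are written as `*` here, whereas the record writes the terminal stop as `-`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence in frame 0; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the nucleotide sequence above.
first_30 = "ATGAAGAAAAATAAGTTGAATGTGGATGAT"
last_18 = "AGTGAATCAGTAAATTGA"

print(translate(first_30))  # MKKNKLNVDD
print(translate(last_18))   # SESVN*
```

Both results agree with the start (`MKKNKLNVDD`) and end (`SESVN-`) of the protein record.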