Monarch geneset OGS2.0

DPOGS209144
TranscriptDPOGS209144-TA2814 bp
ProteinDPOGS209144-PA937 aa
Genomic positionDPSCF300061 - 627550-640397
RNAseq coverage87x (Rank: top 63%)
Annotation
HeliconiusHMEL0101115e-17147.56% 
BombyxBGIBMGA009202-TA2e-16757.41% 
DrosophilaCG7458-PA2e-8132.36% 
EBI UniRef50UniRef50_B0XDF04e-8333.39%Organic cation transporter n=5 Tax=Culicidae RepID=B0XDF0_CULQU
NCBI RefSeqXP_001867673.12e-8633.58%organic cation transporter [Culex quinquefasciatus]
NCBI nr blastpgi|1700647934e-8533.58%organic cation transporter [Culex quinquefasciatus]
NCBI nr blastxgi|1571209652e-8533.89%organic cation transporter [Aedes aegypti]
Group
Gene OntologyGO:00550852.4e-18transmembrane transport
GO:00160212.4e-18integral to membrane
GO:00228572.4e-18transmembrane transporter activity
KEGG pathway 
InterPro domain[176-514] IPR0161962.7e-38Major facilitator superfamily domain, general substrate transporter
[176-374] IPR0058282.4e-18General substrate transporter
[704-829] IPR0117016.7e-10Major facilitator superfamily
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209144-TA
ATGAGACCACGATCTATGGCAAGATATCTGTCTGGAACAGTCAAAGAAATCCTTGGAACTGCCCAATCAGTTGGCTGCACCGTAGAAGGAAGACCCCCTCACGACTTGATTGTTGATATCAATGAGGACCTAGATGAAGTGCTAACAAATGAGTTGGGAGAATTCGGAAAGTTTCAAATTGTAAACCTCCTTCTAGTTTGCTTTCCACTTCTGATTGCAGCATTTCCGAGTGATTACATTTTCAGTGTTGCTGCAATTCCACATCGATGTCGTATACCCGAATGCGGAGAGGAGAATAAACAACAAATTTTTAGCCCTGATTGGATTTCAAATGCGATACCAGAAACTGGTTCAGGATTAGCAAGCTGCGAAAGGTTTGTTCCAAGAGGAAACGGCTCTTTGGACTTCTGTCCAGCCAACATATTCGACCACAATGAAACTGTGGGTTGTAATGATTTTGTGTATGCAAGAGATAATTCAGCCGTGTATGATTTCAATCTGGGTTGTCAAGAATGGTTAAGAGTTTTGGTGGGCACCCTCGGCAATGTTGGTACTCTGCTGGTTCTTCCATTCACTGGGTTCATATCAGATCGGTTTGGGCGCAGACTGGCTTTAGTGATCAGTGCCTTTAATATTGCACTATTTGGCTTAATCCGAGCTTTTTCAACGAGCTATACAATGCTAATAATTACCCAAATTATACAAACAACACTAGGAGGAGGAATTTATAGCTCAGCCTATATATACGGTGCAGAACTTGTAGGACCTAACTATCGGGTATCTGCTACCACGGTGATAGGTCTCTTGTTTTCTGTAGGACTCGTATTGCTTGGAATGTTGGCATGGGTCATTCAACAGTGGCGCATCCTTTTGATGGTTCTTTATATTCCGCCTTTTCTTTTAATTAGTTACTACTGGCTTCTGCCAGAGAGTGTTCGATGGTTAATTTCGAAAAAAAAGTTTCTAGACGCCAGAAAAGTTTTAGAGAACGTTGCCAGAATAAACAAAACTAAAATAAGTGAAGAGTCCTTAAAAGCTTTATTAGACTCACCACAATCCGTTGCTGCTCAGAACGACAATATAGGCGTGTTTCTAAAGGTGATCCGATCCCGTGTCTTACTACAACGAGTGTGTACTACCCCAATCTGGTGGATCACAGCAACTTTCGTGTACTATGGACTATCCATTAACTCGACAAATATGTATGTCTTGCGACTTATAATTTATCTCTTGGGTAAATTCTGTATATCAGCTGTACTCGCGGCTGTATATCTTTACACGTCAGAATTGTATCCAACAGAATATCGGCACACTTTACTTGCCTTCTCGTCTATGATAGGTCGGCTTGGATCTATTCTAGCGCCTCTCACCCCTGTACTTTCACTGTATTGGCATGGCATACCCTCGCTTATGTTTGGAGTGATGGGTCTGATATCTGGTCTGCTGGTTTTAACTCAGCCAGAAACACAAAACACCCGGCTACCTGACACGCTGGCGGAAGCCGAGGCTCTTGAAAAGGAAAATAGAAAAGAAAAAGAAAAGAAGCCAGAAACACAGATCGATCACGATGAGACAACTTTTTTGTTTTGTTTGATTATAGAAACTGCAAGGTTTGGGTCTATCGTGTCTGGTACAATGGACAACGATAAGTGTGTCAACGAAAACAAACCCAAAGAGAAACCATTGGATCTTGACGATGTCTTAATAAACGAGTTGGGCCAGTTTGGAAGGTTTCAGCTCAGAAACTTAGCGCTTCTAGCTATACCGCTCATGATGTCCGCATTCATTAACGAATATGTATTCTCGGCTATGGCCGTAAAGCACAGATGTCGTATACCAGAATGTGGGGATACTAATGTAACCCACCAGATCGATCCTTTTTGGCTCAATAATACCGTGCCCCCTTCAGCAACAGGGCTGTCTAGTTGTGAAAGATATGCTCCTAAGAGTATCGATATAATTCCTGGAGACTTCGAAGAAAAAGACACATGTCCTACAACACTATTTGATAACAATCATGTTATTCCCTGTGACGCTGGCTATGTGTATGAGTCTACTAATTCTGTCGTATACGAATTCGATCTTGGATGCCAAGACTGGCTAAGAGCTCTGGCAGGAACATTGAATAGCATCGGCACATTGTTAGTGTTGCCAATCACTGGCTACATATCAGATCGTTTTGGGCGTAGATTGGTTCTTATACTGAATATTTTTAACTTGGGATTGTTTGGTCTATTGAGGGCATTCTCTGTCAATTATACCATGTATCTGATACTGCAAATAGTTCAAACCACCCTAGGATCTGGCACTGTGAGCGCGGGATATATCCTCGCTGCTGAACTTATTGGACCAAAGTATCGTGTGCTGGGAATAGCCACATTATCTTCCATGTTTGCAACCGGACAGATCGTGATGGCAAGCGTCGCATGGCTGACACCATCGTGGCGCGCTATGCTCATTGCTCTTCATATTCCTTATCTTTATGTTCTCCGTTTGATATTCTATCTGCTCGGAAAGTTTAGTATTTCAACGTCCATGACAGCACTGTATCTGTACACTTCAGAACTCTATCCAACTGAGTACCGTCATAGTTTGCTTGCTTTTTCTTCAATGATTGGCCGAATCGGATCAATCACTGCTCCACTCACACCTGTACTTATGAATTACTGGCACGGAATACCGTCCCTGATGTTCAGCGCTATGGGGATCCTGTCGGGTATATTGGTGCTGACACAGCCAGAAACACTTGGCACAAAGTTGCCAGATACCCTGGAAGAGGCTGAATCTCTTGGCAATGATCAAAATATGACTTAG

Protein sequence:

>DPOGS209144-PA
MRPRSMARYLSGTVKEILGTAQSVGCTVEGRPPHDLIVDINEDLDEVLTNELGEFGKFQIVNLLLVCFPLLIAAFPSDYIFSVAAIPHRCRIPECGEENKQQIFSPDWISNAIPETGSGLASCERFVPRGNGSLDFCPANIFDHNETVGCNDFVYARDNSAVYDFNLGCQEWLRVLVGTLGNVGTLLVLPFTGFISDRFGRRLALVISAFNIALFGLIRAFSTSYTMLIITQIIQTTLGGGIYSSAYIYGAELVGPNYRVSATTVIGLLFSVGLVLLGMLAWVIQQWRILLMVLYIPPFLLISYYWLLPESVRWLISKKKFLDARKVLENVARINKTKISEESLKALLDSPQSVAAQNDNIGVFLKVIRSRVLLQRVCTTPIWWITATFVYYGLSINSTNMYVLRLIIYLLGKFCISAVLAAVYLYTSELYPTEYRHTLLAFSSMIGRLGSILAPLTPVLSLYWHGIPSLMFGVMGLISGLLVLTQPETQNTRLPDTLAEAEALEKENRKEKEKKPETQIDHDETTFLFCLIIETARFGSIVSGTMDNDKCVNENKPKEKPLDLDDVLINELGQFGRFQLRNLALLAIPLMMSAFINEYVFSAMAVKHRCRIPECGDTNVTHQIDPFWLNNTVPPSATGLSSCERYAPKSIDIIPGDFEEKDTCPTTLFDNNHVIPCDAGYVYESTNSVVYEFDLGCQDWLRALAGTLNSIGTLLVLPITGYISDRFGRRLVLILNIFNLGLFGLLRAFSVNYTMYLILQIVQTTLGSGTVSAGYILAAELIGPKYRVLGIATLSSMFATGQIVMASVAWLTPSWRAMLIALHIPYLYVLRLIFYLLGKFSISTSMTALYLYTSELYPTEYRHSLLAFSSMIGRIGSITAPLTPVLMNYWHGIPSLMFSAMGILSGILVLTQPETLGTKLPDTLEEAESLGNDQNMT-