Monarch geneset OGS2.0

DPOGS203230
Transcript          DPOGS203230-TA    1803 bp
Protein             DPOGS203230-PA    600 aa
Genomic position    DPSCF300035 + 1311391-1321504
RNAseq coverage     159x (Rank: top 52%)
Annotation
Heliconius        HMEL003249          0.0       77.57%
Bombyx            BGIBMGA001313-TA    0.0       67.03%
Drosophila        CG7442-PA           8e-101    36.07%
EBI UniRef50      UniRef50_B0XDF0     6e-109    39.04%    Organic cation transporter n=5 Tax=Culicidae RepID=B0XDF0_CULQU
NCBI RefSeq       XP_001867673.1      2e-112    39.66%    organic cation transporter [Culex quinquefasciatus]
NCBI nr blastp    gi|383863003        4e-115    41.48%    PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
NCBI nr blastx    gi|383863003        2e-114    41.48%    PREDICTED: solute carrier family 22 member 21-like [Megachile rotundata]
Group
Gene Ontology      GO:0055085    1.6e-28    transmembrane transport
                   GO:0016021    1.6e-28    integral to membrane
KEGG pathway
InterPro domain    [1-574]      IPR016196    3.2e-51    Major facilitator superfamily domain, general substrate transporter
                   [192-533]    IPR011701    1.6e-28    Major facilitator superfamily
Orthology group    MCL26533      Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203230-TA
ATGGGCGAATCTAAGGATGTGGATACGGATAGACTGATGAGTGAATTAGGTCAGACTGGTCGGTACCATCTCCGTGTTTATGTGCTGGTAGCGTTAGCAGCTGTTCAGGTTGGATTGCTACACACCACTTATATTTTTCTTGCTGGAGACGTACCATATAGATCTAAAGAAATGGAAAGCGGTGTGGGTTTAGACGTAGATGCATTGATGCAGGAACTGGGTCAGTTCAAAAAGTTTCATTTGTTGAACTATGTGCTGTTATCGCTAGTACCATTTGCACTTAACCATTATGGTGTTAACTATGTCTTCTTAGCAGGTGATGTTCAATACAGATGTCTAGTACCAGAATGTGAAGCAGCAAATACTACCGAATACAGTCCAATTTGGTTAGAAAATGCTTTGCCACCGAATGGTAGAGAAAGACGCTGCAGTGTGAAAGTTCCAATGGAGGACGGATTCTGTCAACCAGATCATTTTTCTGATAAGCTACGACCTTGTGAACAATGGTTATATGAAACACATGATACAATAGTAGCTGAGTTTAATTTAGCATGTCAAGATTGGAAGAGAACCTTGGTGGGCACAATTCATAATATTGGTATGCTCGTGTCTCTGCCTATCTTCGGCTTTATATCAGACCGATGGGGACGCAAACGTTCCTTAATATTAAGTTCCACCTTGTTAGCAATTATAGGCACTATGAAAGCATTTTCTATCTCATATGAAATGTACGTTATAGTGGAATTTCTCGAAACCGTCGCTGGTGCAAGCGCCTTCCCTGCGGCCTATGTCCTCACTATCGAATTGCTAGGACAAAATAAACGTGTTTTGACGACCGCGTTTCTGGGAATTATGTTAGCTCTTGGAGGTATAAGTTTTGCAATGTTTGCAAAGACCTTTCCATATTGGAGAACTTTTATATTAGTGGTCTATCCACCGTCTCTACTTTTTCTCTCATATATATACTTCTTACCTGAAAGCATCAGATGGCTTCTATCGAAAGGACGGAGAGAAGAAGCATTTAAAATAGTAACGAAAGCTGCTAAAATGAACAATGTTACACTTTCCGATGAAACTATCCGTCAATTCACTGTCGAAGAAAAACAAACCAAAGGAGAAAAAACAGAAACAAATGAAGAAGAGACCCAAGGTTTATGGCTGCAGGTTATAAAATCCCCGATTATTATGACTCGTTTGGCGATTTGTTCATGGTGGTGGATAACGTGTACCTTTGTATTTTATGGTCTGGCAATAAATTCTGTGTCTCTGGCTGGAGATAAATACACCAACTACATGTTAGTCAGCAGTGTAGAAGTCATTGCGGTAGTCACAAATGCCTTGGTATTGGATAGAATCGGTAGAAAGAAAACTATGATGATTGCTTATCTTGTATGCGGAGTTTCTTGTGGGTCAATTGCATTTGTACCCAAGAATTTACCCTGGTTGGCAACTGTCTTGTACTTAGTGGGCAAAATAGCTATTACACAGGCTTTTAGCGGCATTTATATGTATACATCGGAACTTTTCCCGACGCATGCGAGGCAATCACTGCTTGGATTTTGCTCTATGATAGGAAGAATTGGATCTATTGTTTCACCTCAGATGCCTTTACTGGCTATTTATATTGAATGGCTGCCATCAGTGCTCTTCGGGGCCACTGCTATAATAGCTGGTGGACTAATGATGACTACCCCTGAAACACTTAACACAAAACTTCCAGATACAATTAAAGAAGCCGAATTTATAGCAAGCAAAAAAGTAAAGCCCAAAAGTACTGACATGCAATTAACATCAAAATTATAA

Protein sequence:

>DPOGS203230-PA
MGESKDVDTDRLMSELGQTGRYHLRVYVLVALAAVQVGLLHTTYIFLAGDVPYRSKEMESGVGLDVDALMQELGQFKKFHLLNYVLLSLVPFALNHYGVNYVFLAGDVQYRCLVPECEAANTTEYSPIWLENALPPNGRERRCSVKVPMEDGFCQPDHFSDKLRPCEQWLYETHDTIVAEFNLACQDWKRTLVGTIHNIGMLVSLPIFGFISDRWGRKRSLILSSTLLAIIGTMKAFSISYEMYVIVEFLETVAGASAFPAAYVLTIELLGQNKRVLTTAFLGIMLALGGISFAMFAKTFPYWRTFILVVYPPSLLFLSYIYFLPESIRWLLSKGRREEAFKIVTKAAKMNNVTLSDETIRQFTVEEKQTKGEKTETNEEETQGLWLQVIKSPIIMTRLAICSWWWITCTFVFYGLAINSVSLAGDKYTNYMLVSSVEVIAVVTNALVLDRIGRKKTMMIAYLVCGVSCGSIAFVPKNLPWLATVLYLVGKIAITQAFSGIYMYTSELFPTHARQSLLGFCSMIGRIGSIVSPQMPLLAIYIEWLPSVLFGATAIIAGGLMMTTPETLNTKLPDTIKEAEFIASKKVKPKSTDMQLTSKL-
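
The transcript and protein lengths listed above can be cross-checked against each other: 1803 bp corresponds to 601 codons, which is 600 amino acids plus one stop codon. The sketch below is a minimal illustration in Python, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing `-` in the protein record marks the stop codon. The `translate` helper and the `head`/`tail` variables are hypothetical names introduced here; they hold the first 30 and the last 15 nucleotides of DPOGS203230-TA as given in the record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Length check from the record: 1803 bp = 3 * (600 aa + 1 stop codon).
transcript_len = 1803
protein_len = 600
assert transcript_len == 3 * (protein_len + 1)

# First 30 and last 15 nucleotides of DPOGS203230-TA, copied from the record.
head = "ATGGGCGAATCTAAGGATGTGGATACGGAT"
tail = "ACATCAAAATTATAA"

print(translate(head))  # MGESKDVDTD
print(translate(tail))  # TSKL*
```

Both fragments agree with the start (MGESKDVDTD) and end (TSKL, then stop) of DPOGS203230-PA. Translating the full nucleotide sequence the same way would allow a complete comparison.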