Monarch geneset OGS2.0

DPOGS209699
Transcript        DPOGS209699-TA  1926 bp
Protein           DPOGS209699-PA  641 aa
Genomic position  DPSCF300309 + 72892-88591
RNAseq coverage   140x (Rank: top 55%)
Annotation
Heliconius      HMEL009840        2e-119  42.41%
Bombyx          BGIBMGA001601-TA  6e-138  73.97%
Drosophila      CG7458-PA         5e-89   35.20%
EBI UniRef50    UniRef50_B0XDF0   1e-95   38.29%  Organic cation transporter n=5 Tax=Culicidae RepID=B0XDF0_CULQU
NCBI RefSeq     XP_001867673.1    2e-98   38.35%  organic cation transporter [Culex quinquefasciatus]
NCBI nr blastp  gi|170064793      4e-97   38.35%  organic cation transporter [Culex quinquefasciatus]
NCBI nr blastx  gi|170064793      1e-96   38.68%  organic cation transporter [Culex quinquefasciatus]
Group
Gene Ontology   GO:0055085  2.3e-29  transmembrane transport
                GO:0016021  2.3e-29  integral to membrane
                GO:0022857  2.3e-29  transmembrane transporter activity
KEGG pathway 
InterPro domain  [1-626]    IPR016196  3.2e-57  Major facilitator superfamily domain, general substrate transporter
                 [227-612]  IPR005828  2.3e-29  General substrate transporter
Orthology group  MCL27603  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209699-TA
ATGGATCCAGAAAAAAATAAAAACGATGTGACGGTTGACGATATATTAGAAATATTGGGTCCGTATGGAAAATACAATATTTACAACTACATACTAATACTGTTCCCCGTCTTGCTGTCTGGGATGTATTCGTCTGTATACAACTTCGAGGCCATGGATATAAGATACAGATGCTCGACACCAGAATGCGAGGATCCCAACGTTAATTTCACAGTACCACATCTTGAAGACGGTACACCAAGTAAATGTCTGAAATACGCCACATTTTCCAACCTGACCGATACCAACGACACTTGCTCGCCACATTACTTCGACAAGTCAACTGAGGTTGAATGCGATTCTTTCATGTATTTCGAGGAGCACTCCATTGTTAAAGAGATAGCGATGATTAAATTCTTAGATATTTTTTTCATTCATAATGAAGTCCTGTGGCTTGAATTAGAAAAATTGATAATTAGAAGATGCTCGACACCAGAATGCGAGGATCCCAACGTTAATTTCACAGTACCACATCTTGAAGACGGTACACCAAGTAAATGTCTGAAATACGCCACATTTTCCAACCTGACCGATACCAACGACACTTGCTCGCCACATTACTTCGACAAGTCAACTGAGGTTGAATGCGATTCTTTCATGTATTTCGAGGAGCACTCCATTGTTAAAGAGTTTAATTTGGGATGTCAGCAATGGAAACGTTCCCTCGTGGGTACAATCCACAACGCAGGATTCTTCGTCTCCATCCCTCTGACGGGTCTTATATCAGACAAATTCGGGAGACGGTGGGCACTTGTGTTCGCGAGTGTATCGAACGGTGTGTTCGGTGTCATTCGGGCATTTTCTGTTAATTATAACATGTTCATAATTCTTGAGTTTCTTGAGCCGGCGCTCGGCGGTGGAGTTTATACAGCGTGTTTCGTACTCGCCCTGGAACTGGTGGGTCCACGTGGCAGAGTGCTTGTAAGTCTTCTGTTCAACATGATGTTCATCCTGGGAGGGGTGTCGGTGACTATACTTTCCTGGTGGCTTCAAAGCTGGAGATACCTTCTCTTAGTTATCTATATTCCAGCCATCCTAGTGTTCTTTTATTTATGGTCACTGTCTGAGAGCTTCCGTTGGCTGTTCAGCAAAGGTCGTTATGAGGAAGGTTTGAACGTCCTCGCGAAGTTTGAGAAGGTCAACAACGTGACGGTGCCCAAAGAGCATTACGATTTGGTAGAAAAAGCAGCCATGAAACAAAGAGATCAGAGGAAATTCGAGGAACAGCTTACAAATCAGTCTACCTTCAAACAGCTTTTAATGTCACCAATGATATGGAAGAGACTGTTCACTTCGTCCTTCCTCTGGTGTTCATCCACATTGGTTTATTACGGTCTTTCGATTAATGCCATCGAATTATCAGGCAATAGTTATCTTAATTACATAGTTGTGATTGCAATTGAGGCCCCGGCGAACATTTGCAAGCTCATCTGTCTAGATCGTTTTGGAAGGAAACGGGTTATAGCTATAGCATATATGATGGCCGGCATAATTCTCATCTGTTGCGGGTTCATACCAGGTGGCAATTGGTCGACCGCTTTTTATTTAGGCGGTAAATTCTTTATAACGCTCGCCTACAACTCGCTTTATATTTTTGCTGCTGAGATTTTCCCAACCAATTATAGAACAACATTGTTAGCGATTTGCTCCACTCTTGGCAGAATCGGTTCTGCGGTTGCTCCCCAAACTCCATTATTATCACACGTATATCAATTCCTGCCAACTCTTGTCTTCGGTATAATGTCAGCGATGTCCGGTTTACTAGTGCTCACGTTACCAGAGACAAACAAACAGAAATTACCTGATTCACTCAAAGAAGCGCAGGACCAAGATAGAAAAATAGATGCACATGAGAATGAGACAGTTGAAAATAAAACACGGCTTTAA

Protein sequence:

>DPOGS209699-PA
MDPEKNKNDVTVDDILEILGPYGKYNIYNYILILFPVLLSGMYSSVYNFEAMDIRYRCSTPECEDPNVNFTVPHLEDGTPSKCLKYATFSNLTDTNDTCSPHYFDKSTEVECDSFMYFEEHSIVKEIAMIKFLDIFFIHNEVLWLELEKLIIRRCSTPECEDPNVNFTVPHLEDGTPSKCLKYATFSNLTDTNDTCSPHYFDKSTEVECDSFMYFEEHSIVKEFNLGCQQWKRSLVGTIHNAGFFVSIPLTGLISDKFGRRWALVFASVSNGVFGVIRAFSVNYNMFIILEFLEPALGGGVYTACFVLALELVGPRGRVLVSLLFNMMFILGGVSVTILSWWLQSWRYLLLVIYIPAILVFFYLWSLSESFRWLFSKGRYEEGLNVLAKFEKVNNVTVPKEHYDLVEKAAMKQRDQRKFEEQLTNQSTFKQLLMSPMIWKRLFTSSFLWCSSTLVYYGLSINAIELSGNSYLNYIVVIAIEAPANICKLICLDRFGRKRVIAIAYMMAGIILICCGFIPGGNWSTAFYLGGKFFITLAYNSLYIFAAEIFPTNYRTTLLAICSTLGRIGSAVAPQTPLLSHVYQFLPTLVFGIMSAMSGLLVLTLPETNKQKLPDSLKEAQDQDRKIDAHENETVENKTRL-