Monarch geneset OGS2.0

DPOGS203233
Transcript: DPOGS203233-TA, 1635 bp
Protein: DPOGS203233-PA, 544 aa
Genomic position: DPSCF300035, + strand, 1338262-1341320
RNAseq coverage: 18x (Rank: top 80%)
Annotation
Heliconius: HMEL003253, E-value 0.0, identity 60.15%
Bombyx: BGIBMGA009196-TA, E-value 2e-174, identity 54.33%
Drosophila: CG7458-PA, E-value 3e-103, identity 36.69%
EBI UniRef50: UniRef50_B0XDF0, E-value 1e-103, identity 37.38%, Organic cation transporter n=5 Tax=Culicidae RepID=B0XDF0_CULQU
NCBI RefSeq: XP_001659801.1, E-value 3e-106, identity 37.41%, organic cation transporter [Aedes aegypti]
NCBI nr blastp: gi|157120965, E-value 6e-105, identity 37.41%, organic cation transporter [Aedes aegypti]
NCBI nr blastx: gi|157120965, E-value 2e-105, identity 37.55%, organic cation transporter [Aedes aegypti]
Group
Gene Ontology:
  GO:0055085, 1.4e-25, transmembrane transport
  GO:0016021, 1.4e-25, integral to membrane
  GO:0022857, 1.4e-25, transmembrane transporter activity
KEGG pathway:
InterPro domain:
  [1-531] IPR016196, 1.9e-52, Major facilitator superfamily domain, general substrate transporter
  [139-529] IPR005828, 1.4e-25, General substrate transporter
Orthology group: MCL25586, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203233-TA
ATGTCTAGTGAAGAAAACGGTAATGTGGAAAAGGGGAGGACAAATATTGATCTGGACAATATCCTAGTCGAGGAGGTTGGACAAATCGGGAAATATCAAATTGTAACAGTACTCTTGGCTGCGCTTCCAGTTATATTTTCGGCTTTTGCATCTGGTGAATACATTTTTACAACTGCCAGGATACCTTCCAGGTGCCGCATACCACAATGTGATACAGAAAATCCCATTTACGCTCCTGATTGGATTCTTAACGCAGTGCCAGGGACTAGTTTGACTAATTTTGAAAATTGTGAAAGATATGTTAATTCTTCTCAACTGCTATCTTCCACAAATGTGTGTCCAGCAGAACTATTTGATCGAACACAAACTGAACCATGTCAGGATTATGTGTATGAGAATACACTAAGCGTTGTGTATGATTTTAATATGGCTTGCGATGAATGGAAACGATCACAAATTGGTTCAATTCGTACCATTGGAACTTTGCTGGTACTCCCGATTACTGGTTACATCTCAGATCGTTGGGGTCGTCGGGTTGCTTTAACCATAAATGCTTTTAACACTGGTTGGCTTGGTCTAGTTAGGTCGTTCGTTAATTCGTACGAATGGTTTTTAACCTTAGAAGTTATTGAATCGACAATCGGTGCTGGAGCATATTCATCGTGTTATATTTTAGTCACTGAGCTAGTTGGACCTAAATATCGAGTACCAGTGGGAGCAACTATATCTACAATGTTTGCTTTAGGGCAAGTAATTCTTGGTCTTATAGCTTGGGGTGTACCGTCATGGAGATCCCTTACTCAAGTATTATACGCTCCACAACTTCTGGTAGTATTATATTTCTGGATACTTTCGGAGTCGGTTCGTTGGTTAATGAGTAAGGGACGATATGAAGAAGCCGAAGCAATTTTGCAAAAAGTGGCAAAATGGAATAATAAGAAGCTTTCAGACAAATCACTACAGGCATTGAGAGATACAGCTGAAGCTGAGAAATTAATTGTAAAACCTAAAGAACCATGGCTACCGATTTTGGTATTCAGATCCAAGATAATTCTCTCGAGATGCTGTGTTGCTCCTATATGGTGGATAACGAATACCCTGGTCTATTATGGAATGTCCATAAATGCCGTAAATTTATCAGGAAACCGGTATTTAAATTATGTGTATGTAGCAGCTGTAGAAATTCCGGGTTATTGGACAGCGATTCTTTTGCTAGATAGAATTGGAAGGAAACCTGTTCTAATCGCTGGTTACTGGATATGTGCTGCCTGTCAACTAGCATTTGCATTTATTCCTTCTGGCTACCCTACACTGTCACTTATTTTTTATTTACTCGGCAAGTATTGTATTGCTATCGTGATGACATCGGTATATGTGTATACTGCTGAACTATATCCAACCAAGTATCGACACAGTCTCTTCGCTTTCTCTTCTATGCTTGGTCGCCTTGGATCTATTACAGCACCTCTGACACCCGCACTTGCCCTTACAGTATGGGAGAGTTTGCCCTCCGTGCTTTTTGCTTCTTTTGCGCTCTTATCTGGTCTGCTTATATTCACAACGCCTGAAACTTTGGGTACCAAATTACCCGATACTATAAAGGATGCCGAAGAACTCAGCACGAAAAGAAAATAG

Protein sequence:

>DPOGS203233-PA
MSSEENGNVEKGRTNIDLDNILVEEVGQIGKYQIVTVLLAALPVIFSAFASGEYIFTTARIPSRCRIPQCDTENPIYAPDWILNAVPGTSLTNFENCERYVNSSQLLSSTNVCPAELFDRTQTEPCQDYVYENTLSVVYDFNMACDEWKRSQIGSIRTIGTLLVLPITGYISDRWGRRVALTINAFNTGWLGLVRSFVNSYEWFLTLEVIESTIGAGAYSSCYILVTELVGPKYRVPVGATISTMFALGQVILGLIAWGVPSWRSLTQVLYAPQLLVVLYFWILSESVRWLMSKGRYEEAEAILQKVAKWNNKKLSDKSLQALRDTAEAEKLIVKPKEPWLPILVFRSKIILSRCCVAPIWWITNTLVYYGMSINAVNLSGNRYLNYVYVAAVEIPGYWTAILLLDRIGRKPVLIAGYWICAACQLAFAFIPSGYPTLSLIFYLLGKYCIAIVMTSVYVYTAELYPTKYRHSLFAFSSMLGRLGSITAPLTPALALTVWESLPSVLFASFALLSGLLIFTTPETLGTKLPDTIKDAEELSTKRK-
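
The transcript and protein lengths in this record can be cross-checked with a little arithmetic: 1635 bp is 545 codons, which is 544 amino acids plus one stop codon. The following is a minimal sketch of that check, not part of the original record. It assumes that the transcript is coding sequence only, that the standard genetic code applies, and that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the constants are hypothetical names chosen for illustration. Only the first and last few codons of DPOGS203233-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Lengths copied from the record.
TRANSCRIPT_BP = 1635
PROTEIN_AA = 544


def translate(cds: str, stop: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as `stop`."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", stop) for c in codons)


# First 18 and last 12 nucleotides of DPOGS203233-TA, copied from the record.
first_codons = translate("ATGTCTAGTGAAGAAAAC")
last_codons = translate("AAAAGAAAATAG")

# 1635 bp / 3 = 545 codons; one of them is the stop codon.
codon_total = TRANSCRIPT_BP // 3
lengths_agree = TRANSCRIPT_BP % 3 == 0 and codon_total - 1 == PROTEIN_AA

print(first_codons, last_codons, codon_total, lengths_agree)
```

The translated fragments can be compared by eye with the start (MSSEEN) and end (KRK-) of DPOGS203233-PA. Checking the full sequence would mean passing the complete FASTA entry to `translate` in the same way.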