Monarch geneset OGS2.0

DPOGS209426
TranscriptDPOGS209426-TA3048 bp
ProteinDPOGS209426-PA1015 aa
Genomic positionDPSCF300449 - 66397-74681
RNAseq coverage449x (Rank: top 27%)
Annotation
HeliconiusHMEL0155820.074.78% 
BombyxBGIBMGA001808-TA0.064.42% 
DrosophilaCG11665-PA3e-15848.92% 
EBI UniRef50UniRef50_Q7JWI75e-15648.92%CG11665 n=16 Tax=Endopterygota RepID=Q7JWI7_DROME
NCBI RefSeqXP_001655208.15e-16652.36%monocarboxylate transporter [Aedes aegypti]
NCBI nr blastpgi|1571288189e-16552.36%monocarboxylate transporter [Aedes aegypti]
NCBI nr blastxgi|1892337709e-16252.14%PREDICTED: similar to GA11129-PA [Tribolium castaneum]
Group
Gene OntologyGO:00550851.6e-28transmembrane transport
GO:00160211.6e-28integral to membrane
KEGG pathway 
InterPro domain[38-624] IPR0161962.7e-58Major facilitator superfamily domain, general substrate transporter
[84-579] IPR0117011.6e-28Major facilitator superfamily
Orthology groupMCL14722 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209426-TA
ATGGCGTCCATCACTGAGAATGCCGGGGACGAGGCTGAGACGAAGCTATTGGAGAAGAACAGGGTCATTGTCGTCCAAAATAAAAATGACCCCGGGATAGTGTCCGCGAAATCCTCAAAAAAAGTAAAACTGCCAGAAGATGAAGCTACTGTCGGCGGAAGATTTACCATCGGCCCAGCCCCTGAACGTGACTGGGAGTTGGTGCCCCCTGACGGTAGATGGGGTTGGTGTGTGCTCGTGGGCGCAACCCTAGTGAACATTCTCATACCCGGGACTATAAAATCCTTCGGAGTACTGTTAGTTGAGTTCAATGACGTGTTCAATTCGAGTCCAGCGGCGTCGTCCGGGATTGTAGCCTTATGTTACTTCTTATACAGTAGTCTTGGTCCTATATCTTCGATACTGTCCGTGAAGTGGTCGTATAGGACCGTCACCCTCATCGGCGGCTGTTTCGCAGCCTTCGGGATGATCTTCAGCAGTTGGGCGTTTTCAATAACGTACCTGTATTTCAGTTTCGGCGCCATGGTTGGCACGGGCGCCGGTTTAGCCTTCCCGCCCACCGTGTACATAGTGACCTCGTATTTCGTCCGCCTGCGAGGCCTCGCTAACGGCATCTGTATGTCAGGGAGCGCCTTCGGCAGCATCATCCTGCCGCCCATACTGAGGATACTCTTAGAGACGTACGGATACAAAGGCGCCGTGCTCATCCTGGGCGGGATCATGCTGAACGTGTGGGCCGCCGCGCTGCTCTTCCAGCCGGTGGAGGAGCACATGGTGAAGAAGTACAAGGAGGCCGAGGACGACGGGCCCCAAGATGAGATCTTATTCGAAGAGGAGGAACCGATAGACGAATATGACATGACGGTCGTCACCGAGAAGCTGAACGGCCCCAGCGACAGAGATCCCGCGAGGAGCGCGTCCCAAGGTCACGGGAACTCCTCCCAGAACCTGCGCCCGAGCGTGTCCAAACGTAAGCTGTCGTACCCGCGTCCGATGAGCAAGAACACGGTGAGCTCGACGTCGATAGCTGTGGACGGACAGGACCTGAAGAAGGTGGCGTCTCAAGAGACCTTCGCTCGCAGACTGAGCGCCTGCAGGAGGAACGTGTCGACGACCAGCTTCGCGTACATCTCCACGCCCTTCCACGGCTCCACGCTGTCAGCTTTCGAGAGGCCGAACGAGTTCGCGTCACAGTTTTCGTTGAAATCTTTGACGGAGAGCGTGGCGGACGTGGCCTACTGCTGCTTCTGCTGCAAAAAGACGAACAAACCGAGCAAATTCTTCGACGTCACGCTCCTGACCGACCCGACGTACCTGGTGATCCTCATATCCAGTAGCACCGTGGCTATATCGTGCACTAACTTCATCATCCTGCTGCCGTCGCACGCGCAGAACATTGGTTTCGACAAGGCCAAAGGCGCGTACCTCCTCAGCACGGTGTCGGCTCTGGACCTGGTGGGCCGCGTGGGCGGCTCGGCTCTGTGCGATCTCAATATAATGCCCAAAGCGTTTTATTTCGTCGGCGGCCTCATATTCTCGGGTATAACGCTGGCCGCTATACCGTTTCTGGAGTCCTATGTGGCCATCAGCGTTTTCTGCGCGCTGTTCGGCCTGGCCTCCGGGGTGAACAACGGCGTGACCACGCTGGTCATGGCCGAAATACTGGGCGCGGACAGGCTGATGTCCACGTACGGGATCAGTCTGTTCGTGAACGGCCTGCTGCAGCTGGCCGGCCCGCCGATATGCGGCGTCTGGTACCAGCACGACAAGAACTTCGTGCACATGTTCGTCACCTTCGGCATGATCTTCATAGCCGGCGCGAGCTTGTGGCTGTTCATGCCGTTGGAACCGTTCATAGAGGAGGAACCTGATGAGATAACAGAGAATCCGAAACTAACTCTGACACCGGACCAGGACGAGAAGCTACAGAATGGGTCGCCTCCGATGAGGGAGAGGTCGTTTCTGGACGAGAAATTCTCGCCAAACAGAAGCGACTTCGCCAGATCAGCGAGTGTGGCGCACGTGGCTCACGCAGAGAACAGCAGGTTGAGGAAAATATCCACCCCCATCCCGCCGAAACACGGCCTGCAGAGTATATCCAGCAAGATATCGAATCACCTGCCCAGCAACCCGTCGCTGCTGGAGTCGGTGCCCGAAGGGAAACCGAGCCGCGTCAATTCCCAGGAAGCGTTCGGAAAGAAGTTGGTCGTGCCAAAAACACCCAAACGCAGTCCATCGACCTCAAGCTTTCAGTACATGTCCACTCCCTTCCACGGCTCCACGCTGTCAGCTTTCGAAAAACCAAGCGAATTCGCGTCGCAATTCAGCCTAAAGTCCGTAACGGACAGCCTGGCGCCCATATCATATTGTTGTGGCTGCAAAAAGTCCAAAGAGGACGAATCGAAGAAGGAGCCGAGCAAATACTTCGACCTCCAGCTCTTCAAAGACCCGATATATCTCGTCATTCTGATATCGAATTGCACTTGCGCGATATCGTATACGAATTTCATAATTCTCGTGCCGTCGTACGCTAAAGAGTGCGGTTTTGATAAGGCTCGCGGCGCCTACCTCCTATCTATCATATCCGCGTTGGATTTGGTCGGTAGGATCGGCGGGTCGGCGTTATCCGACGTGGTCACGACCCCGAAGCGTTACTTCTTCATCACCGGACTCCTGTTGTCCGGGATCAGTCTGGCCATGATACCGCTGGTGACGACTTATTCCGCTATCAGCGCTTTTTGCTGTATCTTTGGGATAGCGTCCGGTATCAATGTGGGTGTCACGGCCTTGGTGATGACAGAGATGCTCGGCACCGAGAGGCTCATGAGCTCGTACGGTATAAGTTTGTTCATGAACGGTATCCTGCAGCTGGTCGGGCCACCGGTCTGTGGGATTTGGTTCGAATATACCAAATCTTACAGGTCCTTGTTCGTGACGCTTGGCTTCATTTTGGTTTTTGGGGCTAGCTTATGGCTCTTCGTGCCGTTTATACACAGGCGTAGGAAAAGGGCGCAGGCGCTCAAAGATAACTCGGAAAATAGGGCCTAG

Protein sequence:

>DPOGS209426-PA
MASITENAGDEAETKLLEKNRVIVVQNKNDPGIVSAKSSKKVKLPEDEATVGGRFTIGPAPERDWELVPPDGRWGWCVLVGATLVNILIPGTIKSFGVLLVEFNDVFNSSPAASSGIVALCYFLYSSLGPISSILSVKWSYRTVTLIGGCFAAFGMIFSSWAFSITYLYFSFGAMVGTGAGLAFPPTVYIVTSYFVRLRGLANGICMSGSAFGSIILPPILRILLETYGYKGAVLILGGIMLNVWAAALLFQPVEEHMVKKYKEAEDDGPQDEILFEEEEPIDEYDMTVVTEKLNGPSDRDPARSASQGHGNSSQNLRPSVSKRKLSYPRPMSKNTVSSTSIAVDGQDLKKVASQETFARRLSACRRNVSTTSFAYISTPFHGSTLSAFERPNEFASQFSLKSLTESVADVAYCCFCCKKTNKPSKFFDVTLLTDPTYLVILISSSTVAISCTNFIILLPSHAQNIGFDKAKGAYLLSTVSALDLVGRVGGSALCDLNIMPKAFYFVGGLIFSGITLAAIPFLESYVAISVFCALFGLASGVNNGVTTLVMAEILGADRLMSTYGISLFVNGLLQLAGPPICGVWYQHDKNFVHMFVTFGMIFIAGASLWLFMPLEPFIEEEPDEITENPKLTLTPDQDEKLQNGSPPMRERSFLDEKFSPNRSDFARSASVAHVAHAENSRLRKISTPIPPKHGLQSISSKISNHLPSNPSLLESVPEGKPSRVNSQEAFGKKLVVPKTPKRSPSTSSFQYMSTPFHGSTLSAFEKPSEFASQFSLKSVTDSLAPISYCCGCKKSKEDESKKEPSKYFDLQLFKDPIYLVILISNCTCAISYTNFIILVPSYAKECGFDKARGAYLLSIISALDLVGRIGGSALSDVVTTPKRYFFITGLLLSGISLAMIPLVTTYSAISAFCCIFGIASGINVGVTALVMTEMLGTERLMSSYGISLFMNGILQLVGPPVCGIWFEYTKSYRSLFVTLGFILVFGASLWLFVPFIHRRRKRAQALKDNSENRA-