Monarch geneset OGS2.0

DPOGS216005
Transcript: DPOGS216005-TA (1077 bp)
Protein: DPOGS216005-PA (358 aa)
Genomic position: DPSCF300078 + 492117-493929
RNAseq coverage: 414x (Rank: top 29%)
Annotation
Heliconius: HMEL002240, 7e-138, 78.42%
Bombyx: BGIBMGA000937-TA, 1e-154, 74.86%
Drosophila: Updo-PB, 9e-150, 70.25%
EBI UniRef50: UniRef50_Q9V595, 4e-148, 70.54%, Uroporphyrinogen decarboxylase n=43 Tax=cellular organisms RepID=DCUP_DROME
NCBI RefSeq: XP_002080776.1, 4e-149, 70.54%, GD10665 [Drosophila simulans]
NCBI nr blastp: gi|195332855, 7e-148, 70.54%, GM21135 [Drosophila sechellia]
NCBI nr blastx: gi|195332855, 4e-143, 70.54%, GM21135 [Drosophila sechellia]
Group
Gene Ontology: GO:0004853, 7.3e-150, uroporphyrinogen decarboxylase activity
Gene Ontology: GO:0006779, 7.3e-150, porphyrin biosynthetic process
KEGG pathway: dsi:Dsim_GD10665, 1e-148
KEGG pathway: K01599 (E4.1.1.37, hemE) maps-> Porphyrin and chlorophyll metabolism
InterPro domain: [6-356] IPR006361, 7.3e-150, Uroporphyrinogen decarboxylase HemE
InterPro domain: [12-356] IPR000257, 2e-124, Uroporphyrinogen decarboxylase (URO-D)
Orthology group: MCL13335, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216005-TA
ATGGATTTCAGCAACAAAAAATTCCCTTTATTAAAAAATGATAGACTATTAAGAGCAGCAAGTGGTCAGGAGGTCGATAAAGTACCAGTGTGGGTAATGCGTCAAGCCGGTCGTTACTTACCTGAATTTCAAGAAGTTCGGGCAAAACATGATTTTTTCACCGTTTGCCGTACACCTGAATTGGCTTGTGAAGTAACACTGCAGCCGTTGCGCCGTTTTGAACATTTGGATGCTTCAATCATCTTTAGCGACATTCTTGTTATTCCACAAGCATTAGGAATGACAGTAGAAATGCATCCAGGCCAGGGTCCTGTTTTTCCTAACCCTCTTCAAGATGTGTCTGAAATAAACAACCTTAAAGAAGAAGGTGCAGTATCTAGACTGTCCTATGTTGGAGATGCCATAACATTAACAAGACACAAAATTGAAGGAAAAGTACCGTTAATTGGTTTTACAGGAGCACCATTCACTCTAATGGGATACATGATTGAAGGAGGGGGAAGCAAAACCATGAGTAAAACGAAGGATTGGTTGGAAAAACACCCAAAAGATGTCCATAGACTGCTAGCTTTACTGACCAGAGTTATAATAAACTATTTGGTGATGCAGGTTGAAAGTGGAGCTCAATTACTACAAGTTTTTGAATCCAGTGCAGATCATTTGACCAGAGAGCAGTTCATTGAATTTTCGGCACCCTATCTTAAGGATATAAGTAGTGGTGTAAAAAATATATTAAATGAAAAAAAGATAGATCAAGTACCAATGACAGTTTTTGCAAAAGGAGGTGGTCACTCACTAGATGTTCAGGCAGACTTGGGATATGAAACCATTGGGCTTGATTGGACTGTTGATCCAATTGAAGCAAGGAAGATAGTTGGCGAAAATATAACTTTACAAGGCAATTTAGATCCACAAGACTTATACAAAACACCAGATGAAATAAAAACACTTACTATAGATATGGTAAAGAAATTCGGTAAACACAGATACATTGCAAATTTGGGTCATGGTATAACCCCGCAAACTCCAATTGAGAGCATGACAGTATTTACCGAATCTGTCCATGAAGCTGTATAA

Protein sequence:

>DPOGS216005-PA
MDFSNKKFPLLKNDRLLRAASGQEVDKVPVWVMRQAGRYLPEFQEVRAKHDFFTVCRTPELACEVTLQPLRRFEHLDASIIFSDILVIPQALGMTVEMHPGQGPVFPNPLQDVSEINNLKEEGAVSRLSYVGDAITLTRHKIEGKVPLIGFTGAPFTLMGYMIEGGGSKTMSKTKDWLEKHPKDVHRLLALLTRVIINYLVMQVESGAQLLQVFESSADHLTREQFIEFSAPYLKDISSGVKNILNEKKIDQVPMTVFAKGGGHSLDVQADLGYETIGLDWTVDPIEARKIVGENITLQGNLDPQDLYKTPDEIKTLTIDMVKKFGKHRYIANLGHGITPQTPIESMTVFTESVHEAV-
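
The two sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a complete coding sequence translated with the standard genetic code (NCBI table 1) and that the trailing "-" in the protein record marks the stop codon. The function name translate and its stop_symbol parameter are illustrative, not part of the geneset or any particular library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" here, an assumption made
    to match the trailing character of the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First eight codons and last four codons of DPOGS216005-TA.
head = translate("ATGGATTTCAGCAACAAAAAATTC")
tail = translate("GAAGCTGTATAA")

# Length check: 358 residues plus one stop codon, three bases each.
expected_bp = 3 * (358 + 1)
```

Under these assumptions, the translated ends agree with the start (MDFSNKKF) and end (EAV-) of DPOGS216005-PA, and the expected length equals the 1077 bp reported for the transcript.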