Monarch geneset OGS2.0

DPOGS205877
Transcript: DPOGS205877-TA (1215 bp)
Protein: DPOGS205877-PA (404 aa)
Genomic position: DPSCF300339 + 72557-75270
RNAseq coverage: 1235x (Rank: top 10%)
Annotation
Heliconius      HMEL014994              5e-177  73.84%
Bombyx          BGIBMGA000119-TA        5e-159  67.57%
Drosophila      l(3)02640-PA            7e-102  60.87%
EBI UniRef50    UniRef50_UPI00017922DC  1e-98   48.33%  UPI00017922DC related cluster n=1 Tax=unknown RepID=UPI00017922DC
NCBI RefSeq     XP_967479.1             5e-103  49.67%  PREDICTED: similar to porphobilinogen deaminase [Tribolium castaneum]
NCBI nr blastp  gi|91084135             9e-102  49.67%  PREDICTED: similar to porphobilinogen deaminase [Tribolium castaneum]
NCBI nr blastx  gi|307187698            1e-98   51.75%  Porphobilinogen deaminase [Camponotus floridanus]
Group
Gene Ontology    GO:0033014  6.4e-157  tetrapyrrole biosynthetic process
                 GO:0004418  6.4e-157  hydroxymethylbilane synthase activity
KEGG pathway     tca:658398  1e-102
                 K01749 (E2.5.1.61, hemC) maps-> Porphyrin and chlorophyll metabolism
InterPro domain  [7-285]    IPR000860  6.4e-157  Tetrapyrrole biosynthesis, hydroxymethylbilane synthase
                 [9-222]    IPR022417  3.1e-85   Porphobilinogen deaminase, N-terminal
                 [232-287]  IPR022418  3.6e-14   Porphobilinogen deaminase, C-terminal domain
Orthology group  MCL12000  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205877-TA
ATGGAAAGTGAACGGAAAAATGTAGTTCGTGTTGGTTCGAGAAAAAGCGAGTTAGCGCTTATTCAAACTAACTTTGTGATAGACAGCTTAAAGAAAATCTATCCAGATAAAGAATTCACCATAGTTTCGATGACAACATTAGGTGACAGAGTGTTAGATATCTCGTTACCAAAAATAGGTGAAAAATCGTTATTCACTAAAGACCTAGAGGAAGCCTTAAGGAACAACACTGTCGATTTTGTTGTGCATTCGTTAAAAGACTTGCCAACTACCTTACCAGAGGGTCTTGCTATTGGCGCTGTGTTTGAAAGAGAAGATCCCCGAGATGCTCTCGTACTAAGAGAAGACATCAAAGAAGCGACACTCAGTGCTTTGCCGGCTGGATCTATCATAGGAACATCATCTTTACGTCGAACAGCACAGCTTAGAGGGAGTTATCCCGAACTGTCGGTGCAAGATGTCAGAGGAAACCTTAATACAAGATTGAAGAAATTAGATAGCGGAGCATATTCTGCATTACTGCTAGCGACTGCCGGATTAGAAAGAATGGGCTGGGAAAAACGAATCACTAAGATTCTTCCGTGTTCTGAGATGATGTACGCTGTAGGTCAAGGTGCCCTCGCAGTGGAATGTCGGTCGGATAATGCTGAAATTTTAACATTATTATCCCCGTTCAATCATGTAGAGACATATTGTAGAGTATTGGCCGAGAGGAGCTTCTTGAAAACATTGGGTGGTGGTTGCAGTGCACCAGTCGGTGTGTCAACAAAGTTAAAAGCTTTGGATTCTGATTGGAAACTAAGTATAACAGGTGGAGTATGGAGTTTGGATGGAAAAACAAAAGTAACGGACACATTGGAAAAGACATTTACACAAATTAAAAAGTCACAAAAACACAAACTAAGTCCTACTGAAGATAATATGAACAAAAAAATTAAAATTGATGATAATAACGACAATATTACGCATCCACTAGCTGAATTAGACAATATAATAGAAAAGAACAACGGAAATTTAAATTGTGAGGAATCATCCAAGGAGATAACATGCAGGCTATTCTGCGGTCTGATTGAAAATAATAATATACCAGTAGACGTGATTATGAAATGTGAAGATCTTGGCAAAGAATTAGCTAATAATTTAATAACAAACGGTGCTTTGGATGTTATGAAAGTAACACAGGATCTTATAAGGAATTCGATAAAAAGTTCATGA

Protein sequence:

>DPOGS205877-PA
MESERKNVVRVGSRKSELALIQTNFVIDSLKKIYPDKEFTIVSMTTLGDRVLDISLPKIGEKSLFTKDLEEALRNNTVDFVVHSLKDLPTTLPEGLAIGAVFEREDPRDALVLREDIKEATLSALPAGSIIGTSSLRRTAQLRGSYPELSVQDVRGNLNTRLKKLDSGAYSALLLATAGLERMGWEKRITKILPCSEMMYAVGQGALAVECRSDNAEILTLLSPFNHVETYCRVLAERSFLKTLGGGCSAPVGVSTKLKALDSDWKLSITGGVWSLDGKTKVTDTLEKTFTQIKKSQKHKLSPTEDNMNKKIKIDDNNDNITHPLAELDNIIEKNNGNLNCEESSKEITCRLFCGLIENNNIPVDVIMKCEDLGKELANNLITNGALDVMKVTQDLIRNSIKSS-
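
The transcript and protein lengths listed above are consistent with each other: 1215 bp is 405 codons, that is 404 amino acids plus one stop codon (the trailing "-" in the protein sequence). A minimal Python sketch of that check follows. The function names are hypothetical and not part of the database, and the test fragment is only the first five codons of DPOGS205877-TA joined to its final TGA stop codon, not the full sequence.

```python
# Sketch only: helper names are illustrative and not part of OGS2.0 or any library.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def protein_length_from_bp(n_bp: int) -> int:
    """Amino-acid count for a complete CDS of n_bp bases, stop codon excluded."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


def check_cds(cds: str) -> int:
    """Validate start codon, stop codon and reading frame; return the protein length."""
    cds = cds.upper()
    if not cds.startswith("ATG"):
        raise ValueError("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        raise ValueError("CDS does not end with a stop codon")
    return protein_length_from_bp(len(cds))


# Lengths reported in the record: 1215 bp transcript, 404 aa protein.
print(protein_length_from_bp(1215))

# Toy fragment: codons ATG GAA AGT GAA CGG (M E S E R) followed by TGA.
print(check_cds("ATGGAAAGTGAACGGTGA"))
```

Passing the full nucleotide sequence above to `check_cds` should apply the same checks to the whole transcript.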