DPGLEAN01816 in OGS1.0

New model in OGS2.0  DPOGS205877
Genomic Position  scaffold1969:+ 17843-20556
CDS Length  1215
Paired RNAseq reads  1694
Single RNAseq reads  3923
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000119 (9e-147)
Best Drosophila hit  lethal (3) 02640 (5e-94)
Best Human hit  porphobilinogen deaminase isoform 1 (3e-79)
Best NR hit (blastp)  PREDICTED: similar to porphobilinogen deaminase [Tribolium castaneum] (6e-102)
Best NR hit (blastx)  porphobilinogen deaminase [Aedes aegypti] (7e-95)
GeneOntology terms
GO:0004418 hydroxymethylbilane synthase activity
GO:0018160 peptidyl-pyrromethane cofactor linkage
GO:0033014 tetrapyrrole biosynthetic process
InterPro families
IPR022419 Porphobilinogen deaminase, dipyrromethane cofactor binding site
IPR000860 Tetrapyrrole biosynthesis, hydroxymethylbilane synthase
IPR022418 Porphobilinogen deaminase, C-terminal domain
IPR022417 Porphobilinogen deaminase, N-terminal
Orthology group  MCL12385

Nucleotide sequence:

ATGGAAAGTGAACGGAAAAATGTAGTTCGTGTTGGTTCGAGAAAAAGCGAGTTAGCGCTT
ATTCAAACTAACTTTGTGATAGACAGCTTAAAGAAAATCTATCCAGATAAAGAATTCACC
ATAGTTTCGATGACAACATTAGGTGACAGAGTGTTAGATATCTCGTTACCAAAAATAGGT
GAAAAATCGTTATTCACTAAAGACCTAGAGGAAGCCTTAAGGAACAACACTGTCGATTTT
GTTGTGCATTCGTTAAAAGACTTGCCAACTACCTTACCAGAGGGTCTTGCTATTGGCGCT
GTGTTTGAAAGAGAAGATCCCCGAGATGCTCTCGTACTAAGAGAAGACATCAAAGAAGCG
ACACTCAGTGCTTTGCCGGCTGGATCTATCATAGGAACATCATCTTTACGTCGAACAGCA
CAGCTTAGAGGGAGTTATCCCGAACTGTCGGTGCAAGATGTCAGAGGAAACCTTAATACA
AGATTGAAGAAATTAGATAGCGGAGCATATTCTGCATTACTGCTAGCGACTGCCGGATTA
GAAAGAATGGGCTGGGAAAAACGAATCACTAAGATTCTTCCGTGTTCTGAGATGATGTAC
GCTGTAGGTCAAGGTGCCCTCGCAGTGGAATGTCGGTCGGATAATGCTGAAATTTTAACA
TTATTATCCCCGTTCAATCATGTAGAGACATATTGTAGAGTATTGGCCGAGAGGAGCTTC
TTGAAAACATTGGGTGGTGGTTGCAGTGCACCAGTCGGTGTGTCAACAAAGTTAAAAGCT
TTGGATTCTGATTGGAAACTAAGTATAACAGGTGGAGTATGGAGTTTGGATGGAAAAACA
AAAGTAACGGACACATTGGAAAAGACATTTACACAAATTAAAAAGTCACAAAAACACAAA
CTAAGTCCTACTGAAGATAATATGAACAAAAAAATTAAAATTGATGATAATAACGACAAT
ATTACGCATCCACTAGCTGAATTAGACAATATAATAGAAAAGAACAACGGAAATTTAAAT
TGTGAGGAATCATCCAAGGAGATAACATGCAGGCTATTCTGCGGTCTGATTGAAAATAAT
AATATACCAGTAGACGTGATTATGAAATGTGAAGATCTTGGCAAAGAATTAGCTAATAAT
TTAATAACAAACGGTGCTTTGGATGTTATGAAAGTAACACAGGATCTTATAAGGAATTCG
ATAAAAAGTTCATGA

Protein sequence:

MESERKNVVRVGSRKSELALIQTNFVIDSLKKIYPDKEFTIVSMTTLGDRVLDISLPKIG
EKSLFTKDLEEALRNNTVDFVVHSLKDLPTTLPEGLAIGAVFEREDPRDALVLREDIKEA
TLSALPAGSIIGTSSLRRTAQLRGSYPELSVQDVRGNLNTRLKKLDSGAYSALLLATAGL
ERMGWEKRITKILPCSEMMYAVGQGALAVECRSDNAEILTLLSPFNHVETYCRVLAERSF
LKTLGGGCSAPVGVSTKLKALDSDWKLSITGGVWSLDGKTKVTDTLEKTFTQIKKSQKHK
LSPTEDNMNKKIKIDDNNDNITHPLAELDNIIEKNNGNLNCEESSKEITCRLFCGLIENN
NIPVDVIMKCEDLGKELANNLITNGALDVMKVTQDLIRNSIKSS
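
The nucleotide and protein listings above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and are not part of the database record. Only the first and last lines of the nucleotide listing are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), written in the
# conventional T, C, A, G ordering of first, second and third codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so a multi-line listing can be
    pasted in directly. Ambiguous bases such as N raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the nucleotide listing in this record.
first_line = "ATGGAAAGTGAACGGAAAAATGTAGTTCGTGTTGGTTCGAGAAAAAGCGAGTTAGCGCTT"
last_line = "ATAAAAAGTTCATGA"

print(translate(first_line))  # MESERKNVVRVGSRKSELAL
print(translate(last_line))   # IKSS
```

The two fragments reproduce the start (MESERKNVVRVGSRKSELAL) and the end (IKSS) of the protein listing. The stated CDS length of 1215 nucleotides corresponds to 405 codons, that is 404 residues plus the terminal TGA stop codon, which agrees with the length of the protein sequence shown.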