New model in OGS2.0 | DPOGS205877  |
---|---|
Genomic Position | scaffold1969:+ 17843-20556 |
See gene structure | |
CDS Length | 1215 |
Paired RNAseq reads   | 1694 |
Single RNAseq reads   | 3923 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000119 (9e-147) |
Best Drosophila hit   | lethal (3) 02640 (5e-94) |
Best Human hit | porphobilinogen deaminase isoform 1 (3e-79) |
Best NR hit (blastp)   | PREDICTED: similar to porphobilinogen deaminase [Tribolium castaneum] (6e-102) |
Best NR hit (blastx)   | porphobilinogen deaminase [Aedes aegypti] (7e-95) |
GeneOntology terms    | GO:0004418 hydroxymethylbilane synthase activity GO:0018160 peptidyl-pyrromethane cofactor linkage GO:0033014 tetrapyrrole biosynthetic process |
InterPro families    | IPR022419 Porphobilinogen deaminase, dipyrromethane cofactor binding site IPR000860 Tetrapyrrole biosynthesis, hydroxymethylbilane synthase IPR022418 Porphobilinogen deaminase, C-terminal domain IPR022417 Porphobilinogen deaminase, N-terminal |
Orthology group | MCL12385 |
Nucleotide sequence:
ATGGAAAGTGAACGGAAAAATGTAGTTCGTGTTGGTTCGAGAAAAAGCGAGTTAGCGCTT
ATTCAAACTAACTTTGTGATAGACAGCTTAAAGAAAATCTATCCAGATAAAGAATTCACC
ATAGTTTCGATGACAACATTAGGTGACAGAGTGTTAGATATCTCGTTACCAAAAATAGGT
GAAAAATCGTTATTCACTAAAGACCTAGAGGAAGCCTTAAGGAACAACACTGTCGATTTT
GTTGTGCATTCGTTAAAAGACTTGCCAACTACCTTACCAGAGGGTCTTGCTATTGGCGCT
GTGTTTGAAAGAGAAGATCCCCGAGATGCTCTCGTACTAAGAGAAGACATCAAAGAAGCG
ACACTCAGTGCTTTGCCGGCTGGATCTATCATAGGAACATCATCTTTACGTCGAACAGCA
CAGCTTAGAGGGAGTTATCCCGAACTGTCGGTGCAAGATGTCAGAGGAAACCTTAATACA
AGATTGAAGAAATTAGATAGCGGAGCATATTCTGCATTACTGCTAGCGACTGCCGGATTA
GAAAGAATGGGCTGGGAAAAACGAATCACTAAGATTCTTCCGTGTTCTGAGATGATGTAC
GCTGTAGGTCAAGGTGCCCTCGCAGTGGAATGTCGGTCGGATAATGCTGAAATTTTAACA
TTATTATCCCCGTTCAATCATGTAGAGACATATTGTAGAGTATTGGCCGAGAGGAGCTTC
TTGAAAACATTGGGTGGTGGTTGCAGTGCACCAGTCGGTGTGTCAACAAAGTTAAAAGCT
TTGGATTCTGATTGGAAACTAAGTATAACAGGTGGAGTATGGAGTTTGGATGGAAAAACA
AAAGTAACGGACACATTGGAAAAGACATTTACACAAATTAAAAAGTCACAAAAACACAAA
CTAAGTCCTACTGAAGATAATATGAACAAAAAAATTAAAATTGATGATAATAACGACAAT
ATTACGCATCCACTAGCTGAATTAGACAATATAATAGAAAAGAACAACGGAAATTTAAAT
TGTGAGGAATCATCCAAGGAGATAACATGCAGGCTATTCTGCGGTCTGATTGAAAATAAT
AATATACCAGTAGACGTGATTATGAAATGTGAAGATCTTGGCAAAGAATTAGCTAATAAT
TTAATAACAAACGGTGCTTTGGATGTTATGAAAGTAACACAGGATCTTATAAGGAATTCG
ATAAAAAGTTCATGA
Protein sequence:
MESERKNVVRVGSRKSELALIQTNFVIDSLKKIYPDKEFTIVSMTTLGDRVLDISLPKIG
EKSLFTKDLEEALRNNTVDFVVHSLKDLPTTLPEGLAIGAVFEREDPRDALVLREDIKEA
TLSALPAGSIIGTSSLRRTAQLRGSYPELSVQDVRGNLNTRLKKLDSGAYSALLLATAGL
ERMGWEKRITKILPCSEMMYAVGQGALAVECRSDNAEILTLLSPFNHVETYCRVLAERSF
LKTLGGGCSAPVGVSTKLKALDSDWKLSITGGVWSLDGKTKVTDTLEKTFTQIKKSQKHK
LSPTEDNMNKKIKIDDNNDNITHPLAELDNIIEKNNGNLNCEESSKEITCRLFCGLIENN
NIPVDVIMKCEDLGKELANNLITNGALDVMKVTQDLIRNSIKSS