DPGLEAN00087 in OGS1.0

New model in OGS2.0DPOGS215049 
Genomic Positionscaffold498:+ 4061-10050
See gene structure
CDS Length3561
Paired RNAseq reads  1240
Single RNAseq reads  2986
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005680 (1e-145)
Best Drosophila hit  CG42331, isoform B (8e-143)
Best Human hitperoxidasin homolog precursor (2e-77)
Best NR hit (blastp)  PREDICTED: similar to oxidase/peroxidase [Nasonia vitripennis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to oxidase/peroxidase [Nasonia vitripennis] (0.0)
GeneOntology terms


  
GO:0004601 peroxidase activity
GO:0020037 heme binding
GO:0055114 oxidation reduction
GO:0006979 response to oxidative stress
InterPro families

  
IPR002007 Haem peroxidase, animal
IPR010255 Haem peroxidase
IPR019791 Haem peroxidase, animal, subgroup
Orthology groupMCL16605

Nucleotide sequence:

ATGTGTGAACCGGCGGCCGAAGGAAACGTTGACAGTGACAATCTTGAGAGTCGATCGGAA
GAAAACCAGTTTCCTGATGCGAGCTCCCTGCGCCAGTCCCCACTAGTTCAACAATGTCCC
CTTCGAGGCGCTCCCAAATGTCCGCCAGCATCCAAAAGATATCGCACTCACGATGGTACT
TGCAACAACCTGAGTCGGCCTCGCTGGGGTGCCACCATGACCCCGGTACAGCGGTTCCTT
CCACCAGTTTACTCCGATGGTATTCAAGCACCAAGGAAATCAATTTTCGGCTCTACCTTA
CCTTCAGCCAGAGAAATCAGCGCGCTAGTACACGAGGACAAGGATTCTGAAAATTCTGGA
ATAACGCATTTGCTCATGCAGTGGGGTCAATTTCTGGACCACGACATAACGTCTTCCTCC
CAGTCCCGAGGGTTCAACGGTTCAGTGCCGCGGTGCTGCAAGGACGGAGGAAGAGACTTC
ATACCTCAAGAGTTCATGCACCCGGAATGTCTTCCGATCGCCGTCCCACCCTCGGATCCC
TTCTACGGTCCCCGCGGTGTTCGTTGCCTGGACTTCGTTCGATCGTCTCCGGCGCCGCGG
GAGGACTGCGCCCTCGGCTGGAGGGAACAGTTCAACCACGTGTCCTCGTACATCGACGGA
TCACCACTTTACGCCAGCTCCGCGAGACAGTCGGACAGGTTGAGACTGTTCAGGAATGGT
ATGCTGCAGTATGGGCGGGTGCAGCAGCGCCGTCCTCTGCTGCCGGCTGAACGTGATGAG
TTGTGCCGCGGGGGCGCTCTATCCACGGACTGCTTCAAGTCGGGGGACGCGCGGGTCAAT
GAACACCCCGGTCTCGTCGCCAAACACATCGTCTGGCTCAGACAACATAACCGAATGGCC
CAGGAACTGGCGCACCTTAACCCTCACTGGAGCGATGAAAAAATTTATCAGGAAACTCGA
AAAATAGTGGGAGCTATGATACAACACATTACTTACAGGGAATTCCTACCGATCGTTTTG
GGTCCTGAAGTGATGCGTCTTTTTGAGCTGGAGCTTCTTCCGAAGGGCTATTTCAAGGGC
TACAGCGCCAAGACCAATCCGAACCCAGCCAGTTCTTTCGGTACAGCCGCTTTCCGCTTC
GGACACAGTCTGGTTCAGTCGTCGATGATGCGCTTTGATAAGTTCCACAGACCGATCAAT
AACAACGTTTCCCTCCACGCGGAGCTTACAAACCCGTCCAACATCTGGAGCGTGGGTGCC
GTGGACCGACTGCTGCTGGGGATGCTGAACCAGCACATACAGAAGAGGGACGAATTCATT
ACGGAAGAACTCACCAACCACTTGTTTCAAACCAATCACTTCAACTTTGGGATGGATTTG
GCTGCTATTAACATCCAGCGAGGAAGAGACCACGGGGTGCCGCCGTATACCGCGTGGAGG
GAGCCCTGCGGACTGACGCCCATCACGGACTTCGATGACTTAGTGAGGGTGATGCCGGCA
CGGGTCGTGAGGAAGTTGAAGGTGTTATACAGGAACCTCAATTCGACGTCGTGGATAAGA
ATTGCAAATTTCAAGAACTTCAACTTTGGGATGGATCTGGCTGCTATTAACATCCAGCGA
GGAAGAGACCACGGGGTGCCGCCGTATACCGCGTGGAGGGAGCCCTGCGGACTGACGCCC
ATCACGGACTTCGATGACTTAGTGAGGGTGATGCCGGCACGGGTCGTGAGGAAGTTGAAG
GTGTTATACAGACACGTGGATGACCTGGACCTGTTCACGGGCGGCGTGTCCGAGCGCCCC
GTGGCAGGCGCCCTCGTCGGGCCGGTGTTTGCATGCATCATAGCTCAACAGTTCGCAAAC
TTACGGAAAGGGGATCGCTTCTGGTACGAAAATGGTGGTTTCGATTCATCTTTCACTCCG
GCTCAATTGCAACAGATAAGACGAATATCTCTGTCACAGGTCCTTTGTAGTACTCTGGAC
TCAATAGATAACATACAACCTTTCGCTTTCCTCTCACATGAAAATCCAAAAAATGACAGG
ATATCATGCCGTAATGGCTTACTTAACAATTTTGACCTATCCGCTTGGATCGAATTACAT
TCAGACTCAAATAATATTAAGAAATCTGACGAAAATCAACAGAGTTCTAAGACGAAAACC
AAGCGAACAACCGTGAGACCGACCACGACAACTAAACATCCCCAAAAACTATCTCAAACT
TTGACTCAACAAAAATCTGAAAAATTTAGACTTAATAATATGACAGAGACCGATGACACT
GATAAAGACAAAGACAAAAACGCAGACGATGAACCGACTGGGATCAAACCTAACGCTACA
GTAGTTATAGACGATAAACTAGACTTTAGAAACAAATCACGACGCTTTACCGACTTCGAC
GACGAGAGAAACCCCCCGACTAGACAATACAATGACTATTATGATGACGTACAAAGCGTA
CAATCAGTTGTTATCAACAACATACCAAATAACAGACCAAACAGACGACCTTACATATCC
GTTACTGAGAATATTGCTGACAAATACACGTATCTTATTAACTATGTTCCCCGACCGACT
CACTCCTGGCGGCAGACCACTAGACGTTCTCACGATCGTGACGTTGTTAAAGTCACATAT
CAGACTTACGAAGACACTTACGGCCGTCCAAACAGACCTTACTTCAACAGAGACGAACTT
GACAATGACTTTGAATCGCGGCAACAGAAGCCTGTGACGGAAAGCTTCCAGTCATCAGCC
AGATCGATTGACAACGAAGCACCGACTCCACAGTTGAAGTTATCAACTGAGAGCTCAGTG
CAGACTGAAAAAAACAGACCAATAGATACACAGACAGACAAAACTGACTCAACAACAGAA
AATTTGTACAAACTTTTAACTTTTGGTTATGTAGGAACTTATAAACGAGACAAGATTGTA
AATGATGACACTAAAGACTCGAAAGACAATACGAACAAACATGACTCTGGCGACCATAAC
GTCAGTTTAGACTTCTCGACCGTAGTAAACAATGAGACGGACGATGATGACAAACAAAAC
GTAAAACTTTCAACTTTCATAGTTTACGATACAGCCACTAAACCTTACCTGACCAGCTCA
CAGAGACCGACGAGACGTAACGATGACGAGACCACGGAAAAGAAAGACAAATATTATTTC
ATTCAAAACGTCTTACATAAATACTCTGAAACAAAGAGCGACGACCTCAAGAAAACGAGC
AGCGGAAAAGATAAAAACAACACTGACCAATACATAGGAATCGAGGAGAGGTTAGGCAAC
GACAGCTTGGACGATGACGAGAGACCAGTGAATGTGAGAGCGAAAATAAAATCAAGAAAA
CCATCGAGTTCAGCGAAAACTCCATCGGTCGCTTTTCAAATTATTCCTAGCGAAAACAAT
CCATCACAATGGGCGGTTTATGAGGAGAAAGAAGATCTTTCGGGGCAAATACCACAGATG
CCAAGCATTAAGATCGACCCACACGCTCTACGGGAAGTGCCAAGACCTATGAATTTCGGT
TTTAGAAAACGACACGGATAA

Protein sequence:

MCEPAAEGNVDSDNLESRSEENQFPDASSLRQSPLVQQCPLRGAPKCPPASKRYRTHDGT
CNNLSRPRWGATMTPVQRFLPPVYSDGIQAPRKSIFGSTLPSAREISALVHEDKDSENSG
ITHLLMQWGQFLDHDITSSSQSRGFNGSVPRCCKDGGRDFIPQEFMHPECLPIAVPPSDP
FYGPRGVRCLDFVRSSPAPREDCALGWREQFNHVSSYIDGSPLYASSARQSDRLRLFRNG
MLQYGRVQQRRPLLPAERDELCRGGALSTDCFKSGDARVNEHPGLVAKHIVWLRQHNRMA
QELAHLNPHWSDEKIYQETRKIVGAMIQHITYREFLPIVLGPEVMRLFELELLPKGYFKG
YSAKTNPNPASSFGTAAFRFGHSLVQSSMMRFDKFHRPINNNVSLHAELTNPSNIWSVGA
VDRLLLGMLNQHIQKRDEFITEELTNHLFQTNHFNFGMDLAAINIQRGRDHGVPPYTAWR
EPCGLTPITDFDDLVRVMPARVVRKLKVLYRNLNSTSWIRIANFKNFNFGMDLAAINIQR
GRDHGVPPYTAWREPCGLTPITDFDDLVRVMPARVVRKLKVLYRHVDDLDLFTGGVSERP
VAGALVGPVFACIIAQQFANLRKGDRFWYENGGFDSSFTPAQLQQIRRISLSQVLCSTLD
SIDNIQPFAFLSHENPKNDRISCRNGLLNNFDLSAWIELHSDSNNIKKSDENQQSSKTKT
KRTTVRPTTTTKHPQKLSQTLTQQKSEKFRLNNMTETDDTDKDKDKNADDEPTGIKPNAT
VVIDDKLDFRNKSRRFTDFDDERNPPTRQYNDYYDDVQSVQSVVINNIPNNRPNRRPYIS
VTENIADKYTYLINYVPRPTHSWRQTTRRSHDRDVVKVTYQTYEDTYGRPNRPYFNRDEL
DNDFESRQQKPVTESFQSSARSIDNEAPTPQLKLSTESSVQTEKNRPIDTQTDKTDSTTE
NLYKLLTFGYVGTYKRDKIVNDDTKDSKDNTNKHDSGDHNVSLDFSTVVNNETDDDDKQN
VKLSTFIVYDTATKPYLTSSQRPTRRNDDETTEKKDKYYFIQNVLHKYSETKSDDLKKTS
SGKDKNNTDQYIGIEERLGNDSLDDDERPVNVRAKIKSRKPSSSAKTPSVAFQIIPSENN
PSQWAVYEEKEDLSGQIPQMPSIKIDPHALREVPRPMNFGFRKRHG