DPGLEAN04832 in OGS1.0

New model in OGS2.0  DPOGS209262
Genomic Position  scaffold431:+ 119835-131423
CDS Length  4014
Paired RNAseq reads  2957
Single RNAseq reads  7484
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA007042 (0.0)
Best Drosophila hit  CG10211 (0.0)
Best Human hit  peroxidasin homolog precursor (3e-89)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC005493 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC005493 [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0004601 peroxidase activity
GO:0006979 response to oxidative stress
GO:0055114 oxidation reduction
GO:0020037 heme binding
InterPro families
IPR002007 Haem peroxidase, animal
IPR010255 Haem peroxidase
IPR019791 Haem peroxidase, animal, subgroup
Orthology group  MCL15210

Nucleotide sequence:

GTTATCTTTATTTGTTTCAGGTTAAAACGAAGTATATTGCTATACGCATTATTTTTAACT
CAAATATTAGCACAGAAAACAAAAACAACAGATGACAATCTACAGACAGTTCCAAGTGAC
CTCAGAAGAGCTGTTAGCGAGGCACTAGCTTTAGAGAGGAGGTTTTTATTAGGAAGTAAC
GACGTGAGGAACTGTTCATACGACGAGATCAGCAATACGCCATGTCCCCCCAGCAAGTAT
CGAAGCGCGAGTGGAGAGTGCAACAACGTTCGACATAGACCATGGGGAAGGCGAGGCGAC
GTGTTCCTTAGATTATTACAACCACACTACGCTGATGGAATATCACAACCTTTCGCAAGT
CCGAAGCTGCCGGAGCCAAGGCTGGCCGTCCAGGCAGTATCGCAGTTGGCTGAATCAGTC
GGTCACGATTACGTAACCAGCTTACTCGCTGCCTGGGGACAGTTTCTTATGGATGATCTC
ATTGCCACCTCTAATCAGAACCAAAAGTGTGAATCTGGTTGCGAATATGTTCGCTCAGCG
CCTACAAGAAATTACGATTCCTGTGGATTTGAATACCGTGATCAGATGAATTTAGCGACA
TCGGTACTAGATGGCTCTGCTCTGTATGGAAATTCTGAAAAAGAGCTGTTGTCTCTCAGA
CTATACGATGCTGGCAAAATGGATATATCCTCCTGCCGAAAATGCAACGAAAATTCACCT
ACCGCACCACTTTATAAGGCTCTCTTGACTGAACACAACCGTATTGCTGGCGAATTATTT
TCGATGAATCCCTTCTGGGAAGATAATGCCCTGTTCTTAGAAGCAAGACGAACAATAGCA
GCAGTTATTCAACATATTACATACAATGAATTTTTGCCTGTACTTTTGGGCGAAGTTGGA
ATGGCAAAAGCAGATTTAAAACTAACCACCCACGGTTTCTGGCGTGGATATTCAAGTGCA
AATCGTGTCGGAGCTTATGCGGAGCTTGTTGCGGTTGCACCAATTTTTAACGCCATGATG
AATGAGAAACTTATAAACACAACAATTCTTCTAAAAGACTTGGTCAAGACCAGCGCTCAT
CAAATCTCAAGATTCGCCTTATCAGCTCAATGGGATCTTAACCGAGCTCGTGACCATGGT
GTACCTTCGTATCTAAAAGTCTTGCAGCTGTGTGATCCCATGGCAAATGTTAAGTCATAC
GCTGAATTTGAAAAATTAGGTTTTGACAGAAGACATCAAGAAATATTCGCTGATATGTAT
AGAAACGCTGAAGATATTGAACTGATGGTAGCTGGAGCCATGGAGAAACCAGCTACTGGC
GCAGTTATTGGGTCTACCTTAGCATGTGTATTAGCACTCCAATTTGGAAACCTAAAGAAA
AGTGATAGGTTTTGGTATGAAAATGACATTCCTCCATCATCATTTTCAATAGAGCAGTTA
GCAGCAATCAGAAAAGTATCCATTGCTGGACTTTTATGTTCTGCTGATGAAGGACTTGAT
AATGTGCAACCTAGAGCTTTCGTAAAGGAAGATACATTCTTAAATGCAGCCCAACAATGC
TCTCAACACCCTCGGCTGGAGCTTTCTTCGTGGCGCGATGAAAGTGGTGCCCGGGCTGCA
GAGCGCCTCTCACAAGACATGCTGGCAGCTGCACTTCAGAAGGCAAAGCAGGAGATGGCT
GACCGCAAGAAACTGGAGTACATGTTATGGGAAGCACATGGAGGAGCCGACCCAAAATCT
CCTGTCGGCACAGCTGCATCTTTCTCAAAAGCAAATAAATATGCCCTTAAGTTGGCAAAC
ACCTCCTTATTCTTCGAATTCGCTACTAACGAACTCGTTAACTCTCTTAATGGACGACGA
CGCAAACGTCAGATCTTCGATGACTCCCTCGGCTTTGGATCAACCGATTTTGTAGAGTCT
CTTCAATCAGTAGACGTCAGTGGTTTTCTAGGGCAGGACCAACTTGGGCCGGTTATTGAA
CCGCAGTGCGATGATAATGGAAACTGCGATCCTGATAGTCCTTTCCGAAGCTACACTGGC
CACTGTAATAATTTGAGAAATCCTAATTTGGGAAAAAGTTTAACAACCTTCGCGAGACTG
TTACCTCCTGTTTATGAAGATGGTGTAAGCCGGCCCCGCATCAACTCAGTAACAGGCACC
CCGCTACCTTCCCCCCGTATAGTTTCTACGGTCATACATCCCGATATATCAAATCTTCAT
ACGCGCTACACTTTGATGGTGATGCAGTTCGCCCAATTCCTGGACCATGAACTGACAATG
ACTCCCATTCACAAAGGCTTCCACGAATCTATTCCGGACTGCAGATCTTGCGACTCTCCC
CGTACAGTGCATCCAGAATGCAACCCTTTCCCAGTACCTCGCGGTGACCATTATTATCCA
GAAGTTAACATAACTTCCGGAGAACGATTATGTTTTCCATTCATGAGAAGTTTACCAGGA
CAGCAGCAATTAGGACCGCGTGAACAAGTCAATCAAAATACAGCTTTCATTGATGGATCG
GCGATTTATGGAGAAAACCCTTGCATTGTTCGTAAACTGCGAGGTTTCAATGGCAGGCTC
AACGCTACTGCAAACCCGATTAATGGAAGAGATTTGTTGCCTAGAACAGATAACCACCCT
GAATGCAAAGCAGCCAGTGGTTTTTGTTTTATTGCTGGTGACGGTCGAGCTTCAGAACAG
CCTGGACTCACGGCTTTACACACAATCTTCATGCGCGAACACAACCGCATAGTTGAAGGA
CTTCGTGGTGTCAACCCTCATTGGGATGCGGAATTATTATTCGAACACACTAGGCGTATA
GTTGCTGCCTCATTCACACATATCATTTATAATGAATTTTTGCCAAGACTTTTGTCTTGG
AACGCTGTTAACTTGTATGGACTCAAATTATTACCTTCAGGTTACTATAAGGAATACTCT
CCAACCTGCAACCCGTCTATTGTAACGGAATTTGCAGCAGCCGCCTTCAGATTTGGTCAC
TCGTTGTTGAGACCACACTTACCGAGACTCTCACCTTCCTATCAACCAGTTGATCCACCA
ATATTGTTGAGAGATGGATTTTTCAGGCCTGACATGTTCATGAATCATCCACCAATGGTT
GACGAACTTATTCGTGGTTTATCTTCCACGCCCATGGAGACCCTTGACCAATTCATAACA
GGAGAAGTTACCAACCATCTATTCGAAGACCGGAGAATTCCGTTTTCGGGTATAGACTTA
GTAGCTCTTAATATCCAAAGAGGTAGAGATCACGGTATACCGAGTTATAACAACTATAGA
GCTTTGTGTAACTTAAAGAGAGCAGCAACTTTCGAAGATTTGGCGAGGGAAATTCCCGAT
GAAGTAATTGCGAGATTTAAGCGAATATACGCTACAGTAGACGACATTGATCTATTCCCT
GGTGGTATGAGCGAACGACCACTACAGGGCGGTCTAGTTGGACCCACCTTCGCCTGCATC
ATCGCTATACAGTTCAGGCAGTTAAGGAAATGCGATCGATTCTGGTATGAGAACGACAAC
AGAGCAGCTCGCTTCACCGAACAACAATTGTCGGAAATTCGCAAAGTAACATTGTCCAAG
GTTCTATGTGACAATTTCGATTTGCCAAGCGACATTCAACGCGCCTCTTTCGATCTACCT
AGCAACTTTTTGAATCCTCGCGTGCCATGCGCGTCTCTTCCAAAACTGGACCTTTCCGCA
TGGCGTGAGAGTTCAGCCCAGGGCTGTCTCATCGCGGGTCGCTCGGTACGACTTGGTGAC
TCCGCCTTCCCTTCGCCCTGTACATCGTGTATATGCACCGTTGACGGGGCGCAGTGCGCA
TCCCTACGCATCACAGACTGTGCACAGCTATGGCGTGAATGGCCACGAGAAGCTGTGCTA
AGAGATGATGTATGCACAGCACAGTGCGGCGCCGCCCCCGCAGGTCAGAGAGCGCCGCGG
AGACCACACGCTCACTTCAAATTCCCCGATCTTACACCATTCATCGCTAAATAG

Protein sequence:

VIFICFRLKRSILLYALFLTQILAQKTKTTDDNLQTVPSDLRRAVSEALALERRFLLGSN
DVRNCSYDEISNTPCPPSKYRSASGECNNVRHRPWGRRGDVFLRLLQPHYADGISQPFAS
PKLPEPRLAVQAVSQLAESVGHDYVTSLLAAWGQFLMDDLIATSNQNQKCESGCEYVRSA
PTRNYDSCGFEYRDQMNLATSVLDGSALYGNSEKELLSLRLYDAGKMDISSCRKCNENSP
TAPLYKALLTEHNRIAGELFSMNPFWEDNALFLEARRTIAAVIQHITYNEFLPVLLGEVG
MAKADLKLTTHGFWRGYSSANRVGAYAELVAVAPIFNAMMNEKLINTTILLKDLVKTSAH
QISRFALSAQWDLNRARDHGVPSYLKVLQLCDPMANVKSYAEFEKLGFDRRHQEIFADMY
RNAEDIELMVAGAMEKPATGAVIGSTLACVLALQFGNLKKSDRFWYENDIPPSSFSIEQL
AAIRKVSIAGLLCSADEGLDNVQPRAFVKEDTFLNAAQQCSQHPRLELSSWRDESGARAA
ERLSQDMLAAALQKAKQEMADRKKLEYMLWEAHGGADPKSPVGTAASFSKANKYALKLAN
TSLFFEFATNELVNSLNGRRRKRQIFDDSLGFGSTDFVESLQSVDVSGFLGQDQLGPVIE
PQCDDNGNCDPDSPFRSYTGHCNNLRNPNLGKSLTTFARLLPPVYEDGVSRPRINSVTGT
PLPSPRIVSTVIHPDISNLHTRYTLMVMQFAQFLDHELTMTPIHKGFHESIPDCRSCDSP
RTVHPECNPFPVPRGDHYYPEVNITSGERLCFPFMRSLPGQQQLGPREQVNQNTAFIDGS
AIYGENPCIVRKLRGFNGRLNATANPINGRDLLPRTDNHPECKAASGFCFIAGDGRASEQ
PGLTALHTIFMREHNRIVEGLRGVNPHWDAELLFEHTRRIVAASFTHIIYNEFLPRLLSW
NAVNLYGLKLLPSGYYKEYSPTCNPSIVTEFAAAAFRFGHSLLRPHLPRLSPSYQPVDPP
ILLRDGFFRPDMFMNHPPMVDELIRGLSSTPMETLDQFITGEVTNHLFEDRRIPFSGIDL
VALNIQRGRDHGIPSYNNYRALCNLKRAATFEDLAREIPDEVIARFKRIYATVDDIDLFP
GGMSERPLQGGLVGPTFACIIAIQFRQLRKCDRFWYENDNRAARFTEQQLSEIRKVTLSK
VLCDNFDLPSDIQRASFDLPSNFLNPRVPCASLPKLDLSAWRESSAQGCLIAGRSVRLGD
SAFPSPCTSCICTVDGAQCASLRITDCAQLWREWPREAVLRDDVCTAQCGAAPAGQRAPR
RPHAHFKFPDLTPFIAK