DPGLEAN17144 in OGS1.0

New model in OGS2.0  DPOGS202185
Genomic Position  scaffold4468:- 6370-15017
CDS Length  2439
Paired RNAseq reads  756
Single RNAseq reads  1868
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013482 (0.0)
Best Drosophila hit  CG6969, isoform A (8e-159)
Best Human hit  peroxidasin homolog precursor (7e-96)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC004579 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to CG6969-PA isoform 1 [Apis mellifera] (0.0)
GeneOntology terms
GO:0004601 peroxidase activity
GO:0006979 response to oxidative stress
GO:0020037 heme binding
GO:0055114 oxidation reduction
InterPro families
IPR002007 Haem peroxidase, animal
IPR019791 Haem peroxidase, animal, subgroup
IPR010255 Haem peroxidase
Orthology group  MCL15031

Nucleotide sequence:

ATGCAGAGACCTCAGGGCAGCAGTGAGCGGACTCCGTTGGTGCCGCCCACTTACATGTTC
GAGTCCAGCATCTCCAGGAGCTACCAGAAACGTCTCAGAAACTTCCAGTGCGCCGTCTGT
GTCGTATTGATCCTACTACTGTCGGTAACATTGTTGGTGACCATATCCTACAACCTCACC
CTCGGCGGCCCTGAGTTTCCGGAGGTGGTGTCTCCGACGACTCCGTCCTCTCCTGAGGGC
AACCTCACCCGCCGTCTCATGCTCAGCCCTGACCTGATGCCGATCATGAACAGAACCTGG
CCTCTCAATGGTCGCCCTATCCCAAAATGGAAAGCCGAAACAGTAAGTCCGGAAGCCATA
GACGCCGCTGTACAGAAAGGCAAAGCTATGCTAGTGAAGCGTAGAATAATAGAGCGAAGC
CTTACTCCTCTCGACTCAGAGTCGCCGGCCTTCAGAGGCCAAAGGGCGGCAGCTACGTCG
GCCCTGGTTAAACCGATCGCCGAGACAGCGTACGCCGTAGAAGAAGCAACCAGAGAATTA
CTGAACAGCACTGAGATACCCGATGCGGTTGGCGCAGTCGGTGTGGGTCCAGCAACCAAC
GGGTCGTTCCCGGAGCCAGCGTACTGCCGCCCGCCCACCGCGCCCTGCGTCATCTCTAAG
TACAGGACGCAGGATGGCTCTTGTAACAACCTGGACCATCCTCTACTCTGGGGCGTCTCC
AATACACCGTTCAGACGAGTCCTCCCACCAGACTACGGTGACGGTGTAAGCTCCCCCCGC
ACTGGATGGAACGGCGCTCCTCTGCCCAGCGCTCGAGATGTCAGCGTAACGGTGCACAGA
CCCAGCTACGCTCACGACACACAGTTCACCGTGATGCTCGCCGTGTGGGGACAGTTCATA
GACCACGACATCACAGCCACCGCTCTCAACAAGGGAGCCAACAGCACTCCCATCTCTTGC
TGCACCGACATGACAATACACCCGGAGTGCTTCCCGGTGAAGCTGGACCCGGAGGACCCC
TTCTACCAGGACTACAACCTCACATGCATGGAGTTTGTGAGGTCAGCGCCTGCGCCTACC
TGCCATTTCGGTCACCGCGAGCAGCTGAACCAGGCGACAGCGTTCCTGGACGCGTCGACG
GTCTACAGCTTCATGGAGAACAAGACCAACCAGCTCCGTGCGGGAGCCAACGGTCAGCTG
CGGATGTTGAAGCTCGGCCCCTGGGAGCTGCTGCCGCCCTCCACCGACCCCAACGACGGA
TGTAACACGGTCGAGATGAACGCCAAAGGACGCTACTGCTTCGAATCGGGCGACGACCGC
GCTAACGAGAACCTCCATCTGACGACGATGCACCTGCTATGGGCCCGACAACACAACCGC
GTGGCAGCGCGCCTCCAGCAGCTCAACCCCGCCTGGGACGACCAGCAGCTGTTCCAGGAG
ACGCGCAGGATAGTCGGAGCCCAGATGCAGCATATCACATACGCAGAATTCTTACCATCT
ATACTAGGGGAGGACGTGATGTGGTCGTTGAACCTCACGCTGCAGGAGTCAGGGTACGCG
ACCGTGTACGACTCCGCAGTGGACCCTTCCATCGCGAACCACTTCTCCGCCGCAGCCTTC
AGATTCGCTCACACGCTGCTGCCGGGCCTGATCCATAACGTGGACCTGAGCACGGGCACG
GTGAGCTACACGCACCTCCACGAGATGTTGTTCAACCCGTACGCGCTGTACAACGAGCAG
GGGTCCAAGAGGTCCGTGAGGTCCGCCATCTACACGCCCGTGCACGCCGTGGATCCCCAC
ATCACCAGCGAGCTGAGCAATCATCTCTTCGAGCGCAGCGTCGCCAACAGCAGCAGCAGT
GTGAAGGGTGCCAATCCCCTGCCGTGCGGACTGGACCTGGTGTCGCTGAACATCCAGCGA
GGCCGCGACCACGGCTTGCCCGCCTACCCTGCCTGGAGGGAGCACTGCGGCCTCTCCCGC
CCGCACACCTTCGAGGACCTGGAACCGATCTTTGACGAACTGTCCTTGAGCAGGATTTGC
AAAATATACAAGAGCGTCGATGACATAGACCTGTACACGGGCGCCCTGGCTGAGGACCCC
AAAGGCCGTCTCCTGGGCCCCACGCTCACATGTCTCGTAGCGGATCAGTTTCTGCGCATC
AAGGTCGGCGACCGCTACTGGTACGAGACCTCGGATCCAGATATTAAATTTACTCCAGAA
CAACTGTACGAAATCCGTAAGACGACCCTGGCGGGAGTGATCTGCGCTAACGAGGGTCTG
CTGGATCAGGCGCAGCCGCGCGTCATGGAGGCTCTGAGCGCCACCAACCCGCTGGTCGAC
TGCAAGGAACTCCCGCAACCTGACTTCAAACCTTGGAAGGATCCCGACCCGAACCAGCCG
ACCAAGAAACCATCGAGCAAAAACAACAACAAAGGATAA

Protein sequence:

MQRPQGSSERTPLVPPTYMFESSISRSYQKRLRNFQCAVCVVLILLLSVTLLVTISYNLT
LGGPEFPEVVSPTTPSSPEGNLTRRLMLSPDLMPIMNRTWPLNGRPIPKWKAETVSPEAI
DAAVQKGKAMLVKRRIIERSLTPLDSESPAFRGQRAAATSALVKPIAETAYAVEEATREL
LNSTEIPDAVGAVGVGPATNGSFPEPAYCRPPTAPCVISKYRTQDGSCNNLDHPLLWGVS
NTPFRRVLPPDYGDGVSSPRTGWNGAPLPSARDVSVTVHRPSYAHDTQFTVMLAVWGQFI
DHDITATALNKGANSTPISCCTDMTIHPECFPVKLDPEDPFYQDYNLTCMEFVRSAPAPT
CHFGHREQLNQATAFLDASTVYSFMENKTNQLRAGANGQLRMLKLGPWELLPPSTDPNDG
CNTVEMNAKGRYCFESGDDRANENLHLTTMHLLWARQHNRVAARLQQLNPAWDDQQLFQE
TRRIVGAQMQHITYAEFLPSILGEDVMWSLNLTLQESGYATVYDSAVDPSIANHFSAAAF
RFAHTLLPGLIHNVDLSTGTVSYTHLHEMLFNPYALYNEQGSKRSVRSAIYTPVHAVDPH
ITSELSNHLFERSVANSSSSVKGANPLPCGLDLVSLNIQRGRDHGLPAYPAWREHCGLSR
PHTFEDLEPIFDELSLSRICKIYKSVDDIDLYTGALAEDPKGRLLGPTLTCLVADQFLRI
KVGDRYWYETSDPDIKFTPEQLYEIRKTTLAGVICANEGLLDQAQPRVMEALSATNPLVD
CKELPQPDFKPWKDPDPNQPTKKPSSKNNNKG
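
Consistency check (illustrative sketch):

The CDS length, nucleotide sequence, and protein sequence in this record can be cross-checked against each other. The Python sketch below is one minimal way to do that. It assumes the standard genetic code (NCBI translation table 1) and a complete CDS that ends in a single stop codon. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration; they are not part of the database or of any library.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS read in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon."""
    return cds_length // 3 - 1


if __name__ == "__main__":
    # Last line of the nucleotide sequence in this record.
    cds_tail = "ACCAAGAAACCATCGAGCAAAAACAACAACAAAGGATAA"
    print(translate(cds_tail))            # TKKPSSKNNNKG*
    print(expected_protein_length(2439))  # 812
```

Under these assumptions, the translated tail matches the last 12 residues of the protein sequence above, and a CDS of 2439 nt corresponds to 812 residues plus the stop codon, which agrees with the length of the listed protein.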