DPGLEAN11910 in OGS1.0

New model in OGS2.0  DPOGS210084
Genomic Position  scaffold500:+ 65143-70939
CDS Length  1875
Paired RNAseq reads  110
Single RNAseq reads  247
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011886 (6e-133)
Best Drosophila hit  peroxidase, isoform B (3e-49)
Best Human hit  peroxidasin homolog precursor (4e-33)
Best NR hit (blastp)  oxidase/peroxidase [Aedes aegypti] (3e-70)
Best NR hit (blastx)  PREDICTED: similar to AGAP010734-PA, partial [Acyrthosiphon pisum] (2e-68)
Gene Ontology terms
GO:0004601 peroxidase activity
GO:0005506 iron ion binding
GO:0005576 extracellular region
GO:0020037 heme binding
GO:0006979 response to oxidative stress
GO:0055114 oxidation reduction
GO:0007306 eggshell chorion assembly
GO:0042600 chorion
GO:0006911 phagocytosis, engulfment
InterPro families
IPR019791 Haem peroxidase, animal, subgroup
IPR002007 Haem peroxidase, animal
IPR010255 Haem peroxidase
Orthology group  MCL40686

Nucleotide sequence:

ATGAATCAAGAGGTTGAGGTGGTAGTGCCTACTTGTGGTCAAGTTACTACGGCTCGAACA
TGGGCCGGCTCCACTTGGGAAACAGACATCCTCACTCTAATGATTAATTATTATTTTAAA
AATAGATGGTGTGTGAGTGAAGTGCTACCATGTAACCCACAAGAGACAAGACGACTGGAT
GGAACCTGTAACAACCTCAAGCACCCCAACAGAGGAGCCTCGCACACCCCGACATATAGA
CTGCTACCAGCCCATTATGATAAGGATTTTGAGCCAAGAAAATCCAAGAGCGGTAAACCT
CTGTCGCTATGTAGGAAAATCCGTACGTCATTGTTAGCCGAGGGGAGAGTCCCTGACACC
GAATTGACACAGCTATTGTTGCATTTCTGGGTATTTGTTTCGTCTGACGTGTTATCATTG
CATGATACAGTAAACTACATTTTATGGAAACCATATTGCTGTCAAGAAAGAGGGAAGACA
GACAAGGGTTGTATTCCGAACATAATCCCTGAAGATGATCCTGTTCATCGCTTCTCTTCT
ATCCGCTGCATGAACTTAACCAGACCTTGGAGTTACCAATCTACAGGGTGTTATAGAAAT
GACACTACCCCAGAAAGAATAATAACAGCTAGTCCCGCATACGATTTATCTCACGTGTAT
GGTCTCTCTTTAAAATTAATAAATGAAAAGCACAGAAGTTTTAAAAATGGTATGCTCAAG
TTTGAAGTTGAAAATAATATGATATGGCCCCCAAGCACAAAGACTCCGGTCAACCTATGT
CTTCTTAATCAAAAACCGAAGGAAACTCGTTGTCACGATACTCCTGAATCGGGCTCCAAT
AGTGTACTGGGTCTTAATCTTTTTGTCATCTGGACTTGGCGTTTTCACAATTTCGTGGCA
TCAGAACTATCAAGAATCAATCCTTGCTGGTCTGATGACAGACTCTTCTTCACAGCCAGA
GATATCGTCATTGCTTACTATATGCAGATGTTCTATTATGAATTGTCACCGACACTGTTG
GGTTACGAAAATCTTCTTCGAGACGGAGTTCTTTCACCTTTCAAAGACTTTAGAGATCTT
TACAAGGAGGATCTCTTGCCACAAATATCTATAGAATATCCTGTGGTTCTTAGATGGGCT
CACACTATAACTGAGGGAGTACTAAAAATGTACGACGCAAAAGGAAATTATTTGAACGAG
ACAAAGATCGTCGATTTAACATTAAGAACGGGATATTTAGTTGAAAATTTAGAATTCATC
ACACATGGTGCATACAGACAGCCGGCTGCTAAGAATGATGGAGTTGTCGATCCAGATATT
TCAGAAAAAGGTCTCGGGCCTCATCAAAGAGCATCCGATTTACCAACAAGTGATATGTGC
AAGAATCGTTACTTTGGGTTGGCACCGTATATTAAGTATAGAAAACTGTGCTCGGGAGTA
GATTACCGGAGTTTTGATGATTTAATAGAAGTCATGGATCCAGAAAGGATAGAGATTCTA
AAGGAATTGTATGAACACGTTGAAGACATAGATTTAATGGCTGGAATATATTCAGAGAGG
TATGTTCAAGGAGGTCATGTTCCCCTCACCCTGTACTGTGTCGTCGTAGAACAGATGATG
AGGACGATGATGTCTGACAGACATTGGTACGAGAGACCGAATAGACCGAACGCGTTTACC
AGGAATCAGCTGTTACAGATTAGAAAGGCATCTGTAGCTCAGATGCTGTGTTTGGTTGGA
GATGGAGTGACACATATACAGCCTCATGCTTTCTCTATGCCAGGGCCCGGGAATGAGATG
TGTAGCTGTAAAATGATCGAGAAAATCAATTTTTGGGCTTGGAAAGATACAAGTTGTGGA
TTAAGCAACGCATAA

Protein sequence:

MNQEVEVVVPTCGQVTTARTWAGSTWETDILTLMINYYFKNRWCVSEVLPCNPQETRRLD
GTCNNLKHPNRGASHTPTYRLLPAHYDKDFEPRKSKSGKPLSLCRKIRTSLLAEGRVPDT
ELTQLLLHFWVFVSSDVLSLHDTVNYILWKPYCCQERGKTDKGCIPNIIPEDDPVHRFSS
IRCMNLTRPWSYQSTGCYRNDTTPERIITASPAYDLSHVYGLSLKLINEKHRSFKNGMLK
FEVENNMIWPPSTKTPVNLCLLNQKPKETRCHDTPESGSNSVLGLNLFVIWTWRFHNFVA
SELSRINPCWSDDRLFFTARDIVIAYYMQMFYYELSPTLLGYENLLRDGVLSPFKDFRDL
YKEDLLPQISIEYPVVLRWAHTITEGVLKMYDAKGNYLNETKIVDLTLRTGYLVENLEFI
THGAYRQPAAKNDGVVDPDISEKGLGPHQRASDLPTSDMCKNRYFGLAPYIKYRKLCSGV
DYRSFDDLIEVMDPERIEILKELYEHVEDIDLMAGIYSERYVQGGHVPLTLYCVVVEQMM
RTMMSDRHWYERPNRPNAFTRNQLLQIRKASVAQMLCLVGDGVTHIQPHAFSMPGPGNEM
CSCKMIEKINFWAWKDTSCGLSNA
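
Consistency check (illustrative sketch): the 1875 nt CDS corresponds to 625 codons, i.e. 624 residues plus a terminal stop codon, which agrees with the protein sequence above. The snippet below is not part of the original record. It assumes the standard genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical. Only the first and last lines of the nucleotide sequence are translated here; the full CDS can be passed in the same way.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; a trailing stop codon, if present, is dropped."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGAATCAAGAGGTTGAGGTGGTAGTGCCTACTTGTGGTCAAGTTACTACGGCTCGAACA"
last_line = "TTAAGCAACGCATAA"

print(translate(first_line))  # MNQEVEVVVPTCGQVTTART
print(translate(last_line))   # LSNA
print(1875 // 3 - 1)          # 624 residues expected from the CDS length
```

The two translated fragments match the start and end of the protein sequence listed in this record.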