DPGLEAN21738 in OGS1.0

New model in OGS2.0  DPOGS206025
Genomic Position  scaffold35:- 61246-69600
CDS Length  3753
Paired RNAseq reads  1243
Single RNAseq reads  3194
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000553 (0.0)
Best Drosophila hit  peroxidasin, isoform E (0.0)
Best Human hit  peroxidasin homolog precursor (0.0)
Best NR hit (blastp)  PREDICTED: similar to peroxidasin [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to peroxidasin [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0005578 proteinaceous extracellular matrix
GO:0004601 peroxidase activity
GO:0005515 protein binding
GO:0006979 response to oxidative stress
GO:0055114 oxidation reduction
GO:0020037 heme binding
InterPro families
IPR019791 Haem peroxidase, animal, subgroup
IPR013783 Immunoglobulin-like fold
IPR002007 Haem peroxidase, animal
IPR010255 Haem peroxidase
IPR007110 Immunoglobulin-like
IPR001611 Leucine-rich repeat
IPR003591 Leucine-rich repeat, typical subtype
IPR000483 Cysteine-rich flanking region, C-terminal domain
IPR003599 Immunoglobulin subtype
IPR003598 Immunoglobulin subtype 2
IPR013098 Immunoglobulin I-set
Orthology group  MCL11446

Nucleotide sequence:

ATGTCTCATTTGGAACAGCTGTACTTGCACGTAAACGAGATTCATCAGATCGAACCAGAA
ACCTTTTCCAACCTTCCCCGATTAGGTCGACTGTACCTTCATAACAATAACTTGAAAACG
ATACCCCCTGGTTCATTCCGGGGTATGCCGAAACTGAGCAAACTCCGATTAGACAGTAAT
GCGTTGGTTTGTGATTGTAATATGTTATGGTTTGCTCGAATGCTCGCTGAACATCGTAAC
ATTACTATTGCCGCAACTTGCTATGAACCTGCGAAAGCAACTGGAACATCTTTAGCAGCA
ATGCAGGAAAAAGATTTCCACTGTCGTCAACCGGAGATTATGTCTGATCCTGAGGATGTG
GTTGTTAATTTTGGAGATGAAGCTATTTTTACTTGTATGGCTAGTGGCGAACCAGCACCT
GAAATAGTGTGGTTCCGCGACTCAGCCGCCTTACCTGACGATACAAGCAGATACGAAATT
ATGGATAATGGAACTCTTATGGTTCATCATGCAGATGAAAATGATATTGGTGTTTTCGAA
TGTTCGGCCAAAAATCCTGCTGGTGAAGCGCGATCCAAGCCAGCCAGAATGATGCTGCAA
ACTAAACCAGATAATAATGGTGCGATTACCTTTCCTGTTTTTACCATCTTGCCCCGAAAA
AGTGTGGTTAATATTAATCAACCCTACGCACGTTTCGATTGTGTGGCAAAAGGCAATCCA
AAACCTCATATTTCTTGGTATTTCAATGGAGAGCGTATACTGTTAACTGATCGAATAACT
ATGCATCACAATGGATCTATAGTTATTGAAAATATAAAATACGAAGATACAGGATCTTAC
ACATGTCAAGCTGAAAATGTCAACGGGAAGATAACGGCATCTGTTACTTTAGAAGTTATG
GTGGCGCCTGCATTTATCATAGTTCCAAAAGACCAAACTGTAACAATTGGTGATTCAGCA
CATTTTCGATGCACAGCTAGAGGAACTCCGACACCTATTATAAAATGGTACAGAAACACT
ATGTCTTTGCCACCAAGTGAAAATATCGTTTTTAGCGATAACGATCAAAATTTGACAATC
GTAGAAACTTCCGAAGATGATGCAGGATTATATCATTGCAGAGCAGAAAATTCCGAAGGT
CTCACTGAAATATCTGCTGTTTTGAAAATAGAAAGTTTTGAAATAATTCCACCAAAAATT
ACCTTGAAACCAGAAGATACAGATGCATTTAAGGAAACAACAGTTCAGTTGCCTTGTGAA
TATGAGAGTGATCCACCAGCACTTGTGGAATGGAGAAAAGATGGAAGCCGTATTATAACT
AATGACAGAATAAGTATATCTTTAATTGGGAGTTTGATTATTAACAACGTTTCCATAACC
GATACTGGAAGTTACGAGTGTTCTGTTCACAACGAACATGGACGTGATACGGCTTCATCA
TTTTTGACGGTAAAAGATCACATTTTACCTGGCGATGAATATGTAAATATAGCTATAACT
GAAGCTATAAGAGATGTTGATCAGGCAATAGCGAAAAGTATAGACAATTTGTTTAACAAC
AAAAGTTCCAACATTAGTTTTCAAGATCTGTATAGAATTACTAGATTCCCAAATGCCCCA
GCTAGAGAAGTTGCTCGGGCGGCTGAAATATATGAGAGAACTTTAGATAAAGTAAAAGGA
TTTATACAATCTGGATTGAAAATAACATCGGCACAACCATTTAATTATGAAAATATATTA
TCAGCACAACATTTAGAAATCATAGCCAGACTCTCTGGTTGCGTAGCACACCGTGAAAGC
AAAGACTGTTCTGACATGTGTTTCCATAAAAAATACCGAAGTATTGATGGCAGTTGCAAT
AACTTTGATCAACCAACATGGGGTACATCGCTCACTGCATTTCGACGCATTCTCTTTCCT
ATTTACGAAAATGGCTTTAGTGAACCAACAGGTTGGAACAAGAAAGTTAAATATAATGGT
TATTCTTTACCGAGTGCTCGGTTAGTTTCTACAACAATTATTAGTACCACTGAAATTTCT
GAGGATGTTCGGATTACTCATATGACAATGCAATGGGGTCAGTGGTTAGATCATGACTTA
GATCACGCTTTGCCATCTGCAAGTTCTCAAACGTGGGATGGTGTTGACTGTAAAAAAACA
TGTGACTACGCGGCTCCTTGTTTTCCGATAGATGTCCCTAAAAATGATCCTCGAATAACC
AACCGCCGATGCATTGATTTTATTCGAACTAGCGCTGTATGTGGATCGGGTATGACCTCG
GTTTTATTTGGCAGACTGCAGCCAAGAGAGCAAATAAATCAACTCACGTCTTACATTGAT
GCCTCTCAAGTATATGGTTTTGAGAAATCTGTAGCTGAGGATCTCCGTGATTTGACGAAC
ACTAACGGTACTCTCCGAGTAGGAGCTAAGTTCCCGGGTAAGAAACCATTACTACCAACA
ACAGGTTTAAATGGTATGGACTGCAGACGTAATCTTGCAGAAAGCAATCGTAATTGCTTT
GTTGCGGGTGATATAAGGGCAAATGAACAGATTGGTTTAGCTGCTATGCACACTATCTGG
ATGAGAGAGCATAACCGTATCGCAACAGAACTAAAAGCCATAAATCCCTTCTGGGACGGA
GAAAAATTATACCAAGAAGCGAGAAAAATTGTCGGAGCGCAAATGCAAGTCATAACTTAC
GAACAATGGCTGCCTCTCATTCTTGGTCCAGAGGGATACGAACAGCTGGGAAAATACAAG
GAATATGACCCTAATCTAAACCCTTCAGTCTCAAACGTTTTCGCCACTGCTGCTCTTCGA
TTTGGACACTCTATCATTAATCCACTTTTACATCGTTATGACGAGAACTTTGAGCCGATC
CCTCAAGGTCATTTACTGTTGCGTCATGCATTTTTCTCCCCATGGAGACTAGTCGATGAG
GGTGGAGTTGATCCGCTATTTAGAGGAATGTTCACGACGCCTGCTAAATTGAAGACACCA
ACACAGAATTTAAACTCTGAACTTACGGAAAAACTATTCCATACTGCACATGCAGTCGCT
CTTGACTTAGCTGCAATAAATATTCAACGAGGACGTGATCATGCTATTCCACCGTACAAT
AAATGGCGGCAATTTTGCAATATGACCGAGGCTAACGATTTCGATGACTTGGCCAATGAG
ATCACTGACAAAACCGTACGAGACAAGCTAAGAGAATTGTATGGCTCTGTGCACAATATT
GATGTTTGGGTTGGTGGCATTTTAGAGGATCAAGTTGAGGGAGGTAAAATAGGACCTCTT
TTCCGATGCTTACTTATTGAACAGTTTCAACGATTACGTCATGGCGATCGTTTGTGGTAT
GAAAATCCGTCGACATTCTCAAGAGACCAATTGCGACAAATCAAAAACGCAAACTTTGCA
AGGGTTTTATGTGATAATGGTGACAATATTGATACAATAAGTGAGAATGTATTCTTGTTA
CCTGAATTACAGGACGGTCTTGTATCTTGCGAGGATGTCCCTAAGATCGATCTACGTTTT
TGGGCCGACTGTGAATCATGCGGCGATGATGATTACGAAACTGAATCAAATCGAGTGCGC
AGAGATGTAATGTCAAGTGCCGATCTTTACACTGAACTGACAGAAAATGATCACCGTCTA
AATACCCTAGAAGATTCTCACGAGGAATTGGTGAAAGCAATTAATAAGCTTAAAAAGAGG
GTCAAAGAGTTAGAGAAAGCATGCAATAAGTAA

Protein sequence:

MSHLEQLYLHVNEIHQIEPETFSNLPRLGRLYLHNNNLKTIPPGSFRGMPKLSKLRLDSN
ALVCDCNMLWFARMLAEHRNITIAATCYEPAKATGTSLAAMQEKDFHCRQPEIMSDPEDV
VVNFGDEAIFTCMASGEPAPEIVWFRDSAALPDDTSRYEIMDNGTLMVHHADENDIGVFE
CSAKNPAGEARSKPARMMLQTKPDNNGAITFPVFTILPRKSVVNINQPYARFDCVAKGNP
KPHISWYFNGERILLTDRITMHHNGSIVIENIKYEDTGSYTCQAENVNGKITASVTLEVM
VAPAFIIVPKDQTVTIGDSAHFRCTARGTPTPIIKWYRNTMSLPPSENIVFSDNDQNLTI
VETSEDDAGLYHCRAENSEGLTEISAVLKIESFEIIPPKITLKPEDTDAFKETTVQLPCE
YESDPPALVEWRKDGSRIITNDRISISLIGSLIINNVSITDTGSYECSVHNEHGRDTASS
FLTVKDHILPGDEYVNIAITEAIRDVDQAIAKSIDNLFNNKSSNISFQDLYRITRFPNAP
AREVARAAEIYERTLDKVKGFIQSGLKITSAQPFNYENILSAQHLEIIARLSGCVAHRES
KDCSDMCFHKKYRSIDGSCNNFDQPTWGTSLTAFRRILFPIYENGFSEPTGWNKKVKYNG
YSLPSARLVSTTIISTTEISEDVRITHMTMQWGQWLDHDLDHALPSASSQTWDGVDCKKT
CDYAAPCFPIDVPKNDPRITNRRCIDFIRTSAVCGSGMTSVLFGRLQPREQINQLTSYID
ASQVYGFEKSVAEDLRDLTNTNGTLRVGAKFPGKKPLLPTTGLNGMDCRRNLAESNRNCF
VAGDIRANEQIGLAAMHTIWMREHNRIATELKAINPFWDGEKLYQEARKIVGAQMQVITY
EQWLPLILGPEGYEQLGKYKEYDPNLNPSVSNVFATAALRFGHSIINPLLHRYDENFEPI
PQGHLLLRHAFFSPWRLVDEGGVDPLFRGMFTTPAKLKTPTQNLNSELTEKLFHTAHAVA
LDLAAINIQRGRDHAIPPYNKWRQFCNMTEANDFDDLANEITDKTVRDKLRELYGSVHNI
DVWVGGILEDQVEGGKIGPLFRCLLIEQFQRLRHGDRLWYENPSTFSRDQLRQIKNANFA
RVLCDNGDNIDTISENVFLLPELQDGLVSCEDVPKIDLRFWADCESCGDDDYETESNRVR
RDVMSSADLYTELTENDHRLNTLEDSHEELVKAINKLKKRVKELEKACNK
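
Consistency check (illustrative):

The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The sketch below is not part of the original record. It assumes the standard genetic code (NCBI translation table 1), and the `translate` helper is a hypothetical name introduced here for illustration. It translates the first 30 and last 33 nucleotides of the CDS and confirms that 1250 residues plus one stop codon account for 3753 nucleotides.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 33 nucleotides, copied from the nucleotide sequence above.
cds_start = "ATGTCTCATTTGGAACAGCTGTACTTGCAC"
cds_end = "GTCAAAGAGTTAGAGAAAGCATGCAATAAGTAA"

print(translate(cds_start))  # MSHLEQLYLH
print(translate(cds_end))    # VKELEKACNK*

# Length check: 1250 residues plus one stop codon.
print(3 * (1250 + 1))  # 3753
```

Both translated fragments match the start and end of the protein sequence, and the trailing TAA stop codon explains why the CDS length is three nucleotides longer than 3 x 1250.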