New model in OGS2.0 | DPOGS206025  |
---|---|
Genomic Position | scaffold35:61246-69600 (- strand) |
CDS Length (bp) | 3753 |
Paired RNAseq reads   | 1243 |
Single RNAseq reads   | 3194 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA000553 (0.0) |
Best Drosophila hit   | peroxidasin, isoform E (0.0) |
Best Human hit | peroxidasin homolog precursor (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to peroxidasin [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to peroxidasin [Tribolium castaneum] (0.0) |
GeneOntology terms | GO:0005578 proteinaceous extracellular matrix; GO:0004601 peroxidase activity; GO:0005515 protein binding; GO:0006979 response to oxidative stress; GO:0055114 oxidation reduction; GO:0020037 heme binding |
InterPro families | IPR019791 Haem peroxidase, animal, subgroup; IPR013783 Immunoglobulin-like fold; IPR002007 Haem peroxidase, animal; IPR010255 Haem peroxidase; IPR007110 Immunoglobulin-like; IPR001611 Leucine-rich repeat; IPR003591 Leucine-rich repeat, typical subtype; IPR000483 Cysteine-rich flanking region, C-terminal domain; IPR003599 Immunoglobulin subtype; IPR003598 Immunoglobulin subtype 2; IPR013098 Immunoglobulin I-set |
Orthology group | MCL11446 |
Nucleotide sequence:
ATGTCTCATTTGGAACAGCTGTACTTGCACGTAAACGAGATTCATCAGATCGAACCAGAA
ACCTTTTCCAACCTTCCCCGATTAGGTCGACTGTACCTTCATAACAATAACTTGAAAACG
ATACCCCCTGGTTCATTCCGGGGTATGCCGAAACTGAGCAAACTCCGATTAGACAGTAAT
GCGTTGGTTTGTGATTGTAATATGTTATGGTTTGCTCGAATGCTCGCTGAACATCGTAAC
ATTACTATTGCCGCAACTTGCTATGAACCTGCGAAAGCAACTGGAACATCTTTAGCAGCA
ATGCAGGAAAAAGATTTCCACTGTCGTCAACCGGAGATTATGTCTGATCCTGAGGATGTG
GTTGTTAATTTTGGAGATGAAGCTATTTTTACTTGTATGGCTAGTGGCGAACCAGCACCT
GAAATAGTGTGGTTCCGCGACTCAGCCGCCTTACCTGACGATACAAGCAGATACGAAATT
ATGGATAATGGAACTCTTATGGTTCATCATGCAGATGAAAATGATATTGGTGTTTTCGAA
TGTTCGGCCAAAAATCCTGCTGGTGAAGCGCGATCCAAGCCAGCCAGAATGATGCTGCAA
ACTAAACCAGATAATAATGGTGCGATTACCTTTCCTGTTTTTACCATCTTGCCCCGAAAA
AGTGTGGTTAATATTAATCAACCCTACGCACGTTTCGATTGTGTGGCAAAAGGCAATCCA
AAACCTCATATTTCTTGGTATTTCAATGGAGAGCGTATACTGTTAACTGATCGAATAACT
ATGCATCACAATGGATCTATAGTTATTGAAAATATAAAATACGAAGATACAGGATCTTAC
ACATGTCAAGCTGAAAATGTCAACGGGAAGATAACGGCATCTGTTACTTTAGAAGTTATG
GTGGCGCCTGCATTTATCATAGTTCCAAAAGACCAAACTGTAACAATTGGTGATTCAGCA
CATTTTCGATGCACAGCTAGAGGAACTCCGACACCTATTATAAAATGGTACAGAAACACT
ATGTCTTTGCCACCAAGTGAAAATATCGTTTTTAGCGATAACGATCAAAATTTGACAATC
GTAGAAACTTCCGAAGATGATGCAGGATTATATCATTGCAGAGCAGAAAATTCCGAAGGT
CTCACTGAAATATCTGCTGTTTTGAAAATAGAAAGTTTTGAAATAATTCCACCAAAAATT
ACCTTGAAACCAGAAGATACAGATGCATTTAAGGAAACAACAGTTCAGTTGCCTTGTGAA
TATGAGAGTGATCCACCAGCACTTGTGGAATGGAGAAAAGATGGAAGCCGTATTATAACT
AATGACAGAATAAGTATATCTTTAATTGGGAGTTTGATTATTAACAACGTTTCCATAACC
GATACTGGAAGTTACGAGTGTTCTGTTCACAACGAACATGGACGTGATACGGCTTCATCA
TTTTTGACGGTAAAAGATCACATTTTACCTGGCGATGAATATGTAAATATAGCTATAACT
GAAGCTATAAGAGATGTTGATCAGGCAATAGCGAAAAGTATAGACAATTTGTTTAACAAC
AAAAGTTCCAACATTAGTTTTCAAGATCTGTATAGAATTACTAGATTCCCAAATGCCCCA
GCTAGAGAAGTTGCTCGGGCGGCTGAAATATATGAGAGAACTTTAGATAAAGTAAAAGGA
TTTATACAATCTGGATTGAAAATAACATCGGCACAACCATTTAATTATGAAAATATATTA
TCAGCACAACATTTAGAAATCATAGCCAGACTCTCTGGTTGCGTAGCACACCGTGAAAGC
AAAGACTGTTCTGACATGTGTTTCCATAAAAAATACCGAAGTATTGATGGCAGTTGCAAT
AACTTTGATCAACCAACATGGGGTACATCGCTCACTGCATTTCGACGCATTCTCTTTCCT
ATTTACGAAAATGGCTTTAGTGAACCAACAGGTTGGAACAAGAAAGTTAAATATAATGGT
TATTCTTTACCGAGTGCTCGGTTAGTTTCTACAACAATTATTAGTACCACTGAAATTTCT
GAGGATGTTCGGATTACTCATATGACAATGCAATGGGGTCAGTGGTTAGATCATGACTTA
GATCACGCTTTGCCATCTGCAAGTTCTCAAACGTGGGATGGTGTTGACTGTAAAAAAACA
TGTGACTACGCGGCTCCTTGTTTTCCGATAGATGTCCCTAAAAATGATCCTCGAATAACC
AACCGCCGATGCATTGATTTTATTCGAACTAGCGCTGTATGTGGATCGGGTATGACCTCG
GTTTTATTTGGCAGACTGCAGCCAAGAGAGCAAATAAATCAACTCACGTCTTACATTGAT
GCCTCTCAAGTATATGGTTTTGAGAAATCTGTAGCTGAGGATCTCCGTGATTTGACGAAC
ACTAACGGTACTCTCCGAGTAGGAGCTAAGTTCCCGGGTAAGAAACCATTACTACCAACA
ACAGGTTTAAATGGTATGGACTGCAGACGTAATCTTGCAGAAAGCAATCGTAATTGCTTT
GTTGCGGGTGATATAAGGGCAAATGAACAGATTGGTTTAGCTGCTATGCACACTATCTGG
ATGAGAGAGCATAACCGTATCGCAACAGAACTAAAAGCCATAAATCCCTTCTGGGACGGA
GAAAAATTATACCAAGAAGCGAGAAAAATTGTCGGAGCGCAAATGCAAGTCATAACTTAC
GAACAATGGCTGCCTCTCATTCTTGGTCCAGAGGGATACGAACAGCTGGGAAAATACAAG
GAATATGACCCTAATCTAAACCCTTCAGTCTCAAACGTTTTCGCCACTGCTGCTCTTCGA
TTTGGACACTCTATCATTAATCCACTTTTACATCGTTATGACGAGAACTTTGAGCCGATC
CCTCAAGGTCATTTACTGTTGCGTCATGCATTTTTCTCCCCATGGAGACTAGTCGATGAG
GGTGGAGTTGATCCGCTATTTAGAGGAATGTTCACGACGCCTGCTAAATTGAAGACACCA
ACACAGAATTTAAACTCTGAACTTACGGAAAAACTATTCCATACTGCACATGCAGTCGCT
CTTGACTTAGCTGCAATAAATATTCAACGAGGACGTGATCATGCTATTCCACCGTACAAT
AAATGGCGGCAATTTTGCAATATGACCGAGGCTAACGATTTCGATGACTTGGCCAATGAG
ATCACTGACAAAACCGTACGAGACAAGCTAAGAGAATTGTATGGCTCTGTGCACAATATT
GATGTTTGGGTTGGTGGCATTTTAGAGGATCAAGTTGAGGGAGGTAAAATAGGACCTCTT
TTCCGATGCTTACTTATTGAACAGTTTCAACGATTACGTCATGGCGATCGTTTGTGGTAT
GAAAATCCGTCGACATTCTCAAGAGACCAATTGCGACAAATCAAAAACGCAAACTTTGCA
AGGGTTTTATGTGATAATGGTGACAATATTGATACAATAAGTGAGAATGTATTCTTGTTA
CCTGAATTACAGGACGGTCTTGTATCTTGCGAGGATGTCCCTAAGATCGATCTACGTTTT
TGGGCCGACTGTGAATCATGCGGCGATGATGATTACGAAACTGAATCAAATCGAGTGCGC
AGAGATGTAATGTCAAGTGCCGATCTTTACACTGAACTGACAGAAAATGATCACCGTCTA
AATACCCTAGAAGATTCTCACGAGGAATTGGTGAAAGCAATTAATAAGCTTAAAAAGAGG
GTCAAAGAGTTAGAGAAAGCATGCAATAAGTAA
Protein sequence:
MSHLEQLYLHVNEIHQIEPETFSNLPRLGRLYLHNNNLKTIPPGSFRGMPKLSKLRLDSN
ALVCDCNMLWFARMLAEHRNITIAATCYEPAKATGTSLAAMQEKDFHCRQPEIMSDPEDV
VVNFGDEAIFTCMASGEPAPEIVWFRDSAALPDDTSRYEIMDNGTLMVHHADENDIGVFE
CSAKNPAGEARSKPARMMLQTKPDNNGAITFPVFTILPRKSVVNINQPYARFDCVAKGNP
KPHISWYFNGERILLTDRITMHHNGSIVIENIKYEDTGSYTCQAENVNGKITASVTLEVM
VAPAFIIVPKDQTVTIGDSAHFRCTARGTPTPIIKWYRNTMSLPPSENIVFSDNDQNLTI
VETSEDDAGLYHCRAENSEGLTEISAVLKIESFEIIPPKITLKPEDTDAFKETTVQLPCE
YESDPPALVEWRKDGSRIITNDRISISLIGSLIINNVSITDTGSYECSVHNEHGRDTASS
FLTVKDHILPGDEYVNIAITEAIRDVDQAIAKSIDNLFNNKSSNISFQDLYRITRFPNAP
AREVARAAEIYERTLDKVKGFIQSGLKITSAQPFNYENILSAQHLEIIARLSGCVAHRES
KDCSDMCFHKKYRSIDGSCNNFDQPTWGTSLTAFRRILFPIYENGFSEPTGWNKKVKYNG
YSLPSARLVSTTIISTTEISEDVRITHMTMQWGQWLDHDLDHALPSASSQTWDGVDCKKT
CDYAAPCFPIDVPKNDPRITNRRCIDFIRTSAVCGSGMTSVLFGRLQPREQINQLTSYID
ASQVYGFEKSVAEDLRDLTNTNGTLRVGAKFPGKKPLLPTTGLNGMDCRRNLAESNRNCF
VAGDIRANEQIGLAAMHTIWMREHNRIATELKAINPFWDGEKLYQEARKIVGAQMQVITY
EQWLPLILGPEGYEQLGKYKEYDPNLNPSVSNVFATAALRFGHSIINPLLHRYDENFEPI
PQGHLLLRHAFFSPWRLVDEGGVDPLFRGMFTTPAKLKTPTQNLNSELTEKLFHTAHAVA
LDLAAINIQRGRDHAIPPYNKWRQFCNMTEANDFDDLANEITDKTVRDKLRELYGSVHNI
DVWVGGILEDQVEGGKIGPLFRCLLIEQFQRLRHGDRLWYENPSTFSRDQLRQIKNANFA
RVLCDNGDNIDTISENVFLLPELQDGLVSCEDVPKIDLRFWADCESCGDDDYETESNRVR
RDVMSSADLYTELTENDHRLNTLEDSHEELVKAINKLKKRVKELEKACNK
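
Consistency check (illustrative): the CDS length, nucleotide sequence, and protein sequence in this record should agree with one another. The sketch below is not part of the original annotation. It assumes the standard genetic code (NCBI translation table 1), and the helper name `translate` is hypothetical. It translates the first and last lines of the nucleotide sequence above and compares the codon count with the 1250 residues of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGTCTCATTTGGAACAGCTGTACTTGCACGTAAACGAGATTCATCAGATCGAACCAGAA"
last_line = "GTCAAAGAGTTAGAGAAAGCATGCAATAAGTAA"

cds_length = 3753                      # "CDS Length" field
codons = cds_length // 3               # 1251 codons, including the stop codon
genomic_span = 69600 - 61246 + 1       # 8355 bp covered on scaffold35

print(translate(first_line))           # MSHLEQLYLHVNEIHQIEPE
print(translate(last_line))            # VKELEKACNK*
print(codons - 1, genomic_span)        # 1250 8355
```

The 3753 nt CDS gives 1251 codons, that is 1250 residues plus the terminal TAA stop, which matches the protein sequence. The 8355 bp genomic span is longer than the CDS, which would be consistent with the gene model containing introns.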