New model in OGS2.0 | DPOGS209262  |
---|---|
Genomic Position | scaffold431:+ 119835-131423 |
CDS Length | 4014 |
Paired RNAseq reads   | 2957 |
Single RNAseq reads   | 7484 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007042 (0.0) |
Best Drosophila hit   | CG10211 (0.0) |
Best Human hit | peroxidasin homolog precursor (3e-89) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC005493 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC005493 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0004601 peroxidase activity; GO:0006979 response to oxidative stress; GO:0055114 oxidation reduction; GO:0020037 heme binding |
InterPro families    | IPR002007 Haem peroxidase, animal; IPR010255 Haem peroxidase; IPR019791 Haem peroxidase, animal, subgroup |
Orthology group | MCL15210 |
Nucleotide sequence:
GTTATCTTTATTTGTTTCAGGTTAAAACGAAGTATATTGCTATACGCATTATTTTTAACT
CAAATATTAGCACAGAAAACAAAAACAACAGATGACAATCTACAGACAGTTCCAAGTGAC
CTCAGAAGAGCTGTTAGCGAGGCACTAGCTTTAGAGAGGAGGTTTTTATTAGGAAGTAAC
GACGTGAGGAACTGTTCATACGACGAGATCAGCAATACGCCATGTCCCCCCAGCAAGTAT
CGAAGCGCGAGTGGAGAGTGCAACAACGTTCGACATAGACCATGGGGAAGGCGAGGCGAC
GTGTTCCTTAGATTATTACAACCACACTACGCTGATGGAATATCACAACCTTTCGCAAGT
CCGAAGCTGCCGGAGCCAAGGCTGGCCGTCCAGGCAGTATCGCAGTTGGCTGAATCAGTC
GGTCACGATTACGTAACCAGCTTACTCGCTGCCTGGGGACAGTTTCTTATGGATGATCTC
ATTGCCACCTCTAATCAGAACCAAAAGTGTGAATCTGGTTGCGAATATGTTCGCTCAGCG
CCTACAAGAAATTACGATTCCTGTGGATTTGAATACCGTGATCAGATGAATTTAGCGACA
TCGGTACTAGATGGCTCTGCTCTGTATGGAAATTCTGAAAAAGAGCTGTTGTCTCTCAGA
CTATACGATGCTGGCAAAATGGATATATCCTCCTGCCGAAAATGCAACGAAAATTCACCT
ACCGCACCACTTTATAAGGCTCTCTTGACTGAACACAACCGTATTGCTGGCGAATTATTT
TCGATGAATCCCTTCTGGGAAGATAATGCCCTGTTCTTAGAAGCAAGACGAACAATAGCA
GCAGTTATTCAACATATTACATACAATGAATTTTTGCCTGTACTTTTGGGCGAAGTTGGA
ATGGCAAAAGCAGATTTAAAACTAACCACCCACGGTTTCTGGCGTGGATATTCAAGTGCA
AATCGTGTCGGAGCTTATGCGGAGCTTGTTGCGGTTGCACCAATTTTTAACGCCATGATG
AATGAGAAACTTATAAACACAACAATTCTTCTAAAAGACTTGGTCAAGACCAGCGCTCAT
CAAATCTCAAGATTCGCCTTATCAGCTCAATGGGATCTTAACCGAGCTCGTGACCATGGT
GTACCTTCGTATCTAAAAGTCTTGCAGCTGTGTGATCCCATGGCAAATGTTAAGTCATAC
GCTGAATTTGAAAAATTAGGTTTTGACAGAAGACATCAAGAAATATTCGCTGATATGTAT
AGAAACGCTGAAGATATTGAACTGATGGTAGCTGGAGCCATGGAGAAACCAGCTACTGGC
GCAGTTATTGGGTCTACCTTAGCATGTGTATTAGCACTCCAATTTGGAAACCTAAAGAAA
AGTGATAGGTTTTGGTATGAAAATGACATTCCTCCATCATCATTTTCAATAGAGCAGTTA
GCAGCAATCAGAAAAGTATCCATTGCTGGACTTTTATGTTCTGCTGATGAAGGACTTGAT
AATGTGCAACCTAGAGCTTTCGTAAAGGAAGATACATTCTTAAATGCAGCCCAACAATGC
TCTCAACACCCTCGGCTGGAGCTTTCTTCGTGGCGCGATGAAAGTGGTGCCCGGGCTGCA
GAGCGCCTCTCACAAGACATGCTGGCAGCTGCACTTCAGAAGGCAAAGCAGGAGATGGCT
GACCGCAAGAAACTGGAGTACATGTTATGGGAAGCACATGGAGGAGCCGACCCAAAATCT
CCTGTCGGCACAGCTGCATCTTTCTCAAAAGCAAATAAATATGCCCTTAAGTTGGCAAAC
ACCTCCTTATTCTTCGAATTCGCTACTAACGAACTCGTTAACTCTCTTAATGGACGACGA
CGCAAACGTCAGATCTTCGATGACTCCCTCGGCTTTGGATCAACCGATTTTGTAGAGTCT
CTTCAATCAGTAGACGTCAGTGGTTTTCTAGGGCAGGACCAACTTGGGCCGGTTATTGAA
CCGCAGTGCGATGATAATGGAAACTGCGATCCTGATAGTCCTTTCCGAAGCTACACTGGC
CACTGTAATAATTTGAGAAATCCTAATTTGGGAAAAAGTTTAACAACCTTCGCGAGACTG
TTACCTCCTGTTTATGAAGATGGTGTAAGCCGGCCCCGCATCAACTCAGTAACAGGCACC
CCGCTACCTTCCCCCCGTATAGTTTCTACGGTCATACATCCCGATATATCAAATCTTCAT
ACGCGCTACACTTTGATGGTGATGCAGTTCGCCCAATTCCTGGACCATGAACTGACAATG
ACTCCCATTCACAAAGGCTTCCACGAATCTATTCCGGACTGCAGATCTTGCGACTCTCCC
CGTACAGTGCATCCAGAATGCAACCCTTTCCCAGTACCTCGCGGTGACCATTATTATCCA
GAAGTTAACATAACTTCCGGAGAACGATTATGTTTTCCATTCATGAGAAGTTTACCAGGA
CAGCAGCAATTAGGACCGCGTGAACAAGTCAATCAAAATACAGCTTTCATTGATGGATCG
GCGATTTATGGAGAAAACCCTTGCATTGTTCGTAAACTGCGAGGTTTCAATGGCAGGCTC
AACGCTACTGCAAACCCGATTAATGGAAGAGATTTGTTGCCTAGAACAGATAACCACCCT
GAATGCAAAGCAGCCAGTGGTTTTTGTTTTATTGCTGGTGACGGTCGAGCTTCAGAACAG
CCTGGACTCACGGCTTTACACACAATCTTCATGCGCGAACACAACCGCATAGTTGAAGGA
CTTCGTGGTGTCAACCCTCATTGGGATGCGGAATTATTATTCGAACACACTAGGCGTATA
GTTGCTGCCTCATTCACACATATCATTTATAATGAATTTTTGCCAAGACTTTTGTCTTGG
AACGCTGTTAACTTGTATGGACTCAAATTATTACCTTCAGGTTACTATAAGGAATACTCT
CCAACCTGCAACCCGTCTATTGTAACGGAATTTGCAGCAGCCGCCTTCAGATTTGGTCAC
TCGTTGTTGAGACCACACTTACCGAGACTCTCACCTTCCTATCAACCAGTTGATCCACCA
ATATTGTTGAGAGATGGATTTTTCAGGCCTGACATGTTCATGAATCATCCACCAATGGTT
GACGAACTTATTCGTGGTTTATCTTCCACGCCCATGGAGACCCTTGACCAATTCATAACA
GGAGAAGTTACCAACCATCTATTCGAAGACCGGAGAATTCCGTTTTCGGGTATAGACTTA
GTAGCTCTTAATATCCAAAGAGGTAGAGATCACGGTATACCGAGTTATAACAACTATAGA
GCTTTGTGTAACTTAAAGAGAGCAGCAACTTTCGAAGATTTGGCGAGGGAAATTCCCGAT
GAAGTAATTGCGAGATTTAAGCGAATATACGCTACAGTAGACGACATTGATCTATTCCCT
GGTGGTATGAGCGAACGACCACTACAGGGCGGTCTAGTTGGACCCACCTTCGCCTGCATC
ATCGCTATACAGTTCAGGCAGTTAAGGAAATGCGATCGATTCTGGTATGAGAACGACAAC
AGAGCAGCTCGCTTCACCGAACAACAATTGTCGGAAATTCGCAAAGTAACATTGTCCAAG
GTTCTATGTGACAATTTCGATTTGCCAAGCGACATTCAACGCGCCTCTTTCGATCTACCT
AGCAACTTTTTGAATCCTCGCGTGCCATGCGCGTCTCTTCCAAAACTGGACCTTTCCGCA
TGGCGTGAGAGTTCAGCCCAGGGCTGTCTCATCGCGGGTCGCTCGGTACGACTTGGTGAC
TCCGCCTTCCCTTCGCCCTGTACATCGTGTATATGCACCGTTGACGGGGCGCAGTGCGCA
TCCCTACGCATCACAGACTGTGCACAGCTATGGCGTGAATGGCCACGAGAAGCTGTGCTA
AGAGATGATGTATGCACAGCACAGTGCGGCGCCGCCCCCGCAGGTCAGAGAGCGCCGCGG
AGACCACACGCTCACTTCAAATTCCCCGATCTTACACCATTCATCGCTAAATAG
Protein sequence:
VIFICFRLKRSILLYALFLTQILAQKTKTTDDNLQTVPSDLRRAVSEALALERRFLLGSN
DVRNCSYDEISNTPCPPSKYRSASGECNNVRHRPWGRRGDVFLRLLQPHYADGISQPFAS
PKLPEPRLAVQAVSQLAESVGHDYVTSLLAAWGQFLMDDLIATSNQNQKCESGCEYVRSA
PTRNYDSCGFEYRDQMNLATSVLDGSALYGNSEKELLSLRLYDAGKMDISSCRKCNENSP
TAPLYKALLTEHNRIAGELFSMNPFWEDNALFLEARRTIAAVIQHITYNEFLPVLLGEVG
MAKADLKLTTHGFWRGYSSANRVGAYAELVAVAPIFNAMMNEKLINTTILLKDLVKTSAH
QISRFALSAQWDLNRARDHGVPSYLKVLQLCDPMANVKSYAEFEKLGFDRRHQEIFADMY
RNAEDIELMVAGAMEKPATGAVIGSTLACVLALQFGNLKKSDRFWYENDIPPSSFSIEQL
AAIRKVSIAGLLCSADEGLDNVQPRAFVKEDTFLNAAQQCSQHPRLELSSWRDESGARAA
ERLSQDMLAAALQKAKQEMADRKKLEYMLWEAHGGADPKSPVGTAASFSKANKYALKLAN
TSLFFEFATNELVNSLNGRRRKRQIFDDSLGFGSTDFVESLQSVDVSGFLGQDQLGPVIE
PQCDDNGNCDPDSPFRSYTGHCNNLRNPNLGKSLTTFARLLPPVYEDGVSRPRINSVTGT
PLPSPRIVSTVIHPDISNLHTRYTLMVMQFAQFLDHELTMTPIHKGFHESIPDCRSCDSP
RTVHPECNPFPVPRGDHYYPEVNITSGERLCFPFMRSLPGQQQLGPREQVNQNTAFIDGS
AIYGENPCIVRKLRGFNGRLNATANPINGRDLLPRTDNHPECKAASGFCFIAGDGRASEQ
PGLTALHTIFMREHNRIVEGLRGVNPHWDAELLFEHTRRIVAASFTHIIYNEFLPRLLSW
NAVNLYGLKLLPSGYYKEYSPTCNPSIVTEFAAAAFRFGHSLLRPHLPRLSPSYQPVDPP
ILLRDGFFRPDMFMNHPPMVDELIRGLSSTPMETLDQFITGEVTNHLFEDRRIPFSGIDL
VALNIQRGRDHGIPSYNNYRALCNLKRAATFEDLAREIPDEVIARFKRIYATVDDIDLFP
GGMSERPLQGGLVGPTFACIIAIQFRQLRKCDRFWYENDNRAARFTEQQLSEIRKVTLSK
VLCDNFDLPSDIQRASFDLPSNFLNPRVPCASLPKLDLSAWRESSAQGCLIAGRSVRLGD
SAFPSPCTSCICTVDGAQCASLRITDCAQLWREWPREAVLRDDVCTAQCGAAPAGQRAPR
RPHAHFKFPDLTPFIAK
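
The relationship between the CDS Length field and the two sequences above can be checked with a short script. The sketch below assumes the standard genetic code and that the nucleotide sequence is read in frame from its first base. The function name `translate` is illustrative and not part of any database tooling. Only the first and last 60-column lines of the record are reproduced as literals.

```python
# Minimal sketch: translate a CDS with the standard genetic code and
# compare the result against the protein lines of this record.
# Assumption: the sequence is in frame starting at its first base.

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last nucleotide lines of the record.
first_nt = "GTTATCTTTATTTGTTTCAGGTTAAAACGAAGTATATTGCTATACGCATTATTTTTAACT"
last_nt = "AGACCACACGCTCACTTCAAATTCCCCGATCTTACACCATTCATCGCTAAATAG"

print(translate(first_nt))
print(translate(last_nt))

# CDS Length 4014 corresponds to 1338 codons: 1337 residues plus the stop.
cds_length = 4014
print(cds_length // 3 - 1)
```

Run on the full nucleotide sequence, `translate` should return the protein sequence followed by a single `*` for the terminal TAG codon.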