New model in OGS2.0 | DPOGS202185  |
---|---|
Genomic Position | scaffold4468:- 6370-15017 |
See gene structure | |
CDS Length | 2439 |
Paired RNAseq reads   | 756 |
Single RNAseq reads   | 1868 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013482 (0.0) |
Best Drosophila hit   | CG6969, isoform A (8e-159) |
Best Human hit | peroxidasin homolog precursor (7e-96) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC004579 [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG6969-PA isoform 1 [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0004601 peroxidase activity GO:0006979 response to oxidative stress GO:0020037 heme binding GO:0055114 oxidation reduction |
InterPro families    | IPR002007 Haem peroxidase, animal IPR019791 Haem peroxidase, animal, subgroup IPR010255 Haem peroxidase |
Orthology group | MCL15031 |
Nucleotide sequence:
ATGCAGAGACCTCAGGGCAGCAGTGAGCGGACTCCGTTGGTGCCGCCCACTTACATGTTC
GAGTCCAGCATCTCCAGGAGCTACCAGAAACGTCTCAGAAACTTCCAGTGCGCCGTCTGT
GTCGTATTGATCCTACTACTGTCGGTAACATTGTTGGTGACCATATCCTACAACCTCACC
CTCGGCGGCCCTGAGTTTCCGGAGGTGGTGTCTCCGACGACTCCGTCCTCTCCTGAGGGC
AACCTCACCCGCCGTCTCATGCTCAGCCCTGACCTGATGCCGATCATGAACAGAACCTGG
CCTCTCAATGGTCGCCCTATCCCAAAATGGAAAGCCGAAACAGTAAGTCCGGAAGCCATA
GACGCCGCTGTACAGAAAGGCAAAGCTATGCTAGTGAAGCGTAGAATAATAGAGCGAAGC
CTTACTCCTCTCGACTCAGAGTCGCCGGCCTTCAGAGGCCAAAGGGCGGCAGCTACGTCG
GCCCTGGTTAAACCGATCGCCGAGACAGCGTACGCCGTAGAAGAAGCAACCAGAGAATTA
CTGAACAGCACTGAGATACCCGATGCGGTTGGCGCAGTCGGTGTGGGTCCAGCAACCAAC
GGGTCGTTCCCGGAGCCAGCGTACTGCCGCCCGCCCACCGCGCCCTGCGTCATCTCTAAG
TACAGGACGCAGGATGGCTCTTGTAACAACCTGGACCATCCTCTACTCTGGGGCGTCTCC
AATACACCGTTCAGACGAGTCCTCCCACCAGACTACGGTGACGGTGTAAGCTCCCCCCGC
ACTGGATGGAACGGCGCTCCTCTGCCCAGCGCTCGAGATGTCAGCGTAACGGTGCACAGA
CCCAGCTACGCTCACGACACACAGTTCACCGTGATGCTCGCCGTGTGGGGACAGTTCATA
GACCACGACATCACAGCCACCGCTCTCAACAAGGGAGCCAACAGCACTCCCATCTCTTGC
TGCACCGACATGACAATACACCCGGAGTGCTTCCCGGTGAAGCTGGACCCGGAGGACCCC
TTCTACCAGGACTACAACCTCACATGCATGGAGTTTGTGAGGTCAGCGCCTGCGCCTACC
TGCCATTTCGGTCACCGCGAGCAGCTGAACCAGGCGACAGCGTTCCTGGACGCGTCGACG
GTCTACAGCTTCATGGAGAACAAGACCAACCAGCTCCGTGCGGGAGCCAACGGTCAGCTG
CGGATGTTGAAGCTCGGCCCCTGGGAGCTGCTGCCGCCCTCCACCGACCCCAACGACGGA
TGTAACACGGTCGAGATGAACGCCAAAGGACGCTACTGCTTCGAATCGGGCGACGACCGC
GCTAACGAGAACCTCCATCTGACGACGATGCACCTGCTATGGGCCCGACAACACAACCGC
GTGGCAGCGCGCCTCCAGCAGCTCAACCCCGCCTGGGACGACCAGCAGCTGTTCCAGGAG
ACGCGCAGGATAGTCGGAGCCCAGATGCAGCATATCACATACGCAGAATTCTTACCATCT
ATACTAGGGGAGGACGTGATGTGGTCGTTGAACCTCACGCTGCAGGAGTCAGGGTACGCG
ACCGTGTACGACTCCGCAGTGGACCCTTCCATCGCGAACCACTTCTCCGCCGCAGCCTTC
AGATTCGCTCACACGCTGCTGCCGGGCCTGATCCATAACGTGGACCTGAGCACGGGCACG
GTGAGCTACACGCACCTCCACGAGATGTTGTTCAACCCGTACGCGCTGTACAACGAGCAG
GGGTCCAAGAGGTCCGTGAGGTCCGCCATCTACACGCCCGTGCACGCCGTGGATCCCCAC
ATCACCAGCGAGCTGAGCAATCATCTCTTCGAGCGCAGCGTCGCCAACAGCAGCAGCAGT
GTGAAGGGTGCCAATCCCCTGCCGTGCGGACTGGACCTGGTGTCGCTGAACATCCAGCGA
GGCCGCGACCACGGCTTGCCCGCCTACCCTGCCTGGAGGGAGCACTGCGGCCTCTCCCGC
CCGCACACCTTCGAGGACCTGGAACCGATCTTTGACGAACTGTCCTTGAGCAGGATTTGC
AAAATATACAAGAGCGTCGATGACATAGACCTGTACACGGGCGCCCTGGCTGAGGACCCC
AAAGGCCGTCTCCTGGGCCCCACGCTCACATGTCTCGTAGCGGATCAGTTTCTGCGCATC
AAGGTCGGCGACCGCTACTGGTACGAGACCTCGGATCCAGATATTAAATTTACTCCAGAA
CAACTGTACGAAATCCGTAAGACGACCCTGGCGGGAGTGATCTGCGCTAACGAGGGTCTG
CTGGATCAGGCGCAGCCGCGCGTCATGGAGGCTCTGAGCGCCACCAACCCGCTGGTCGAC
TGCAAGGAACTCCCGCAACCTGACTTCAAACCTTGGAAGGATCCCGACCCGAACCAGCCG
ACCAAGAAACCATCGAGCAAAAACAACAACAAAGGATAA
Protein sequence:
MQRPQGSSERTPLVPPTYMFESSISRSYQKRLRNFQCAVCVVLILLLSVTLLVTISYNLT
LGGPEFPEVVSPTTPSSPEGNLTRRLMLSPDLMPIMNRTWPLNGRPIPKWKAETVSPEAI
DAAVQKGKAMLVKRRIIERSLTPLDSESPAFRGQRAAATSALVKPIAETAYAVEEATREL
LNSTEIPDAVGAVGVGPATNGSFPEPAYCRPPTAPCVISKYRTQDGSCNNLDHPLLWGVS
NTPFRRVLPPDYGDGVSSPRTGWNGAPLPSARDVSVTVHRPSYAHDTQFTVMLAVWGQFI
DHDITATALNKGANSTPISCCTDMTIHPECFPVKLDPEDPFYQDYNLTCMEFVRSAPAPT
CHFGHREQLNQATAFLDASTVYSFMENKTNQLRAGANGQLRMLKLGPWELLPPSTDPNDG
CNTVEMNAKGRYCFESGDDRANENLHLTTMHLLWARQHNRVAARLQQLNPAWDDQQLFQE
TRRIVGAQMQHITYAEFLPSILGEDVMWSLNLTLQESGYATVYDSAVDPSIANHFSAAAF
RFAHTLLPGLIHNVDLSTGTVSYTHLHEMLFNPYALYNEQGSKRSVRSAIYTPVHAVDPH
ITSELSNHLFERSVANSSSSVKGANPLPCGLDLVSLNIQRGRDHGLPAYPAWREHCGLSR
PHTFEDLEPIFDELSLSRICKIYKSVDDIDLYTGALAEDPKGRLLGPTLTCLVADQFLRI
KVGDRYWYETSDPDIKFTPEQLYEIRKTTLAGVICANEGLLDQAQPRVMEALSATNPLVD
CKELPQPDFKPWKDPDPNQPTKKPSSKNNNKG