New model in OGS2.0 | DPOGS202909  |
---|---|
Genomic Position | scaffold273:+ 48692-52795 |
CDS Length | 1584 |
Paired RNAseq reads   | 443 |
Single RNAseq reads   | 1097 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA004182 (1e-82) |
Best Drosophila hit   | CG3589 (3e-16) |
Best Human hit | peroxisomal leader peptide-processing protease isoform a (6e-25) |
Best NR hit (blastp)   | heat shock protein 70 HSP70 interacting protein, putative [Pediculus humanus corporis] (2e-53) |
Best NR hit (blastx)   | hypothetical protein BRAFLDRAFT_85874 [Branchiostoma floridae] (3e-28) |
GeneOntology terms    | GO:0008233 peptidase activity; GO:0006508 proteolysis; GO:0005777 peroxisome; GO:0004252 serine-type endopeptidase activity |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like; IPR001254 Peptidase S1/S6, chymotrypsin/Hap |
Orthology group | MCL15592 |
Nucleotide sequence:
ATGACGGTCGAGGGTGTCATGGTGTCCTACAACTATTCGGATGACGCGGAACACGCGAAC
ATACTGACAGTGTCGGCATCAGGAATAAAGTTCTCCAAGCGATGGGTTCTCACGCACGGT
TCAATATTGTCGCCACTAAAACAGGCCAACGTTATCAAGAACGCTCGAGGCAAACCCATT
TTAAACGACGAGTTCTATGACAACCTTCCGGAAATATACGTCACCTGTGAGAAGGTTAAA
TCCAAGACACCAAACATGTACGAGAACCTCGAGATCCTGAGCAGAGAGAGATCGCTTAAC
AACGATGCTGACTTGGAACATAGCAGCTATCAAGTTAGAGTGCTCACTGGAAGGATATGC
CACGTGTGGCAATGTCCTGTGCTGGATCGCTGCGTGGACAACATCCTGTACAGTTGGACC
ATCGGTCACCGGGACGGAGACGTGGAGAAACAGACGCAGCTGGGGAAGGCGCTGCTTTCC
GTGTTCGTGCTGGTAGACCTGGAGACGGATAATAGAGATTTTAAGATCCTGAGGCCGCTG
TCTGAGCTACTGGACATGTGTCAGCCGCCGCCCGACCGCGGCGCCACCGTCGACATACAT
TCCACGCCCTTTGGATGCGAGGTGTTCCTGAACGCGGTGACTCGTGGCTCCGTGTGTGGT
GTCGTGGGCAAGCGACCGTCCCTCTTACTGACAGACGCTGCCACCGCCTTGGGCTCGGAG
GGAGGGCCAGTCTTCACCGCGGGACCCGACAATCATCTGGTGGGCGCGGTGGTGTGTTCC
GTGTCGTGGTGGCGCGGGGAGTGGGTGGGTCTCACCCTGGCCGCCCCTCTCAAGTCGGTG
CTCGCGGCTAAGCTGAGAGTCCAACAGCCGCTCCCCGCTAGGACACCGCCGTCGCCGCTG
TACGTTAGGATACTAGAGCTGGTGGACCGCAGTACGGTGCTGGTGAGGTGCGGGGCGGCC
TGGGGCGCCGGACTCTATCTGGGGGGAGGACACGTGCTCACGTGCGCGCACGTCGTCAAA
CATCACGCGTCTCACAAAGTGTCGGTGTACTGTGACAACGTGAAGGAGACGGCCGCGGTC
CGCTACAAGACCAAAGACGACCTAGCTTACGATCTCGCCTTGCTATACGTGTCACCGGCT
CATTGGAGACACCTCCTGCCGGCGGTCTTCGCTGAGGAGTCAGCACAGAAAGGCGAGTCT
GTGCTGGCGGCGGGGTTCCCGTACTTCAACGAGACCAACCTGGAGGAGCTGAAGCCGACC
GTCACCAGCGGCCACGTCAACAACGTCTCCCCGTCACTCATACAGACCACCTGCTGCGTG
CAATCAGGGTTCAGCGGAGGTCCGATATTCCGTATAACAAAGGAGCTCAAGGTGGAGGTG
CTGGGTACGATCGTGTCCAACGCTAAGACGGAGACGGGCGCTAGCTACCCCTACATCAAC
ATGGCCGTCCCCACCAAGGCGTTCATACGCCTCGTGCAACACTTCATACTGGAGAGGGAT
GAGAATGTCCTCTCCCAGATTGAAAACAAAAAAGATATGATCCAATCACAGTGGAGGTTG
CTGCCTTATAGATCTAAGATATGA
Protein sequence:
MTVEGVMVSYNYSDDAEHANILTVSASGIKFSKRWVLTHGSILSPLKQANVIKNARGKPI
LNDEFYDNLPEIYVTCEKVKSKTPNMYENLEILSRERSLNNDADLEHSSYQVRVLTGRIC
HVWQCPVLDRCVDNILYSWTIGHRDGDVEKQTQLGKALLSVFVLVDLETDNRDFKILRPL
SELLDMCQPPPDRGATVDIHSTPFGCEVFLNAVTRGSVCGVVGKRPSLLLTDAATALGSE
GGPVFTAGPDNHLVGAVVCSVSWWRGEWVGLTLAAPLKSVLAAKLRVQQPLPARTPPSPL
YVRILELVDRSTVLVRCGAAWGAGLYLGGGHVLTCAHVVKHHASHKVSVYCDNVKETAAV
RYKTKDDLAYDLALLYVSPAHWRHLLPAVFAEESAQKGESVLAAGFPYFNETNLEELKPT
VTSGHVNNVSPSLIQTTCCVQSGFSGGPIFRITKELKVEVLGTIVSNAKTETGASYPYIN
MAVPTKAFIRLVQHFILERDENVLSQIENKKDMIQSQWRLLPYRSKI
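
Consistency check (illustrative): the nucleotide and protein sequences above can be cross-checked against the listed CDS length. The sketch below is a minimal example, not part of the original record. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and the names `translate`, `FIRST_60_NT`, and `LAST_24_NT` are hypothetical helpers introduced only for illustration. It translates the first and last lines of the nucleotide sequence and derives the expected protein length from the CDS length of 1584.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position. Using this table
# for the record above is an assumption.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence, copied from the record.
FIRST_60_NT = "ATGACGGTCGAGGGTGTCATGGTGTCCTACAACTATTCGGATGACGCGGAACACGCGAAC"
LAST_24_NT = "CTGCCTTATAGATCTAAGATATGA"

print(translate(FIRST_60_NT))  # MTVEGVMVSYNYSDDAEHAN
print(translate(LAST_24_NT))   # LPYRSKI*

# The CDS length counts the stop codon, so the protein should be one
# residue shorter than the number of codons.
CDS_LENGTH = 1584
print(CDS_LENGTH // 3 - 1)     # 527
```

The two translated fragments match the start and end of the protein sequence listed above, and the final codon is a stop, which is why the protein is one residue shorter than the codon count.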