New model in OGS2.0 | DPOGS215092  |
---|---|
Genomic Position | scaffold1700:+ 48761-57866 |
See gene structure | |
CDS Length | 2793 |
Paired RNAseq reads   | 2392 |
Single RNAseq reads   | 5883 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007194 (0.0) |
Best Drosophila hit   | CG8798, isoform C (0.0) |
Best Human hit | lon protease homolog, mitochondrial precursor (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to AGAP010451-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to AGAP010451-PA [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0005759 mitochondrial matrix GO:0006508 proteolysis GO:0004176 ATP-dependent peptidase activity GO:0005524 ATP binding |
InterPro families    | IPR004815 Peptidase S16, ATP-dependent protease La IPR020568 Ribosomal protein S5 domain 2-type fold IPR015947 Pseudouridine synthase/archaeosine transglycosylase-like IPR001984 Peptidase S16, Lon protease, C-terminal IPR008268 Peptidase S16, active site IPR008269 Peptidase S16, Lon C-terminal IPR003111 Peptidase S16, lon N-terminal IPR003959 ATPase, AAA-type, core IPR003593 ATPase, AAA+ type, core |
Orthology group | MCL11521 |
Nucleotide sequence:
ATGCACATAGCGAGTGTATTAGTGCGTAATACCGCACTTCTTAATCCCTCGATTAGGCCT
TCATCGCAAACTGTTCGTAATGTAACCAAAATTGCATCATATTGCAAGCCGGTAGGAAAT
CGTTTTTTTAATGGACACAATTTGTACGGAACTCGCAACGCTCGGATATGCTCGTATAAC
CAAGAATATGCAGCCGTGAAAAAGGTACAGAACATTAGACATTATTCGAAGAAGCTTAAT
CCGGAAGAAGAGGAATCAGCTGATATTAAAGAGGACCCGCCATTGTTCTCAAGCCAGCTA
CCAGCAACTGTGGCTGTGCCTGAAGTGTGGCCGCAAGTGCCCGTTATTGCCATTAATAGG
AACCCCGTTTTTCCAAGATTTATTAAATTAATTGAGATATCAAACCCAGCTTTAATAGAT
CTAATAAGGCGTAAAGTGAAACTGAATCAGCCGTATGTTGGTATATTTTTGCGTAAGAAA
GAAGACGAGAAATCAGATGTTGTGTCGAGTTTGGACGATCTTCATGATGTGGGGGTGTTC
GCTCAGATCCACGAGATGCAGGATATGGATTACAAGCTACGTCTAGTCGTTATGGCACAC
AGAAGAATAAAAATCACCGGCCAGTTTATAGAAGACGAGATCGAAACTGGCCCAGCCGAA
ATGAAGCTAAAGTTTCCCGTATTTAACGTGGAATTTAACGTTACCCGCGAAGAATCAGAC
GCTGAGCGACGTAGGAGGAAATATCGTAACACGAGACGGCAACGTAACGACTCGGACGCG
GAACACGAGAAGGAGGTGCAGGAACCAAAGGAAGCTAAGAAACCTCCGCCGGACCAGCTT
ATGATGGTCAAAGTGGAGAATATGATGCATGACAAGTTCCAGCAGAACGAGGAGGTGAAA
GCGTTGACGCAGGAGATCATCAAGACTATCAGGGATATCATCAATATGAACCCCCTGTAT
AGAGAATCTCTGCATCACATGCTAGCTCAAGGTCAGCGTGTTGTGGACGATCCCGTGTAC
CTCGCGGATTTAGGCGCCGCCTTAACCGCAGCTGAGCCCAAGGACCTACAGCCGGTTCTT
GAGGAGATGGATATTCCGAAACGACTGTTACTATCATTATCACTGCTGAAGAAGGAATAT
GAACTGTCCAAATTGCAGCAGAAAATCGGTAAGGAAGTTGAAGAAAAGGTGAAACAGCAG
CACAGGAAATACATTCTGCATGAACAACTCAAGGTTATAAAAAAAGAATTAGGTCTTGAG
AAGGATGACAAAGACGCCATTGGTGAGAAATTCCGCGAGAGACTGGCTGATAAAGTGGTA
CCACCCTCTGTTCAGACGGTCATTGACGAGGAGCTCAACAAACTGAACTTCCTAGAGAGT
CATAGCTCAGAGTTCAAGTTAGTATGGTCGATAACGTTCAATAAAAGTATCTGCTTTCAG
TTTGTTGAGCTCCTCGTCAATGACCGTCTGAACAGAGGGTGGTACCACTTTATCAGCCAG
TCTCTCGCGGAATTTCTCACCAATGGCGTGGGCAAAACTAGTATAGCCCGTTCCATAGCC
AGAGCGTTGAACCGTAAGTATTTTAGGTTCTCAGTGGGCGGTATGACGGATGTGGCGGAG
ATAAAGGGACACAGACGTACATACGTGGGCGCTATGCCCGGGAAGCTGGTGCAGTGCTTG
AAGAAGACGAACACAGAGAACCCATTGGTCCTTATAGATGAAGTGGATAAGATCGGGAAA
GGTGTCCACGGTGATCCGTCATCAGCTCTTCTGGAACTGCTGGATCCAGAACAGAACGCG
AATTTCCTGGACCACTACTTGGATGTTCCGGTGGACCTGTCTCGAGTGCTCTTCATCTGC
ACAGCGAACGTACTCGACCTTATACCGGAACCTCTGAGGGACAGGATGGAACTTATAGAA
ATGTCAGGATATGTGGCAGAAGAGAAGCTAGCCATAGCCCAGCAGTACTTGATACCGACA
GCCCTCAAGAACTGTGGTCTCACAGACGAAAAAATCAATATAACACCGGAGGCATTACAC
ACACTCATAAGGTCATACTGCAGGGAGAGCGGAGTCAGGAATCTACAGAAACATATTGAG
AAGATTGCACGTAAGGTAGCCTACAAGCTTGTAAAGAAAGAGACGTCTTCCTTATCTGTG
ACGGACGCTAATTTATCGGAACTGGTTGGGAAGCCGACCTTCAAACACGACCGCATGTAT
GACGTCACACCACCCGGAGTGGTGATGGGCCTAGCGTGGACCGCCATGGGTGGTAGTACG
TTATACATAGAAACAGCTGTACGGAACACTATGAAGGGTGAGAAGCAATCCGGCTCGCTG
GAGCTGACCGGGCACCTGGGTGACGTCATGAAGGAGTCGGCCCGGATCGCGCTCACCGTG
GCCCGCAACTACCTCAAGGAGTCCCAGCCGGACAACGACTTCCTTAACACCAGTCACCTC
CACCTCCACGTGCCCGAGGGCGCGACTCCCAAGGACGGTCCATCAGCGGGCGTGACCATC
GCCACCGCTCTCCTGAGCCTAGCGCTCCAACGACCAGCCAACACCCTCGCTATGACCGGG
GAGCTCACCCTCACTGGACGAGTGCTGCCCGTTGGAGGGATCAAGGAGAAGATTATAGCG
GCTAAGCGTGTCGGAGTGACTTGCGTGATTCTCCCCGAGGACAACAGGCGCGACTTCGAC
GACCTGCCCTCCTTCATCAGGGACGGTATCGACGTGCACTTCGTCAATGTGTATGATGAC
GTGTTCAAGATAGTCTTCGACGGAAAGGTTTAA
Protein sequence:
MHIASVLVRNTALLNPSIRPSSQTVRNVTKIASYCKPVGNRFFNGHNLYGTRNARICSYN
QEYAAVKKVQNIRHYSKKLNPEEEESADIKEDPPLFSSQLPATVAVPEVWPQVPVIAINR
NPVFPRFIKLIEISNPALIDLIRRKVKLNQPYVGIFLRKKEDEKSDVVSSLDDLHDVGVF
AQIHEMQDMDYKLRLVVMAHRRIKITGQFIEDEIETGPAEMKLKFPVFNVEFNVTREESD
AERRRRKYRNTRRQRNDSDAEHEKEVQEPKEAKKPPPDQLMMVKVENMMHDKFQQNEEVK
ALTQEIIKTIRDIINMNPLYRESLHHMLAQGQRVVDDPVYLADLGAALTAAEPKDLQPVL
EEMDIPKRLLLSLSLLKKEYELSKLQQKIGKEVEEKVKQQHRKYILHEQLKVIKKELGLE
KDDKDAIGEKFRERLADKVVPPSVQTVIDEELNKLNFLESHSSEFKLVWSITFNKSICFQ
FVELLVNDRLNRGWYHFISQSLAEFLTNGVGKTSIARSIARALNRKYFRFSVGGMTDVAE
IKGHRRTYVGAMPGKLVQCLKKTNTENPLVLIDEVDKIGKGVHGDPSSALLELLDPEQNA
NFLDHYLDVPVDLSRVLFICTANVLDLIPEPLRDRMELIEMSGYVAEEKLAIAQQYLIPT
ALKNCGLTDEKINITPEALHTLIRSYCRESGVRNLQKHIEKIARKVAYKLVKKETSSLSV
TDANLSELVGKPTFKHDRMYDVTPPGVVMGLAWTAMGGSTLYIETAVRNTMKGEKQSGSL
ELTGHLGDVMKESARIALTVARNYLKESQPDNDFLNTSHLHLHVPEGATPKDGPSAGVTI
ATALLSLALQRPANTLAMTGELTLTGRVLPVGGIKEKIIAAKRVGVTCVILPEDNRRDFD
DLPSFIRDGIDVHFVNVYDDVFKIVFDGKV