New model in OGS2.0 | DPOGS204778  |
---|---|
Genomic Position | scaffold1158:+ 19980-46021 |
See gene structure | |
CDS Length | 1224 |
Paired RNAseq reads   | 720 |
Single RNAseq reads   | 2209 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013714 (5e-93) |
Best Drosophila hit   | neprilysin 3 (2e-17) |
Best Human hit | endothelin-converting enzyme 1 isoform 2 (1e-18) |
Best NR hit (blastp)   | PREDICTED: similar to Endothelin-converting enzyme 1 [Tribolium castaneum] (4e-73) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC010845 [Tribolium castaneum] (6e-63) |
GeneOntology terms    | GO:0006508 proteolysis GO:0008237 metallopeptidase activity GO:0004222 metalloendopeptidase activity |
InterPro families    | IPR000718 Peptidase M13, neprilysin IPR008753 Peptidase M13 |
Orthology group | ND |
Nucleotide sequence:
ATGTCGAGATTATGTAGATACTATGAGGCGAAACTTCTCAGCATTCAATGGATTCGCCGT
GCGGGTGAATCGCTTTGCGGGGAGAGGAGCATTAACAGGGACTTGCGACCCAGAGGAAGT
CAGCTAGCTCGCTGCACGGGTCGTTGCACTGGTATCGGGTATGGCGGGGAAGCAAAACTA
CCGCGGCCACTGAAACACAGCCATTATAGCGTGGCCGCAGACCACGCATGCGCGGCAACG
GTCGTACGGCGCGAAAAAGATTATAGAGACGTTAAGGACATAGACAATGAGAGTCAACAC
TCCGCCAAGATACCGTACACAGACAGCGCACTGGAAGACAAGCTAGCTCGCACTGTGCGC
TGTTCGTGGGTGGTGATCGTATCCCTGGGCTTGTCCCTGGCGGTGTTGGCGGTGTACACC
ACGGTCGTGGTGGTGGTAGACCTTAAAGCACCCAAACCCTGCCTCACTGAAGTGTGTGTC
AATACAGCGTCGAGAGTACTAGCGGCGTTAAACAAGAGCGTGGATCCCTGCGACGACTTC
TACGAGTTCGCTTGTGGCGGGTGGATCGAGAAGAACCCTGTCCCGGAGTGGGCGACCTCC
TGGGATCAGCTCGCCATCCTGCGAGAGAAACTGGTCACTGACCTGAGGGAACTGCTGGAA
GACAAAAACGACCACGGCCTGCCTAAGAGCGTGCTTAAAGCTAAAGCCCTCTACCGCACT
TGTATGGATGTTGACAAGCTAGAGGTGTACGGAACCGCGCCCATCACGGATCTGTTGCTA
CAACTAGGCCTTCCTCCAACGCCCCCTTCCGTGTCCAGTGATAACTTCTCGTGGGAGCAG
GTGTCTGGGCGCGCCCGCAGGACTCTCGGTCTCAGTGTTCTGCTGAGCGTTCAGGTCGCT
GAGGATGTGAGGAACACTAGCAGGAACAGGGTCGTGTTGGAGCAGGTATCTCCAGGGTTC
AGCGATCGTTACCTGCGCCAGGCGGACAAGTTCTCGTTCGAGTTGGAGCAGTACCGGATC
TACATCACGTCAATGATCAAAGCCTTCCATCCCGACACGGACGCGGAACGCTTCGCCGAC
GACATTATAGAATTCAGCAAGACTCTGGCTGGCATCATGACGCCGGTGGAGGTTCGTCGC
AGCGGCACTCACCTGTTCCACGAGCTGAGTGTGACTCAACTGCTGGGAGGGAACGGAGCT
CCTCCTGAATGGCACCAGGTATGA
Protein sequence:
MSRLCRYYEAKLLSIQWIRRAGESLCGERSINRDLRPRGSQLARCTGRCTGIGYGGEAKL
PRPLKHSHYSVAADHACAATVVRREKDYRDVKDIDNESQHSAKIPYTDSALEDKLARTVR
CSWVVIVSLGLSLAVLAVYTTVVVVVDLKAPKPCLTEVCVNTASRVLAALNKSVDPCDDF
YEFACGGWIEKNPVPEWATSWDQLAILREKLVTDLRELLEDKNDHGLPKSVLKAKALYRT
CMDVDKLEVYGTAPITDLLLQLGLPPTPPSVSSDNFSWEQVSGRARRTLGLSVLLSVQVA
EDVRNTSRNRVVLEQVSPGFSDRYLRQADKFSFELEQYRIYITSMIKAFHPDTDAERFAD
DIIEFSKTLAGIMTPVEVRRSGTHLFHELSVTQLLGGNGAPPEWHQV