New model in OGS2.0 | DPOGS206930  |
---|---|
Genomic Position | scaffold1:- 698603-701624 |
See gene structure | |
CDS Length | 1392 |
Paired RNAseq reads   | 301 |
Single RNAseq reads   | 686 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012906 (4e-23) |
Best Drosophila hit   | CG6696 (5e-18) |
Best Human hit | astacin-like metalloendopeptidase precursor (1e-10) |
Best NR hit (blastp)   | high choriolytic enzyme 1 [Culex quinquefasciatus] (1e-19) |
Best NR hit (blastx)   | PREDICTED: similar to CG6696-PA [Apis mellifera] (2e-20) |
GeneOntology terms    | GO:0004222 metalloendopeptidase activity GO:0017090 meprin A complex GO:0006508 proteolysis GO:0008270 zinc ion binding |
InterPro families    | IPR006026 Peptidase, metallopeptidase IPR001506 Peptidase M12A, astacin |
Orthology group | MCL40795 |
Nucleotide sequence:
ATGACTTTTTTATTGAATTTCTTCGTGTTGAATATATTAAATCTAGAATTATTTGCTTAT
GATTTGGAGCCTCCATTAAAAAATTATAAAGATGGAAGTAAATCTGCTTTTATTCAAGCT
CCATACGTAAGCATTGCACGAGATCAAAAAGTAAAGAAAATACAAGAAGAGATTACAAGC
AGTTGGCCCGAAGGAATAATAAAATACTATGTGGAAGAAAAGAGTTATGATTCATCTATC
ATTACTCTTATACGCGCTGCGATGAGTGTTTTGGAATCGTCAGCTTGCATACGTTTCAAG
GCAGTCAAGGATAAGCCAGAGGGCAATGACACATGGCTACACATCACCAATCCAAAAAAG
AAAAGGGAATGCGTGCATGAACCCGAGGTTCTGGAAAGCGGAGAAATTGTTTTAGTTCTT
GGTTATGACTGCCTTAAATCTAGAGACTTGATACATTCTTTGCTCCATGGTATTGGATTA
AAGGACGAAGTGACGCATCCTCACAGAGACAACTATGTCAAAGTTGTGTGGGATAATATA
CAACCTGCTTACAGACATCTATATCGTACCCAACCAGTAGAGAATTCTAGAAGCATAGTT
GAGTACGATCCATTAAGTATTATGCATTTCCACGATCGGGCTTTCAGTATGAATGGCAAA
GCAACAATCCTACCATTGGAAACTGGTTTAAGGATTTCGCCATCAGACGGCTTATCACAG
TTGGATAAAATGAAGTTACATATATATTTTGGACACGAATGTAATAAGAGGAAATTCGTT
TCCCTCATGGAAACATGTAAAATGTCTTTAAAGAGTAAAAAAGAATCGGCTAGTGATGAA
AATCGTGAGAAAGGAAAGGATCGAGATAATGTTACAGGAGAAAAAGGTGATAGTAAAAAT
GAAAATGAAGACCACGGAGGAAAAGGTGGTACGGAGAATGCTAATAAACTTGAAAAGGGT
GAAACAGATGAAAATGAAGGAGAAGAAAATGGAGTAGAAGAAGAAAATGGAGTAGAAGAA
GAAAAATTTACTGAAGAAGCAAATAATTCTGAAAATAAAGAAGAGGTAGATGAAAATACT
ACATGGAGAACCTTACATGGAAATTTAACGGAACTCGAGAAAAATGGCGAAACTGAAGAC
GCTAAACAAAATACTGAAGAGGATAACTCTTCAAAGATTACAGAAAAGGTTCAAGATGAT
GATGAAAATAACGATGAATCGAAAGAAAAATCCAAAAAACGATACATTCCTGCAATAATC
GGAGTAATAGCTACGGCAAACTCTGATATGAGTTCCGGAAACGTAAATCAGTTGACGGAA
TCGGAGTCTGCAACAGAAAAGAAACTGAATTCTGGAATCAACTTAAATTATGATAATTAT
ACCGACAAATAA
Protein sequence:
MTFLLNFFVLNILNLELFAYDLEPPLKNYKDGSKSAFIQAPYVSIARDQKVKKIQEEITS
SWPEGIIKYYVEEKSYDSSIITLIRAAMSVLESSACIRFKAVKDKPEGNDTWLHITNPKK
KRECVHEPEVLESGEIVLVLGYDCLKSRDLIHSLLHGIGLKDEVTHPHRDNYVKVVWDNI
QPAYRHLYRTQPVENSRSIVEYDPLSIMHFHDRAFSMNGKATILPLETGLRISPSDGLSQ
LDKMKLHIYFGHECNKRKFVSLMETCKMSLKSKKESASDENREKGKDRDNVTGEKGDSKN
ENEDHGGKGGTENANKLEKGETDENEGEENGVEEENGVEEEKFTEEANNSENKEEVDENT
TWRTLHGNLTELEKNGETEDAKQNTEEDNSSKITEKVQDDDENNDESKEKSKKRYIPAII
GVIATANSDMSSGNVNQLTESESATEKKLNSGINLNYDNYTDK