New model in OGS2.0 | DPOGS207023  |
---|---|
Genomic Position | scaffold1:+ 1046875-1064780 |
CDS Length | 3594 |
Paired RNAseq reads   | 1085 |
Single RNAseq reads   | 2946 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012964 (0.0) |
Best Drosophila hit   | CG2025 (2e-164) |
Best Human hit | nardilysin isoform b (1e-169) |
Best NR hit (blastp)   | PREDICTED: similar to metalloendopeptidase [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to metalloendopeptidase [Nasonia vitripennis] (0.0) |
GeneOntology terms    | GO:0046872 metal ion binding; GO:0003824 catalytic activity; GO:0004222 metalloendopeptidase activity; GO:0006508 proteolysis; GO:0008270 zinc ion binding |
InterPro families    | IPR011765 Peptidase M16, N-terminal; IPR007863 Peptidase M16, C-terminal; IPR001431 Peptidase M16, zinc-binding site; IPR011249 Metalloenzyme, LuxS/M16 peptidase-like, metal-binding; IPR011237 Peptidase M16, core |
Orthology group | MCL10557 |
Nucleotide sequence:
ATGTCTAAAAGAAATATCTTCCACAGAACACCTAAATTTAAACCTGAAGTGAAAATGGCT
AGTGCCCGTCAGAAACAATCTACCAGTATGAATAGCAAGAAGGTGAAAGTTGAAGTTCTT
CCTGAACCCATCAAGTCTACATCTGATAAAAAACTATATAAGACAATAAAGCTAGAAAAT
GGATTGACAGCACTACTCATATCAGATCCCAGCCGACAGTTTGTACCAGAGGAACTGAGT
TCTAGCGAAGAAGAATCCAGCGGTACTGATGAAAGCTCAGGACTCGAAAGTGACAGTGGC
AAGTCTGGTGGCAGTGACCAACACGGAACTAAGAAGAGAGGGGACTCGGACGAAGAGAAA
CTATCACTAAATCGTCACATCGAGGAGATCCATAAAAAAGTAAACACGCACACTGATAGG
AAAGTAAGACCTGTAAGAGTCGCTGCTCGTCGTGCTAATGAACTGATGTGCATTATTCAA
AATATAGAGCACCATGATGTTGAATGGCTCGATGATAACGACGTAGACATTCCAAAATTG
GATGAAAGGGAGCAGACTCACAATACTGAGCTGCAATCCAAGCAATCGAATGTCATCGAA
AACTTAGAATCATGCAAACGGGCAGCTTGCGCGTTATGCGTCGGTGTTGGAAGCTACAGT
GACCCTCACGACATTCAAGGGTTGGCCCACTTCGTTGAGCACATGGTGTTTATGGGAAGC
GAGAGGTATCCAAAGGAGAATGAATTTGACGCTTTCATTAAGAAAAAAGGGGGTTCGGAC
AACGCGTCCACGGACTGCGAATTGACGACATTCTACTTTGAGATTCAAGAGAAACATCTT
CCGCACGCTATGGACATGTTCAGCCAGTTCTTCGTGAGCCCGCTCATGATGAAGGAGGCC
ATGCAAAGGGAACGTGAGGCCATCGAATCGGAATTCGCGATCGCCTCTCCGTCCGACTCG
AATCGTAAGGACCAGTTGTTGTCAAGTCTGTTCCCGGAGAACCACCCAGCTCGCACATTC
ACCTGGGGAAACCTGAAGAGTCTCAAGGAGGATATAGACGATGATAACAGACTTCACACT
GCAGCTCATGAGTTCAGGAAGAGGCATTACAGCGCTCATAGGATGACTGTAGCGGTTCAG
GCTCGCATGGACCTCGCATCACTGGAACAGTACGTGGTGAACACATTCGGTCAGATACCA
ACAAACAGGCTGCCACCAGAAGACTTCTCCGATTTCAAGTTCAGTCCACGGACCATTACA
CCGGAGTTCACCAGCATTTACTATGTGAAGCCGGTCAGCGATACTACTGAGGTCCATTTG
ACTTGGTGTATGCGGTCTCTACTGTCCGAATACGAGTCAAAGCCTCACCAGTACATATCA
TACCTACTGGGACACGAGGGCAAGGGCAGTTTGCTCTCTTATTTAAGAAAAAAGGTGTGG
GCGTTGGCTATATATACTGGCAACTCTGAGAGCGGTATAGACTATACATCCATGTACAGT
TTGTTCTCAACGCAAGTGGTGCTGACAGAAGACGGATTAGCAAATATTGACAAAGTGCTG
GAAGCGATATTTTCATATATCAATATGCTTAAAAAGCTCGGACCATCTGAGAGGATCTAT
GACGAAATAAGGACAATAGAAGAGACCAGTTTCCGTTTCGACGAGGAATCTCAGCCGTCG
GACTATGTGGAGACATTGTCGGAGAATATGCACTTCTTTCCGCCACAACATTACATAACA
GGGGATCGCCTGTACTACAAATATGACCCTAAGGGTATTAAAAGCTTACTCGATCTCATG
AGAGCAGACACTGTCAATATAATGATACTCAGCAACAAACATCCTAAGCCGATCAAATAT
GATAGTAAAGAGAAATGGTTTGGCACGGAGTACAAGAGGGAGGCTATAAACCCGGCGTGG
TTGAAGAAATGGTTATCAGTCACGCCCTACAGCCAGTTCCACCTGCCGGAGAAGAACGTG
TACATCACGACCAACTTTGATCTCATTCAACCAGCTAAACCATATTTAGAGGAAGCTGAA
CGTTTGGGGATAGATCTCATCAATAATTCAGCAAAAGATATACACAGGAAGGTAGCTGCG
AACGAATTTACAAGCAAGGTCCTTAAACACGGCGAACTTATGGCCACCGTCAATAGATTC
AGGCTCGACCAGCCAAACCTCCTTCGCAAGAACCGGCACATGGAGCTGTGGTATAAACCC
GATTTTAAATTCCGTTTCCCAACAGCGCTGTTGTACTTCTACTTCATAACACCATTAAGT
CTCAAGTCTCCGAGAGAGGCTTGCCTACTTGATCTCTGGAGCGACGTGCTACAACAGGGA
CTTAAGGAAGACGTCTATCCCGCCAATATGGCGGATCTGACGCATTTGTTGTACGTCACC
GACAGAGGTCTGACCCTGAAAATCTCTGGGTACAGTCAGAATCTTCACCTGGTTGTGTCT
CTGATATCACGCGCGATGCGCGACTCTGCCCGCATGCCGCACGCTCTGTTCGAGGCTGTG
CGCGACGTTCGTGCGAGGACCTACCATAACGTCCTAATCAAACCGCACAAACTGGCCAAG
GATGTCCGTATGAGCCTTTTACTGGAGCCCTATATGTCGCCACGTGACAAGGCGACCTTC
ATACAGAACGTCACTTTGCCGGAACTACAGGACTTCACACAGAAGTTGCTCAATAAGATG
TACCTACAGATTCTTGTGCAAGGTAACCTGGCTTGGCACGAGGCTGTGACTATATCAGAG
AATGTTTTGAAAACAATAAAATGGGATGGACTAGAACCACACGAGATCCCTGACATCAAA
GTTCACCAGTTACCACTTGGAGAGCGTAAAATCCGCGTGGCTAGCCTCAACCCGTCATCA
ACGAACAGTATCGTCACCAACTACTACCAGGGGGAGAGGAGCACGCCGCAGGAGGCCGCC
GCGCTTGAAGTACTAATGATGCTGATGGAAGAACCAGTTTTCGATGCTCTTCGTACTAAG
GAGCAGCTTGGATACAGCGTGTTCAGCATGATGCGTTACACCTTCGGCGTGTTGGGCTTC
TCGATTACTGTTAACACTCAAGTCGACAAGTTCAGCGTATCCCATGTTGATCGTCGAGTG
GAGGCGTTCCTCAAGAAGTTCGCTCGTGATGTGAAGAGGGGTGGGGAGAGGGCGCTGGCG
GCGGCCAGGCACGCGCTGGTGCAGCTCAAACATACCGCTGACTACGAGCTCAAGGAAGAG
GTTGAGAGAAACTGGCGCGAAATCCTGACCCAAGAATACCAGTACCAACGTCTATTTGTC
GAGGCTGACGCCATAGAGAGAATCAAACTGTCTGATATCAAAAACTGGATAGATAACCAC
TTCCCCTCAGGAAACAGGTCGCAGTTCAGGAAACTATCAGTACAGGTCGTGGGTAACAAG
CCGCAAGATGAAAGCGTGGACGGACCTAAAAAACTATCACTAATTTATTCCAATGCCAGC
GAGAACAGCGGCGACCCCACAGAGAACGAAGCTGACTTCATCAAAAACATAGAAATATTC
AAGACAGACCTGCCTCTCATAAATGTACCGAAAGTTGAATTAGCGCAATGTTAA
Protein sequence:
MSKRNIFHRTPKFKPEVKMASARQKQSTSMNSKKVKVEVLPEPIKSTSDKKLYKTIKLEN
GLTALLISDPSRQFVPEELSSSEEESSGTDESSGLESDSGKSGGSDQHGTKKRGDSDEEK
LSLNRHIEEIHKKVNTHTDRKVRPVRVAARRANELMCIIQNIEHHDVEWLDDNDVDIPKL
DEREQTHNTELQSKQSNVIENLESCKRAACALCVGVGSYSDPHDIQGLAHFVEHMVFMGS
ERYPKENEFDAFIKKKGGSDNASTDCELTTFYFEIQEKHLPHAMDMFSQFFVSPLMMKEA
MQREREAIESEFAIASPSDSNRKDQLLSSLFPENHPARTFTWGNLKSLKEDIDDDNRLHT
AAHEFRKRHYSAHRMTVAVQARMDLASLEQYVVNTFGQIPTNRLPPEDFSDFKFSPRTIT
PEFTSIYYVKPVSDTTEVHLTWCMRSLLSEYESKPHQYISYLLGHEGKGSLLSYLRKKVW
ALAIYTGNSESGIDYTSMYSLFSTQVVLTEDGLANIDKVLEAIFSYINMLKKLGPSERIY
DEIRTIEETSFRFDEESQPSDYVETLSENMHFFPPQHYITGDRLYYKYDPKGIKSLLDLM
RADTVNIMILSNKHPKPIKYDSKEKWFGTEYKREAINPAWLKKWLSVTPYSQFHLPEKNV
YITTNFDLIQPAKPYLEEAERLGIDLINNSAKDIHRKVAANEFTSKVLKHGELMATVNRF
RLDQPNLLRKNRHMELWYKPDFKFRFPTALLYFYFITPLSLKSPREACLLDLWSDVLQQG
LKEDVYPANMADLTHLLYVTDRGLTLKISGYSQNLHLVVSLISRAMRDSARMPHALFEAV
RDVRARTYHNVLIKPHKLAKDVRMSLLLEPYMSPRDKATFIQNVTLPELQDFTQKLLNKM
YLQILVQGNLAWHEAVTISENVLKTIKWDGLEPHEIPDIKVHQLPLGERKIRVASLNPSS
TNSIVTNYYQGERSTPQEAAALEVLMMLMEEPVFDALRTKEQLGYSVFSMMRYTFGVLGF
SITVNTQVDKFSVSHVDRRVEAFLKKFARDVKRGGERALAAARHALVQLKHTADYELKEE
VERNWREILTQEYQYQRLFVEADAIERIKLSDIKNWIDNHFPSGNRSQFRKLSVQVVGNK
PQDESVDGPKKLSLIYSNASENSGDPTENEADFIKNIEIFKTDLPLINVPKVELAQC
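
The CDS length and the protein sequence in a record like this can be cross-checked with simple arithmetic: a complete CDS has a length divisible by 3, begins with ATG, and ends with a stop codon that is not translated. The sketch below is not part of the original record; the function name `expected_protein_length` is hypothetical, and it assumes the standard stop codons (TAA, TAG, TGA).

```python
# Hypothetical helper, not from the source record: sanity-check a CDS and
# report how many residues its translation should contain.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def expected_protein_length(cds: str) -> int:
    """Return the expected residue count for a complete CDS.

    Whitespace (including line breaks from a wrapped sequence dump) is ignored.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError(f"CDS length {len(seq)} is not a multiple of 3")
    if not seq.startswith("ATG"):
        raise ValueError("CDS does not begin with an ATG start codon")
    if seq[-3:] not in STOP_CODONS:
        raise ValueError("CDS does not end with a stop codon")
    # The final codon is the stop codon, so it contributes no residue.
    return len(seq) // 3 - 1


if __name__ == "__main__":
    # Toy input: ATG AAA TAA -> two residues (M, K) plus the stop codon.
    print(expected_protein_length("ATGAAA\nTAA"))
```

Under these assumptions, a CDS of 3594 nucleotides corresponds to 3594 / 3 - 1 = 1197 residues.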