New model in OGS2.0 | DPOGS210786  |
---|---|
Genomic Position | scaffold1688:- 4703-12219 |
See gene structure | |
CDS Length | 3234 |
Paired RNAseq reads   | 361 |
Single RNAseq reads   | 895 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007097 (2e-70) |
Best Drosophila hit   | neprilysin 4, isoform B (0.0) |
Best Human hit | endothelin-converting enzyme 1 isoform 4 (6e-138) |
Best NR hit (blastp)   | hypothetical protein Phum_PHUM474680 [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to Neprilysin 4 CG4058-PA, isoform A [Apis mellifera] (0.0) |
GeneOntology terms    | GO:0004222 metalloendopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR000718 Peptidase M13, neprilysin IPR008753 Peptidase M13 IPR018497 Peptidase M13, neprilysin, C-terminal |
Orthology group | MCL10289 |
Nucleotide sequence:
ATGGCGCGCGTTGACTCAACACCCCAAACAGTTTCGATCAAACCGAAAAGTGCCAATTTT
TGTAGATTTATAATTATTATTGTTTTAACATTATTCTGTGCTGTAGCCTATTTTATTTCG
AGGAGACCTCTAGACACATTAGGAGATCCGTCTGATCAGTTTACTGTTAAAAACATCCCA
AATATATATACACGTTTGGAAAGTGTTGATCCTGAGGAGAGGGAGGTGTTCCAAGGATTC
CAAACATCATTCCTGCCGCCGGAGGAGGTGGTGTTATCTAGGAAAGGATTTGAAGATAGT
GACGAAGTTTTAAACAGTAATGTGAAAAGCGATATATTTAAATGGGAAGAAAAAAGACGA
GAAAATCCCACAAGGGTTAAACGAGGAAGCCATTCCACTTTCTCGACACAAGATTACGAT
CAAGAAACACCAACACAAAACCAACAAAATGTCGAAGATATTCAAGAAGATGACGACGAT
GGAAACGAAAAGGAAATGGTATATGGCAGTGACTCCGATCAACACGACGAGGAAAGAAAT
ATGCAAAGTGTGGTGCATCCAGAGATTTACGTTGATCGTGGGCCGTTAGACGACGTGGAA
CAAGTTCGCCGGCAGCCAGAATTAGAGGAAGGTGAAATGGGTTCCGCGGCTCATTTGTAT
AAACCGGTTCGATCTTACACAGGGGTACACGCGTTTTGGAAGGGTCAGGGTGACAAAGAA
ACTATAAGACACACGCAGTCAAAAATAATGCGACAATACATGGATGCTGAAGCGGACCCT
TGTCATGATTTTTATCAATATGCTTGTGGAAATTGGCCAACACTCAACCCAATACCAGCT
GATAAAGCTGGTTACGATACATTTGAGATGTTGAGGGAAAACTTGGATACGGTATTGAAG
GACATGTTGGAGTTCTCTAAAGATGAAGAGATCCCTAGCCAGTATCCGGGGCCACATCTA
GATTTCAATGATAATTTAAAACATGCTATAAATTCGCAATGCTCACAAGAATCTCATGAT
ATCGTTGATTATATTATTACAAATTCCAAAGAAATACTCAATTTGACTGAGAGAAAAAAT
ACTTATGACATTAAAGACGAGAGCAAAACTGAAATAATTAATCGTATCAGACGATATCTT
GATATGAAAACACGTGACATGAAGAAGTCTTCATTTAAAACGAAATTTAAATTACACGAG
TATTTATTCATGAACAATAAAAAGGGAAAAAATTTAAGGAGACCCAAAAGACATACTGAT
AACAATGACACGCGGAACCAAAGCGAAAAGAAAAAACGCAATATAATATACGATAAAAAC
GGATCCAACAGAGAAACACATTTTAAACGTGGTAAAAGAAAAGAAACATTGGAGCAACTT
TTGGAAAATCTTAAACAGAAATATGAATTACCCAAAAACGACCCAGCAAATGGCGACGCG
GCATTGAAAGCTAGATTTTTATTTAAGTCTTGTATGAACCACGATATCTTGCAGAAAAGA
GGCCACGTACCTCTTCTAGATCTACTTGATATTTTAGGAGGCTGGCCGATACTAAAACCC
GGATGGGATTCAAAAAATTTCGACTGGTTGGAACTTATGGCAAAACTAAGGCTATATAAT
AATGACATTTTAATATCTGAATGGGTTGGACCAGATATAAAGAATTCAGATGAATTCGTT
ATACAGTTTGATCAAACGAGTCTAGGTTTGCCTACAAGAGATTATTTTCTACAAGAGTCT
AACAAGGTATATTTAGAGGGTTATAGAGCATATTTGATAAAAATAGCAACTTTACTCGGA
GGAAACATTGAGCATGTAAAAGAGAGTGCAGTAAAACTGATCGATTTCGAAATCAACCTT
GCTAAAATAACTTCCGCCCCAGAAGACAGGCGAAACGTATCAGAACTCTACCGCCGCATG
ACACTCGCCAAGCTGGAAGGACTGGTCCCCGAGATCAAGTGGAGGAAATATTTGTGCATC
GTGATGAACAGGACGATTGACTCAAGCGAAACTGTAGTACTGTTCGCTCTGTCGTACGTA
CGGCACTTAGTTCAATTGATAAAGAAGACGGATCCTAATACTTTATCAAATTACTTATTG
TGGCGTTTCGTGAGACATCGTGTCAACAATCTGGATGATCGCTTCCAATCTGCGAAACAA
CAATTCTATTACATTTTATTTGGACGCGAACAAGCGCCGCCAAGGTGGAAGAACTGTATA
TCCCAAGTGAATTCAAATATGGGCATGGCATTAGGGTCAATGTTTGTTAGGAAATACTTT
GACGAGATGAGCAAAAACGACACGATGACGATGACGAGGGAAATCCAACAGGCGTTCAGA
GAGTTACTGCACATGACGGATTGGATTGATGAGGAGACAAAAAAACTAGCCGCCCATAAA
GTCGACTCTATGATGCTCAGAATAGGCTACCCCGACTTCATTCTGAACAAGAAAGAGCTC
GACGATCGTTATAAGGAAGTGCAAATACATCCAGATAAATATTTTGAGAATATACTGAAT
ATACTTCAACATCTCACTAAAATGGAACAGTCGCGAATCGGCCAGCCTGTTAATAAGACA
CTATGGAATACAGCGCCGGCGGTCGTGAACGCTTATTACAGCCGTAATAAAAATCAGATC
ATGTTCCCCGCTGGGATCCTACAACCACCTTTCTACCATCGACACTTCCCGAGGTCGCTG
AACTTTGGAGGCATCGGAGTGGTTATTGGTCACGAAATTACCCACGGGTTTGACGACAAG
GGTCGTTTGTTTGACTGCGAGGGTAACCTGCACCGCTGGTGGTCTGATTCCGCCATCGAG
GCATTCCATCGTCGAGCTCAGTGCCTCATCGACCAGTACGGACGATACGTAGTGCCAGAA
GTCAATATGAAACTAGACGGTGTTAACACACAGGGTGAGAATATAGCCGACAATGGTGGC
GTGAAGCAGGCGTTCCACGCTTACCAACGCTGGCTGCTACAGCACGGCGCCGTTGACGAG
ACGCTTCCAGAACTCAACCATACCAGCACGCAGTTGTTCTTTCTCAACTTCGCCCAGGTA
TGGTGTGGTGCAATGCGGCCGGAAGCTATGAGAAATAAATTAAAGACAGCTGTCCACTCT
CCAGGAAGGTTCCGTGTAATTGGAACCCTTTCTAATTCCCTGGATTTCGCCAGAGAATTC
CAATGTCCACCGGGATCGCCCATGAATCCGATTCATAAATGTAGTGTTTGGTAG
Protein sequence:
MARVDSTPQTVSIKPKSANFCRFIIIIVLTLFCAVAYFISRRPLDTLGDPSDQFTVKNIP
NIYTRLESVDPEEREVFQGFQTSFLPPEEVVLSRKGFEDSDEVLNSNVKSDIFKWEEKRR
ENPTRVKRGSHSTFSTQDYDQETPTQNQQNVEDIQEDDDDGNEKEMVYGSDSDQHDEERN
MQSVVHPEIYVDRGPLDDVEQVRRQPELEEGEMGSAAHLYKPVRSYTGVHAFWKGQGDKE
TIRHTQSKIMRQYMDAEADPCHDFYQYACGNWPTLNPIPADKAGYDTFEMLRENLDTVLK
DMLEFSKDEEIPSQYPGPHLDFNDNLKHAINSQCSQESHDIVDYIITNSKEILNLTERKN
TYDIKDESKTEIINRIRRYLDMKTRDMKKSSFKTKFKLHEYLFMNNKKGKNLRRPKRHTD
NNDTRNQSEKKKRNIIYDKNGSNRETHFKRGKRKETLEQLLENLKQKYELPKNDPANGDA
ALKARFLFKSCMNHDILQKRGHVPLLDLLDILGGWPILKPGWDSKNFDWLELMAKLRLYN
NDILISEWVGPDIKNSDEFVIQFDQTSLGLPTRDYFLQESNKVYLEGYRAYLIKIATLLG
GNIEHVKESAVKLIDFEINLAKITSAPEDRRNVSELYRRMTLAKLEGLVPEIKWRKYLCI
VMNRTIDSSETVVLFALSYVRHLVQLIKKTDPNTLSNYLLWRFVRHRVNNLDDRFQSAKQ
QFYYILFGREQAPPRWKNCISQVNSNMGMALGSMFVRKYFDEMSKNDTMTMTREIQQAFR
ELLHMTDWIDEETKKLAAHKVDSMMLRIGYPDFILNKKELDDRYKEVQIHPDKYFENILN
ILQHLTKMEQSRIGQPVNKTLWNTAPAVVNAYYSRNKNQIMFPAGILQPPFYHRHFPRSL
NFGGIGVVIGHEITHGFDDKGRLFDCEGNLHRWWSDSAIEAFHRRAQCLIDQYGRYVVPE
VNMKLDGVNTQGENIADNGGVKQAFHAYQRWLLQHGAVDETLPELNHTSTQLFFLNFAQV
WCGAMRPEAMRNKLKTAVHSPGRFRVIGTLSNSLDFAREFQCPPGSPMNPIHKCSVW