DPGLEAN04066 in OGS1.0

New model in OGS2.0: DPOGS210786
Genomic Position: scaffold1688:- 4703-12219
CDS Length: 3234
Paired RNAseq reads: 361
Single RNAseq reads: 895
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA007097 (2e-70)
Best Drosophila hit: neprilysin 4, isoform B (0.0)
Best Human hit: endothelin-converting enzyme 1 isoform 4 (6e-138)
Best NR hit (blastp): hypothetical protein Phum_PHUM474680 [Pediculus humanus corporis] (0.0)
Best NR hit (blastx): PREDICTED: similar to Neprilysin 4 CG4058-PA, isoform A [Apis mellifera] (0.0)
GeneOntology terms:
  GO:0004222 metalloendopeptidase activity
  GO:0006508 proteolysis
InterPro families:
  IPR000718 Peptidase M13, neprilysin
  IPR008753 Peptidase M13
  IPR018497 Peptidase M13, neprilysin, C-terminal
Orthology group: MCL10289

Nucleotide sequence:

ATGGCGCGCGTTGACTCAACACCCCAAACAGTTTCGATCAAACCGAAAAGTGCCAATTTT
TGTAGATTTATAATTATTATTGTTTTAACATTATTCTGTGCTGTAGCCTATTTTATTTCG
AGGAGACCTCTAGACACATTAGGAGATCCGTCTGATCAGTTTACTGTTAAAAACATCCCA
AATATATATACACGTTTGGAAAGTGTTGATCCTGAGGAGAGGGAGGTGTTCCAAGGATTC
CAAACATCATTCCTGCCGCCGGAGGAGGTGGTGTTATCTAGGAAAGGATTTGAAGATAGT
GACGAAGTTTTAAACAGTAATGTGAAAAGCGATATATTTAAATGGGAAGAAAAAAGACGA
GAAAATCCCACAAGGGTTAAACGAGGAAGCCATTCCACTTTCTCGACACAAGATTACGAT
CAAGAAACACCAACACAAAACCAACAAAATGTCGAAGATATTCAAGAAGATGACGACGAT
GGAAACGAAAAGGAAATGGTATATGGCAGTGACTCCGATCAACACGACGAGGAAAGAAAT
ATGCAAAGTGTGGTGCATCCAGAGATTTACGTTGATCGTGGGCCGTTAGACGACGTGGAA
CAAGTTCGCCGGCAGCCAGAATTAGAGGAAGGTGAAATGGGTTCCGCGGCTCATTTGTAT
AAACCGGTTCGATCTTACACAGGGGTACACGCGTTTTGGAAGGGTCAGGGTGACAAAGAA
ACTATAAGACACACGCAGTCAAAAATAATGCGACAATACATGGATGCTGAAGCGGACCCT
TGTCATGATTTTTATCAATATGCTTGTGGAAATTGGCCAACACTCAACCCAATACCAGCT
GATAAAGCTGGTTACGATACATTTGAGATGTTGAGGGAAAACTTGGATACGGTATTGAAG
GACATGTTGGAGTTCTCTAAAGATGAAGAGATCCCTAGCCAGTATCCGGGGCCACATCTA
GATTTCAATGATAATTTAAAACATGCTATAAATTCGCAATGCTCACAAGAATCTCATGAT
ATCGTTGATTATATTATTACAAATTCCAAAGAAATACTCAATTTGACTGAGAGAAAAAAT
ACTTATGACATTAAAGACGAGAGCAAAACTGAAATAATTAATCGTATCAGACGATATCTT
GATATGAAAACACGTGACATGAAGAAGTCTTCATTTAAAACGAAATTTAAATTACACGAG
TATTTATTCATGAACAATAAAAAGGGAAAAAATTTAAGGAGACCCAAAAGACATACTGAT
AACAATGACACGCGGAACCAAAGCGAAAAGAAAAAACGCAATATAATATACGATAAAAAC
GGATCCAACAGAGAAACACATTTTAAACGTGGTAAAAGAAAAGAAACATTGGAGCAACTT
TTGGAAAATCTTAAACAGAAATATGAATTACCCAAAAACGACCCAGCAAATGGCGACGCG
GCATTGAAAGCTAGATTTTTATTTAAGTCTTGTATGAACCACGATATCTTGCAGAAAAGA
GGCCACGTACCTCTTCTAGATCTACTTGATATTTTAGGAGGCTGGCCGATACTAAAACCC
GGATGGGATTCAAAAAATTTCGACTGGTTGGAACTTATGGCAAAACTAAGGCTATATAAT
AATGACATTTTAATATCTGAATGGGTTGGACCAGATATAAAGAATTCAGATGAATTCGTT
ATACAGTTTGATCAAACGAGTCTAGGTTTGCCTACAAGAGATTATTTTCTACAAGAGTCT
AACAAGGTATATTTAGAGGGTTATAGAGCATATTTGATAAAAATAGCAACTTTACTCGGA
GGAAACATTGAGCATGTAAAAGAGAGTGCAGTAAAACTGATCGATTTCGAAATCAACCTT
GCTAAAATAACTTCCGCCCCAGAAGACAGGCGAAACGTATCAGAACTCTACCGCCGCATG
ACACTCGCCAAGCTGGAAGGACTGGTCCCCGAGATCAAGTGGAGGAAATATTTGTGCATC
GTGATGAACAGGACGATTGACTCAAGCGAAACTGTAGTACTGTTCGCTCTGTCGTACGTA
CGGCACTTAGTTCAATTGATAAAGAAGACGGATCCTAATACTTTATCAAATTACTTATTG
TGGCGTTTCGTGAGACATCGTGTCAACAATCTGGATGATCGCTTCCAATCTGCGAAACAA
CAATTCTATTACATTTTATTTGGACGCGAACAAGCGCCGCCAAGGTGGAAGAACTGTATA
TCCCAAGTGAATTCAAATATGGGCATGGCATTAGGGTCAATGTTTGTTAGGAAATACTTT
GACGAGATGAGCAAAAACGACACGATGACGATGACGAGGGAAATCCAACAGGCGTTCAGA
GAGTTACTGCACATGACGGATTGGATTGATGAGGAGACAAAAAAACTAGCCGCCCATAAA
GTCGACTCTATGATGCTCAGAATAGGCTACCCCGACTTCATTCTGAACAAGAAAGAGCTC
GACGATCGTTATAAGGAAGTGCAAATACATCCAGATAAATATTTTGAGAATATACTGAAT
ATACTTCAACATCTCACTAAAATGGAACAGTCGCGAATCGGCCAGCCTGTTAATAAGACA
CTATGGAATACAGCGCCGGCGGTCGTGAACGCTTATTACAGCCGTAATAAAAATCAGATC
ATGTTCCCCGCTGGGATCCTACAACCACCTTTCTACCATCGACACTTCCCGAGGTCGCTG
AACTTTGGAGGCATCGGAGTGGTTATTGGTCACGAAATTACCCACGGGTTTGACGACAAG
GGTCGTTTGTTTGACTGCGAGGGTAACCTGCACCGCTGGTGGTCTGATTCCGCCATCGAG
GCATTCCATCGTCGAGCTCAGTGCCTCATCGACCAGTACGGACGATACGTAGTGCCAGAA
GTCAATATGAAACTAGACGGTGTTAACACACAGGGTGAGAATATAGCCGACAATGGTGGC
GTGAAGCAGGCGTTCCACGCTTACCAACGCTGGCTGCTACAGCACGGCGCCGTTGACGAG
ACGCTTCCAGAACTCAACCATACCAGCACGCAGTTGTTCTTTCTCAACTTCGCCCAGGTA
TGGTGTGGTGCAATGCGGCCGGAAGCTATGAGAAATAAATTAAAGACAGCTGTCCACTCT
CCAGGAAGGTTCCGTGTAATTGGAACCCTTTCTAATTCCCTGGATTTCGCCAGAGAATTC
CAATGTCCACCGGGATCGCCCATGAATCCGATTCATAAATGTAGTGTTTGGTAG

Protein sequence:

MARVDSTPQTVSIKPKSANFCRFIIIIVLTLFCAVAYFISRRPLDTLGDPSDQFTVKNIP
NIYTRLESVDPEEREVFQGFQTSFLPPEEVVLSRKGFEDSDEVLNSNVKSDIFKWEEKRR
ENPTRVKRGSHSTFSTQDYDQETPTQNQQNVEDIQEDDDDGNEKEMVYGSDSDQHDEERN
MQSVVHPEIYVDRGPLDDVEQVRRQPELEEGEMGSAAHLYKPVRSYTGVHAFWKGQGDKE
TIRHTQSKIMRQYMDAEADPCHDFYQYACGNWPTLNPIPADKAGYDTFEMLRENLDTVLK
DMLEFSKDEEIPSQYPGPHLDFNDNLKHAINSQCSQESHDIVDYIITNSKEILNLTERKN
TYDIKDESKTEIINRIRRYLDMKTRDMKKSSFKTKFKLHEYLFMNNKKGKNLRRPKRHTD
NNDTRNQSEKKKRNIIYDKNGSNRETHFKRGKRKETLEQLLENLKQKYELPKNDPANGDA
ALKARFLFKSCMNHDILQKRGHVPLLDLLDILGGWPILKPGWDSKNFDWLELMAKLRLYN
NDILISEWVGPDIKNSDEFVIQFDQTSLGLPTRDYFLQESNKVYLEGYRAYLIKIATLLG
GNIEHVKESAVKLIDFEINLAKITSAPEDRRNVSELYRRMTLAKLEGLVPEIKWRKYLCI
VMNRTIDSSETVVLFALSYVRHLVQLIKKTDPNTLSNYLLWRFVRHRVNNLDDRFQSAKQ
QFYYILFGREQAPPRWKNCISQVNSNMGMALGSMFVRKYFDEMSKNDTMTMTREIQQAFR
ELLHMTDWIDEETKKLAAHKVDSMMLRIGYPDFILNKKELDDRYKEVQIHPDKYFENILN
ILQHLTKMEQSRIGQPVNKTLWNTAPAVVNAYYSRNKNQIMFPAGILQPPFYHRHFPRSL
NFGGIGVVIGHEITHGFDDKGRLFDCEGNLHRWWSDSAIEAFHRRAQCLIDQYGRYVVPE
VNMKLDGVNTQGENIADNGGVKQAFHAYQRWLLQHGAVDETLPELNHTSTQLFFLNFAQV
WCGAMRPEAMRNKLKTAVHSPGRFRVIGTLSNSLDFAREFQCPPGSPMNPIHKCSVW
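
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length of 3234. The following is a minimal Python sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1), and the function name `translate_cds` is a hypothetical helper introduced here for illustration. It handles only unambiguous A/C/G/T bases; any other character raises a `KeyError`.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(cds):
    """Translate a CDS into protein, dropping one trailing stop codon.

    Whitespace and line breaks are ignored, so the sequence can be
    pasted in directly as 60-column lines.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First 18 bases of the nucleotide sequence in this record.
print(translate_cds("ATGGCGCGCGTTGACTCA"))  # MARVDS
# Last 18 bases, ending in the TAG stop codon.
print(translate_cds("AAATGTAGTGTTTGGTAG"))  # KCSVW
```

A CDS of 3234 bases holds 1078 codons, so after removing the stop codon the expected protein length is 1077 residues, which matches the protein sequence listed above.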