DPGLEAN15507 in OGS1.0

New model in OGS2.0DPOGS206930 
Genomic Positionscaffold1:- 698603-701624
See gene structure
CDS Length1392
Paired RNAseq reads  301
Single RNAseq reads  686
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012906 (4e-23)
Best Drosophila hit  CG6696 (5e-18)
Best Human hitastacin-like metalloendopeptidase precursor (1e-10)
Best NR hit (blastp)  high choriolytic enzyme 1 [Culex quinquefasciatus] (1e-19)
Best NR hit (blastx)  PREDICTED: similar to CG6696-PA [Apis mellifera] (2e-20)
GeneOntology terms


  
GO:0004222 metalloendopeptidase activity
GO:0017090 meprin A complex
GO:0006508 proteolysis
GO:0008270 zinc ion binding
InterPro families
  
IPR006026 Peptidase, metallopeptidase
IPR001506 Peptidase M12A, astacin
Orthology groupMCL40795

Nucleotide sequence:

ATGACTTTTTTATTGAATTTCTTCGTGTTGAATATATTAAATCTAGAATTATTTGCTTAT
GATTTGGAGCCTCCATTAAAAAATTATAAAGATGGAAGTAAATCTGCTTTTATTCAAGCT
CCATACGTAAGCATTGCACGAGATCAAAAAGTAAAGAAAATACAAGAAGAGATTACAAGC
AGTTGGCCCGAAGGAATAATAAAATACTATGTGGAAGAAAAGAGTTATGATTCATCTATC
ATTACTCTTATACGCGCTGCGATGAGTGTTTTGGAATCGTCAGCTTGCATACGTTTCAAG
GCAGTCAAGGATAAGCCAGAGGGCAATGACACATGGCTACACATCACCAATCCAAAAAAG
AAAAGGGAATGCGTGCATGAACCCGAGGTTCTGGAAAGCGGAGAAATTGTTTTAGTTCTT
GGTTATGACTGCCTTAAATCTAGAGACTTGATACATTCTTTGCTCCATGGTATTGGATTA
AAGGACGAAGTGACGCATCCTCACAGAGACAACTATGTCAAAGTTGTGTGGGATAATATA
CAACCTGCTTACAGACATCTATATCGTACCCAACCAGTAGAGAATTCTAGAAGCATAGTT
GAGTACGATCCATTAAGTATTATGCATTTCCACGATCGGGCTTTCAGTATGAATGGCAAA
GCAACAATCCTACCATTGGAAACTGGTTTAAGGATTTCGCCATCAGACGGCTTATCACAG
TTGGATAAAATGAAGTTACATATATATTTTGGACACGAATGTAATAAGAGGAAATTCGTT
TCCCTCATGGAAACATGTAAAATGTCTTTAAAGAGTAAAAAAGAATCGGCTAGTGATGAA
AATCGTGAGAAAGGAAAGGATCGAGATAATGTTACAGGAGAAAAAGGTGATAGTAAAAAT
GAAAATGAAGACCACGGAGGAAAAGGTGGTACGGAGAATGCTAATAAACTTGAAAAGGGT
GAAACAGATGAAAATGAAGGAGAAGAAAATGGAGTAGAAGAAGAAAATGGAGTAGAAGAA
GAAAAATTTACTGAAGAAGCAAATAATTCTGAAAATAAAGAAGAGGTAGATGAAAATACT
ACATGGAGAACCTTACATGGAAATTTAACGGAACTCGAGAAAAATGGCGAAACTGAAGAC
GCTAAACAAAATACTGAAGAGGATAACTCTTCAAAGATTACAGAAAAGGTTCAAGATGAT
GATGAAAATAACGATGAATCGAAAGAAAAATCCAAAAAACGATACATTCCTGCAATAATC
GGAGTAATAGCTACGGCAAACTCTGATATGAGTTCCGGAAACGTAAATCAGTTGACGGAA
TCGGAGTCTGCAACAGAAAAGAAACTGAATTCTGGAATCAACTTAAATTATGATAATTAT
ACCGACAAATAA

Protein sequence:

MTFLLNFFVLNILNLELFAYDLEPPLKNYKDGSKSAFIQAPYVSIARDQKVKKIQEEITS
SWPEGIIKYYVEEKSYDSSIITLIRAAMSVLESSACIRFKAVKDKPEGNDTWLHITNPKK
KRECVHEPEVLESGEIVLVLGYDCLKSRDLIHSLLHGIGLKDEVTHPHRDNYVKVVWDNI
QPAYRHLYRTQPVENSRSIVEYDPLSIMHFHDRAFSMNGKATILPLETGLRISPSDGLSQ
LDKMKLHIYFGHECNKRKFVSLMETCKMSLKSKKESASDENREKGKDRDNVTGEKGDSKN
ENEDHGGKGGTENANKLEKGETDENEGEENGVEEENGVEEEKFTEEANNSENKEEVDENT
TWRTLHGNLTELEKNGETEDAKQNTEEDNSSKITEKVQDDDENNDESKEKSKKRYIPAII
GVIATANSDMSSGNVNQLTESESATEKKLNSGINLNYDNYTDK