DPGLEAN13628 in OGS1.0

New model in OGS2.0DPOGS214246 
Genomic Positionscaffold433:- 8319-11597
See gene structure
CDS Length1503
Paired RNAseq reads  1588
Single RNAseq reads  4133
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005965 (0.0)
Best Drosophila hit  CG3107, isoform C (1e-09)
Best Human hitpresequence protease, mitochondrial precursor (7e-06)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL008862 [Aedes aegypti] (1e-150)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL008862 [Aedes aegypti] (2e-139)
GeneOntology terms



  
GO:0004222 metalloendopeptidase activity
GO:0006508 proteolysis
GO:0008270 zinc ion binding
GO:0003824 catalytic activity
GO:0046872 metal ion binding
InterPro families


  
IPR011237 Peptidase M16, core
IPR007863 Peptidase M16, C-terminal
IPR011765 Peptidase M16, N-terminal
IPR011249 Metalloenzyme, LuxS/M16 peptidase-like, metal-binding
Orthology groupMCL17938

Nucleotide sequence:

ATGTCTCATTTTAAACTAATATCGTCAACAAAGGCTTCCGATGTGATACCTGTAAACAAA
TATTTGTCCGAAAAGACTGGCTTAACCGTAATTATAGCAAACGTTGAAGGACCTGTTGTA
AAAGGATTTTTTTGCTTAGCAACGGAAGCTCACGATGATGACGGTTTGCCTCATACATTG
GAACACTTGATCTTTTTGGGATCAGAGCGTTACCCTTACAAGGGTATTCTCGATCTTTTG
GCGAACCGATGTATGGCTCACGGAACGAACGCGTGGACGGATGTAGACCACACTTGTTAT
ACTATATACACTGCGGGAGATGCGGGTATGTTGACTCTGTTACCCATCTACCTGGACCAT
ATACTGAGACCAACTCTTACGGATCAAGGATTTCTGACGGAGGTTCATCATGTTGATGGT
GACGGAGATGACGCTGGTGTGGTGTACTGCGAGATGCAGGGTAGGGAGAATACAGCGGAT
AGTAAATGTGAGTTAAGAATGCTCCGTGCTATGTATCCCAATAATGGCTATTCTTCTGAA
ACTGGGGGTATCATGAAAAACCTGAGGGAGTCCACTGATAATACTAAAGTGCGAGATTTC
CACAAGAAATTCTATAGAGCTGAAAACCTAACAATAATTCTAACAGGACAAATTGACGCC
CAAGATGTTTTCAATGTTCTCACCACAGTTGAGGATGACATCATTGCTAAGCGGGAGAAG
GAATCTCAGGAAGAGTGGGTGAAACCCTGGCAGACTATACCCCCACCACCAGCTTATGGA
GAACTTATAGAGAAGTGGCCAGCGGATACCGAAGACTGTGGACAGGTATTGTTCGGTTGG
CGTGGACCTCTGTTGATTCAGGTCGGTGCGTTGCACGAGTTGACTGCTTGTGCGGTGCTG
CTGCGGTATCTATGCGACACGGCTGCGGCGCCGCTACAGCGTGCACTTGTCGAGAGAGAG
GACGCGTTGGCTGGAGATGTATCATACAATCTCACAGAGAACATGGCCTCATTGATTAAG
ATAGAGCTGGATAATGTACCAGTTGATAAACTGACTCAAGCTAAAGAGGAGGCGCTGAAG
AGTTTGAGAAGCGTCAGGTCCGGGGAGGAGGCTATCAATATGGACCGCATGAAGAGATTA
CTCAGGAAACAGTTGAGGGAATGTATGGCCAGCCTTGAATCTGAACCACATCATGCTGTG
GCTTTTAGATGTATCGGAGATGCACTTTATTCCCAAAATGAAGACGATTTTATAAAACGG
ATGAATCCACAACAAACGATGCATGATCTACTAAAAGAGAGCAGTGAATTCTGGGTTGAT
TTGTTGAACAAGTACTTCAATGATGATCTGGTGGTCATAGTTGGATCACCTAGCATTGAG
TTGCAAGCAAACGCCCTCCTCCCCCCGGGACCCTGGCGTCTGTTCCTGTACCGTCCTGTG
ACTTCAAGTGTCATTCCATCCGGTCATGGAGTTCCGGAGAAGACTGTCCATATCTCGACC
TAA

Protein sequence:

MSHFKLISSTKASDVIPVNKYLSEKTGLTVIIANVEGPVVKGFFCLATEAHDDDGLPHTL
EHLIFLGSERYPYKGILDLLANRCMAHGTNAWTDVDHTCYTIYTAGDAGMLTLLPIYLDH
ILRPTLTDQGFLTEVHHVDGDGDDAGVVYCEMQGRENTADSKCELRMLRAMYPNNGYSSE
TGGIMKNLRESTDNTKVRDFHKKFYRAENLTIILTGQIDAQDVFNVLTTVEDDIIAKREK
ESQEEWVKPWQTIPPPPAYGELIEKWPADTEDCGQVLFGWRGPLLIQVGALHELTACAVL
LRYLCDTAAAPLQRALVEREDALAGDVSYNLTENMASLIKIELDNVPVDKLTQAKEEALK
SLRSVRSGEEAINMDRMKRLLRKQLRECMASLESEPHHAVAFRCIGDALYSQNEDDFIKR
MNPQQTMHDLLKESSEFWVDLLNKYFNDDLVVIVGSPSIELQANALLPPGPWRLFLYRPV
TSSVIPSGHGVPEKTVHIST