DPGLEAN09652 in OGS1.0

New model in OGS2.0DPOGS202042 
Genomic Positionscaffold795:+ 3370-9997
See gene structure
CDS Length2043
Paired RNAseq reads  769
Single RNAseq reads  2068
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001274 (0.0)
Best Drosophila hit  ND
Best Human hitplasma glutamate carboxypeptidase precursor (4e-108)
Best NR hit (blastp)  PREDICTED: similar to plasma glutamate carboxypeptidase [Acyrthosiphon pisum] (4e-132)
Best NR hit (blastx)  PREDICTED: similar to predicted protein [Tribolium castaneum] (1e-132)
GeneOntology terms









  
GO:0016787 hydrolase activity
GO:0004177 aminopeptidase activity
GO:0004180 carboxypeptidase activity
GO:0005576 extracellular region
GO:0008233 peptidase activity
GO:0046872 metal ion binding
GO:0008237 metallopeptidase activity
GO:0006508 proteolysis
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
  
IPR007484 Peptidase M28
IPR003137 Protease-associated domain, PA
Orthology groupMCL11779

Nucleotide sequence:

ATGACGCAGGAAAACTCTTTTTGCGAGCCGTCTAACCGGCCTTGGTGGGGCCGAAGAGGA
TGTGTGAGACTCGGCCTGGGCAGGCCCCTACTCACTAAAACCACTGCGAAAGCCAACGGC
CGCTATGTTGGTGTACCCCGGATCTGTCACCGACAGGACACACCAGAGACTGGCCGGTCC
TGGCAAAGACCGGACTTACTCCCTCGGAAAAAGGGGACGATCCCGGCAATTAGGATCGCC
CCGACGATGCCGGGCTATCACAGGAGCCGGGTTACCAGCCCGACCCCAGACCGGAGTGCA
GCGGCGCTATTTGAGGCGTTGCAAATATCGCCTCTGGCGCACCCCCGACCGTCGTCTGCG
GAGGGAGGGTGCATCGACCGCATCCTCCCTTTCCCGCTCAAATGCCTCCTTCGTGGACAT
GACCATCTCACAGAAGGAGGACACCGCCAGCCAGGACCTGTCACTCTCGAGAATCTTCGA
AAGCGAGAAGTTCACTCTGCTGAATGCTGCCACAAGTTCTTGGCGTTGTGCAGCCCATCT
CCGGCACACCTCGAGGGTGTGCTGGGCCAGGTCCACCGCAGCGTCGCACTCATGGCAGCC
TCCTCCCGGCTCTCTCCTGGCCGTCCGATACAAATAATTACCGAAGCACCCGTGCCCGGT
GAAAACCTGCGTCAAACGGAAAACAGAAGAAAAAATTACTCGCAATTAAAATCATGTGAT
ATCGGACCTTTGGCAGAAGAAATAGCATCCTACGAATCCGTGGTCAAAAACATTATAAAC
TACGTAGTGAACGGCCCGTTTAAAGGAAAAACATATGATGAATTATCGAAATTCGTCGAT
ACTTTTGGTGCTCGTCCCTCAGGATCACAAATACTCGAAGACTCTATTGATTACATGATT
CAGCTGACTAAGGACGAGGATATAAATGACATCGTTACGGAGGAACTAGAGGTACCACAT
TGGATGCGAGGAAAAGAAGAAATTACTATGATTGAACCACGAATAAAAAATATTGATTTA
TTAGGTTTAGGACAAAGTGTGAGCACACCATCTGAGGGTATTACCGCTGAGGTGATTGTG
GTAAATAACTTCGAAGAGTTGGCTGAGATACCTAACGAAGTTGTTGAAGGTAAAATTGTG
TTATACGACCCTATTTTTACGACATATCGTGAAACAGTTGTATATAGATCACAGGGTGCT
GTTAGAGCAGCTGAAAAGGGAGCGGTTGCGTCATTAGTGAGGTCTATTGCACCATTCTCT
ATTAATTCACCTCATACTGGTTCACAAAATTATAATAATAATGTTAAAAAGATTCCAACT
GCAGCCATTTCCATCGAAGATGCTGATTTAATGAGAAGACTGTTTAATAGAGGTCAAAAA
ATTATCTTAAATATAACAATGACGTCCACGTCTGAGACAAAAACCTCCAGGAATACGCTT
ATCGACCTAAAGGGAACTTTAAACCCAGAAAAGTTAGTTATTGTTTCTGGTCATATAGAT
AGTTGGGATGTTGGACAAGGTGCCATGGATGATGGTGGTGGCTTATTTGTAAGTTGGGCA
GTACCAGTCATTTTGAAACAACTAAATATGAAACCAAAGAGAACTATAAGGTCTATATTT
TGGACGGCTGAAGAGTTAGGATTAATTGGTGCTTACGCCTATGAGGAAAAACATAGAAAT
GAAAGTCATAACATAAATTTCATAATGGAATCCGATGAAGGTACATTCGCTCCACGTGGA
TTGGCTGTTGGTGGCAGTCAGAAAGCTCGATGTATTATAGCAGAAATTTTAAAACTATTC
GAGTCTATAAATGCTTCTACTCTCGTAGAAGAAGACAGTCCGGGCTCTGATATTAGCGTT
CTTATTAAAACCGGAATTCCAGGAGCCAGTCTTCATAATGCGAATGAAAAGTATTTTTGG
TTTCATCACACGGAGGGAGATACTATGAATGTAGAAAGTCCTGAAGAACTTGATCTATGC
GCGGCATTCTGGACTGCGGTGGCATATATAATAGCAGATATCTCTGCTGATATACCGCGT
TAA

Protein sequence:

MTQENSFCEPSNRPWWGRRGCVRLGLGRPLLTKTTAKANGRYVGVPRICHRQDTPETGRS
WQRPDLLPRKKGTIPAIRIAPTMPGYHRSRVTSPTPDRSAAALFEALQISPLAHPRPSSA
EGGCIDRILPFPLKCLLRGHDHLTEGGHRQPGPVTLENLRKREVHSAECCHKFLALCSPS
PAHLEGVLGQVHRSVALMAASSRLSPGRPIQIITEAPVPGENLRQTENRRKNYSQLKSCD
IGPLAEEIASYESVVKNIINYVVNGPFKGKTYDELSKFVDTFGARPSGSQILEDSIDYMI
QLTKDEDINDIVTEELEVPHWMRGKEEITMIEPRIKNIDLLGLGQSVSTPSEGITAEVIV
VNNFEELAEIPNEVVEGKIVLYDPIFTTYRETVVYRSQGAVRAAEKGAVASLVRSIAPFS
INSPHTGSQNYNNNVKKIPTAAISIEDADLMRRLFNRGQKIILNITMTSTSETKTSRNTL
IDLKGTLNPEKLVIVSGHIDSWDVGQGAMDDGGGLFVSWAVPVILKQLNMKPKRTIRSIF
WTAEELGLIGAYAYEEKHRNESHNINFIMESDEGTFAPRGLAVGGSQKARCIIAEILKLF
ESINASTLVEEDSPGSDISVLIKTGIPGASLHNANEKYFWFHHTEGDTMNVESPEELDLC
AAFWTAVAYIIADISADIPR