DPGLEAN11987 in OGS1.0

New model in OGS2.0  DPOGS213400
Genomic Position  scaffold163:+ 49664-55239
CDS Length  2058
Paired RNAseq reads  5894
Single RNAseq reads  14867
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009138 (2e-121)
Best Drosophila hit  granny smith, isoform B (1e-132)
Best Human hit  probable aminopeptidase NPEPL1 (2e-130)
Best NR hit (blastp)  PREDICTED: similar to GA20276-PA [Tribolium castaneum] (5e-162)
Best NR hit (blastx)  PREDICTED: similar to GA20276-PA [Tribolium castaneum] (3e-145)
GeneOntology terms
GO:0005622 intracellular
GO:0006508 proteolysis
GO:0004177 aminopeptidase activity
InterPro families
IPR000819 Peptidase M17, leucyl aminopeptidase, C-terminal
IPR011356 Peptidase M17
Orthology group  MCL17823

Nucleotide sequence:

ATGTTGAAGTTGTGGCTCGGGCTCGCGCTGTCGCCTGATTCATCAGCTTTATTCGGGGAT
CGCAATAGTTATGGGATGAGTCTAAAAAGGCCATCGGAGCTCTGTAAACACCTAAGAGTT
TCCAAGAGATATATCCTGGGGAAATCCCATGACGATGTCATTCCATCGCTCCCAAAAGAC
AAAGATGTCCCAGAGCTAGAGTCACAACTTCAATTACATAAGCAGTTTATGATAGGAGCA
CAAAGCAAAAGAGTAGGGTTAGGATCAAATCAGAGTAATTTAGTAAGATGGGGTAAAGAT
ACCGACAAAACTTGCTATATCTATGGGAAGGCAGTTGGAACTGCTAAGCATTTGCTGGTG
GGATGTAAGGTACTACTGGATAGCAGTCAATACTCGCGTCGTCACGATAAAGTTCTGGAA
ATCATACGTGAAGCGCGAGGGGCCGAGCGCCTCCTTTACGTAAGGAGGATGTCAAACTCG
GTGAACATCAAGTTTAAATGGGGTCTGAGTACTTCGGACCCCGAGCAAAAGCCTGTGCTG
TTTGTGGGTCAAACGGCACACATAGCAGCTCTGTCCTGGCAAGATGTCCGCTGTAAGCTG
GAGCCTAGAGTCACTGAAGAGGTGTGGCGGCGTGCAGTGTCCGTGATGGAGGGAGGCGAG
GTGTGCGAGGTGTGGCCGCGGGGCGTAGCCCTGGGCGCTCTGCCACCGCGGCGCTCCCGA
CACGCGGCGCCGGCTCGATCACACGCCCTGTCCAAGCTGGTCAGAACGTCTCTGAGGTCC
GCTGCCAGCGAGTTTGTCGTGTTGGTGTGTCGCAAGCGTGACGTATTGTCGAGCGCGGTG
GGTGTGGCGCGCGCCGTGCCGCTGTACTCGGCCAGCTCCGGGCCTGCGCCCCTCGCCCAC
GGGAACCATCACGACGCGGCCTGCGCCACACCGCGCACGCTCACCGTGGAGATACAGCTC
GTCCAAGATGACGGTGTGGAGGACGACGAGGACTTGGACCCCGCGGAGCCAATCTTGAGG
GACGGCGTTCTGTCCTCGGAGGACCTCAAGACCATACAGGACGTCGCGGACGCGACCCGC
CTCGCCGCCCGGATCACCGACACACCCGCCAACATCATGGACGTGGACGCGTTCATACAG
GAAGCTATAAACCTCGCCAAGGAGCTGGAGATCCCCCCGCCCACGATCATCCGCGGCGAA
GAGTTGAAGGCGCGTGGTATGGGCGGCTTGTACGGCGTGGGCAAGGCGGCCGCTCGTCCG
CCCGCCCTCGTCGCGCTGTCCTACCGCCCGCCCTCCGCCAGCCAGACGGTCGCGTGGGTC
GGGAAGGGCATCGTCTACGACACCGGCGGGCTCAGTCTCAAGGCTCCCAAGTCGATGTGC
GGTATGAAGTATGACTGCGGCGGCGCGGCAGCCGTGCTGGGCGCCTTCAGCGCGGTCGTC
AGGGCTCGGCCGTCGGTGGCGCTCCACGCCGTGCTCTGCCTGGCCGAGAACGCGATCGGT
CCGCTCGCCACCAGGCCGGACGACATCCACCAGCTGTACTCGGGCCGCACGGTGGAGATC
AACAACACGGACGCCGAGGGCCGGCTGGTGCTGGCGGACGGCGTGGTGTTCGCGCAGAGA
GACCTCAAGGCCGACACTATCGTGGACGTCGCCACGCTGACGGGAGCCCAGGGCATAGCG
ACGGGCAAGTACCACGCGGCCGTCGTGTCCAACTGCGGCTCCCTGGAGGCGAGCTGCGTC
CGCGCGGGTCGCATCAGCGGCGACCTCACCCACCCACTGCCCTTCGCACCCGAACTGCAC
TTCTACGAGTTCAGCAGCGCCGTCGCCGACATGAAGAACAGCGTCGCCGACCGGGAGAAC
GCGCAGTCTTCGTGCGCCGGACTGTTCGTCCTATCGCACCTCGGTTTCGACTTCCCCGGC
CGCTGGCTGCACGTGGACATGGCCGCTCCCTCCAGATGTGGTGACCGGGCGACCGGGTAC
GGCGTGGCGCTGCTCGCGGTGCTGTTCGGAGGCTCCACGGACAGCCGGCTGCTGCGGGCG
CTGGCTCCCCACAAGTGA

Protein sequence:

MLKLWLGLALSPDSSALFGDRNSYGMSLKRPSELCKHLRVSKRYILGKSHDDVIPSLPKD
KDVPELESQLQLHKQFMIGAQSKRVGLGSNQSNLVRWGKDTDKTCYIYGKAVGTAKHLLV
GCKVLLDSSQYSRRHDKVLEIIREARGAERLLYVRRMSNSVNIKFKWGLSTSDPEQKPVL
FVGQTAHIAALSWQDVRCKLEPRVTEEVWRRAVSVMEGGEVCEVWPRGVALGALPPRRSR
HAAPARSHALSKLVRTSLRSAASEFVVLVCRKRDVLSSAVGVARAVPLYSASSGPAPLAH
GNHHDAACATPRTLTVEIQLVQDDGVEDDEDLDPAEPILRDGVLSSEDLKTIQDVADATR
LAARITDTPANIMDVDAFIQEAINLAKELEIPPPTIIRGEELKARGMGGLYGVGKAAARP
PALVALSYRPPSASQTVAWVGKGIVYDTGGLSLKAPKSMCGMKYDCGGAAAVLGAFSAVV
RARPSVALHAVLCLAENAIGPLATRPDDIHQLYSGRTVEINNTDAEGRLVLADGVVFAQR
DLKADTIVDVATLTGAQGIATGKYHAAVVSNCGSLEASCVRAGRISGDLTHPLPFAPELH
FYEFSSAVADMKNSVADRENAQSSCAGLFVLSHLGFDFPGRWLHVDMAAPSRCGDRATGY
GVALLAVLFGGSTDSRLLRALAPHK
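
Consistency check (illustrative sketch):

The CDS length listed for this record (2058 nt) corresponds to 686 codons, that is, 685 residues plus a terminal stop codon. This agrees with a nucleotide sequence that begins with ATG and ends with TGA. The Python sketch below shows one way such a check could be scripted. The function name `cds_summary` and the returned field names are hypothetical and not part of the database. The short test string is built only from the first four and last six codons of the sequence above, not the full CDS.

```python
# Hypothetical helper for sanity-checking a coding sequence (CDS).
# Assumes the standard stop codons TAA, TAG and TGA.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_summary(nucleotides: str) -> dict:
    """Summarise basic properties of a CDS given as (possibly wrapped) text."""
    # Remove line breaks and spaces, normalise to upper case.
    seq = "".join(nucleotides.split()).upper()
    # Split into complete codons; trailing bases that do not fill a codon are ignored.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    has_stop = bool(codons) and codons[-1] in STOP_CODONS
    return {
        "length": len(seq),
        "in_frame": len(seq) % 3 == 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": has_stop,
        # Stop codons anywhere before the final codon.
        "internal_stops": sum(c in STOP_CODONS for c in codons[:-1]),
        # Residues expected after translation, excluding the terminal stop.
        "expected_protein_length": len(codons) - (1 if has_stop else 0),
    }


# Toy input: first four and last six codons of the record, wrapped over two lines.
example = cds_summary("ATGTTGAAGTTG\nCTGGCTCCCCACAAGTGA")
print(example)
```

Applied to the full nucleotide sequence of a record, `expected_protein_length` can be compared with the length of the listed protein sequence.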