DPGLEAN03343 in OGS1.0

New model in OGS2.0DPOGS209137 
Genomic Positionscaffold2106:+ 19533-27490
See gene structure
CDS Length1974
Paired RNAseq reads  1912
Single RNAseq reads  4964
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001310 (2e-23)
Best Drosophila hit  CG13340 (1e-75)
Best Human hitcytosol aminopeptidase (8e-59)
Best NR hit (blastp)  Cytosol aminopeptidase, putative [Pediculus humanus corporis] (2e-114)
Best NR hit (blastx)  Cytosol aminopeptidase, putative [Pediculus humanus corporis] (2e-87)
GeneOntology terms



  
GO:0004177 aminopeptidase activity
GO:0008235 metalloexopeptidase activity
GO:0030145 manganese ion binding
GO:0006508 proteolysis
GO:0005737 cytoplasm
InterPro families

  
IPR011356 Peptidase M17
IPR000819 Peptidase M17, leucyl aminopeptidase, C-terminal
IPR008283 Peptidase M17, leucyl aminopeptidase, N-terminal
Orthology groupMCL10468

Nucleotide sequence:

ATGGCTTTTACAAAATTATTTTCGAGTGTTAATTTATGGAGAATTAGTTATAGCACAATA
TCAAAGAGATATTGTAGTGCTGTCTCCGAAGATTCACCCACGTGTGGGAAGAAATCTGAT
GAAGCTAGGCAGAGCAGTAATGACCAACCAGAGAACAAAAAGGGTTTGGTCCTTGGCGTA
TATGAAGAGGGGGAAAAGTTTGAATTGACACCAGTCGCTGAGGAAATAAACCAGAAGAGT
GGCGGCAAGATATGCAAGCATCTAAACGAAATGTCATGTCACCTGAAACACGGCAAAGCA
TTCGTGGTGACGGATATTTTGGAGGAGTTTGGACCGGTGGCCATAGCGTCTCTCGGCAAG
AAGAATCCAGGATACAATGAGCTGGAGATGTTGGATGAGACCAGGAGATATTGTAGTGCT
GTCTCCGAAGATTCACCCACGTGTGGGAAGAAATCTGATGAAGCTAGGCAGAGCGGTCGC
AGTAATGACCAGCCAGAGAACAAAAAGGGTTTGGTCCTTGGCGTATATGAAGAGGGGGAA
AAGTTTGAATTGACACCAGTCGCTGAGGAAATAAACCAGAAGAGTGGCGGCAAGATATGC
AAGCATCTAAACGAAATGTCATGTCACCTGAAACACGGCAAAGCATTCGTGGTGACGGAT
ATTTTGGAGGAGTTTGGACCGGTGGCCATAGCGTCTCTCGGCAAGAAAAATCCAGGGTAC
AATGAGCTGGAGATGTTGGATGAGACCAGGGAAAATCTCCGCGTGGGTGTGGGTGTGGGG
GTGCGTGAGTTGGTGAAGAGAGGTTGTGATCATGTGTACGTGGACGGAGGAACAGAGCCT
GACGCCGCCGCCGAGGCCGCCCATCTAGCAGCTTGGAGGTTCGAGGAGTTCAAATCGTCT
GGGGCGAAGTCCTTCCAGACAGATGTATTCCTCCAGGGGTCGGGTGAGGAGCTGTGGAAA
CGCGGCACGATTTTCGGTTCTGGACAAAACTGGGCCAGACACCTCACCGACATGCCTCCC
AATAAGATGACGCCCGTTGACTTCGCACAGGCGGTGTTAGACATGTTATGTCCCCTGGGC
GTTCACGTGACGGCCCACGACTCGGCGTGGATCGAAGCTCAGCGGATGGAGGCGCTCCTG
TCGGTTTCCCGTGGTTCCTGTGAGCCGGCCGTGTTTCTGGAGTGCGAGTACCGAGCGGGC
GGGGACCGGCCGCCCGTCCTGCTAGCGGCCAAGGGAATCACATTCGACAGTGGCGGTTTA
TGTCTGAAGAAGGCTGATGAAATGCGAGAGAACCCGGACAGCCGCGCGGGGGCCGCCGCC
ACAGTCGGCGCTCTCAAGATACTCGCGGAGATGAAGGTGCCCATTAACGTGGTGGCAGTG
ATACCGCTGTGCGAGAGTATGGTGAGCGGCAGCTGTATGAAGGTCGGGGACGTCTTGAGA
GCACTCAACGGACTCACCATGCAGGTGGAGTGCACAGCCCAAGCAGGCCGCCTCACTCTG
GCAGACGCACTGGTCTACGGACAGGCCAAGCATAGACCCTCGCTAGTCGTAGACCTGGCG
TCACTAACAAGAGGAGTGCAGCTAGCTACGGGCAGCGCGGCTTTCGGCGTGTTCAGTTCC
AGCGGCGAGGCGTGGGCGGCGCTCGCACAGTCCGCGGCACGAGCTGGGGACAGAGGCTGG
AGGCTGCCTCTCTGGAGCTATTACCGCGCTATGATCGATGATGACCCCTCTGTGGATCTG
AGGAACAGGGGTCCAGGAACGGCTGCACCATGCGTGGGAGCCGCGTTTCTCAAGAACTTC
GTGTGTGCACCGTGGCTTCACCTGGACGTGTCGGGCGTGTCCCGGGGCGGCACTCCCTAC
CTGCCCGCGCCCCGGGCCGCCGGTCGGCCTGCGAGGACACTCGCAGAATTCCTCACCGCC
GCCGGCACAGCAAGTGCAAATGTCAAGGACTCCGACTCACCAGCTACATCTTAA

Protein sequence:

MAFTKLFSSVNLWRISYSTISKRYCSAVSEDSPTCGKKSDEARQSSNDQPENKKGLVLGV
YEEGEKFELTPVAEEINQKSGGKICKHLNEMSCHLKHGKAFVVTDILEEFGPVAIASLGK
KNPGYNELEMLDETRRYCSAVSEDSPTCGKKSDEARQSGRSNDQPENKKGLVLGVYEEGE
KFELTPVAEEINQKSGGKICKHLNEMSCHLKHGKAFVVTDILEEFGPVAIASLGKKNPGY
NELEMLDETRENLRVGVGVGVRELVKRGCDHVYVDGGTEPDAAAEAAHLAAWRFEEFKSS
GAKSFQTDVFLQGSGEELWKRGTIFGSGQNWARHLTDMPPNKMTPVDFAQAVLDMLCPLG
VHVTAHDSAWIEAQRMEALLSVSRGSCEPAVFLECEYRAGGDRPPVLLAAKGITFDSGGL
CLKKADEMRENPDSRAGAAATVGALKILAEMKVPINVVAVIPLCESMVSGSCMKVGDVLR
ALNGLTMQVECTAQAGRLTLADALVYGQAKHRPSLVVDLASLTRGVQLATGSAAFGVFSS
SGEAWAALAQSAARAGDRGWRLPLWSYYRAMIDDDPSVDLRNRGPGTAAPCVGAAFLKNF
VCAPWLHLDVSGVSRGGTPYLPAPRAAGRPARTLAEFLTAAGTASANVKDSDSPATS