New model in OGS2.0 | DPOGS209137  |
---|---|
Genomic Position | scaffold2106:+ 19533-27490 |
See gene structure | |
CDS Length | 1974 |
Paired RNAseq reads   | 1912 |
Single RNAseq reads   | 4964 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001310 (2e-23) |
Best Drosophila hit   | CG13340 (1e-75) |
Best Human hit | cytosol aminopeptidase (8e-59) |
Best NR hit (blastp)   | Cytosol aminopeptidase, putative [Pediculus humanus corporis] (2e-114) |
Best NR hit (blastx)   | Cytosol aminopeptidase, putative [Pediculus humanus corporis] (2e-87) |
GeneOntology terms    | GO:0004177 aminopeptidase activity GO:0008235 metalloexopeptidase activity GO:0030145 manganese ion binding GO:0006508 proteolysis GO:0005737 cytoplasm |
InterPro families    | IPR011356 Peptidase M17 IPR000819 Peptidase M17, leucyl aminopeptidase, C-terminal IPR008283 Peptidase M17, leucyl aminopeptidase, N-terminal |
Orthology group | MCL10468 |
Nucleotide sequence:
ATGGCTTTTACAAAATTATTTTCGAGTGTTAATTTATGGAGAATTAGTTATAGCACAATA
TCAAAGAGATATTGTAGTGCTGTCTCCGAAGATTCACCCACGTGTGGGAAGAAATCTGAT
GAAGCTAGGCAGAGCAGTAATGACCAACCAGAGAACAAAAAGGGTTTGGTCCTTGGCGTA
TATGAAGAGGGGGAAAAGTTTGAATTGACACCAGTCGCTGAGGAAATAAACCAGAAGAGT
GGCGGCAAGATATGCAAGCATCTAAACGAAATGTCATGTCACCTGAAACACGGCAAAGCA
TTCGTGGTGACGGATATTTTGGAGGAGTTTGGACCGGTGGCCATAGCGTCTCTCGGCAAG
AAGAATCCAGGATACAATGAGCTGGAGATGTTGGATGAGACCAGGAGATATTGTAGTGCT
GTCTCCGAAGATTCACCCACGTGTGGGAAGAAATCTGATGAAGCTAGGCAGAGCGGTCGC
AGTAATGACCAGCCAGAGAACAAAAAGGGTTTGGTCCTTGGCGTATATGAAGAGGGGGAA
AAGTTTGAATTGACACCAGTCGCTGAGGAAATAAACCAGAAGAGTGGCGGCAAGATATGC
AAGCATCTAAACGAAATGTCATGTCACCTGAAACACGGCAAAGCATTCGTGGTGACGGAT
ATTTTGGAGGAGTTTGGACCGGTGGCCATAGCGTCTCTCGGCAAGAAAAATCCAGGGTAC
AATGAGCTGGAGATGTTGGATGAGACCAGGGAAAATCTCCGCGTGGGTGTGGGTGTGGGG
GTGCGTGAGTTGGTGAAGAGAGGTTGTGATCATGTGTACGTGGACGGAGGAACAGAGCCT
GACGCCGCCGCCGAGGCCGCCCATCTAGCAGCTTGGAGGTTCGAGGAGTTCAAATCGTCT
GGGGCGAAGTCCTTCCAGACAGATGTATTCCTCCAGGGGTCGGGTGAGGAGCTGTGGAAA
CGCGGCACGATTTTCGGTTCTGGACAAAACTGGGCCAGACACCTCACCGACATGCCTCCC
AATAAGATGACGCCCGTTGACTTCGCACAGGCGGTGTTAGACATGTTATGTCCCCTGGGC
GTTCACGTGACGGCCCACGACTCGGCGTGGATCGAAGCTCAGCGGATGGAGGCGCTCCTG
TCGGTTTCCCGTGGTTCCTGTGAGCCGGCCGTGTTTCTGGAGTGCGAGTACCGAGCGGGC
GGGGACCGGCCGCCCGTCCTGCTAGCGGCCAAGGGAATCACATTCGACAGTGGCGGTTTA
TGTCTGAAGAAGGCTGATGAAATGCGAGAGAACCCGGACAGCCGCGCGGGGGCCGCCGCC
ACAGTCGGCGCTCTCAAGATACTCGCGGAGATGAAGGTGCCCATTAACGTGGTGGCAGTG
ATACCGCTGTGCGAGAGTATGGTGAGCGGCAGCTGTATGAAGGTCGGGGACGTCTTGAGA
GCACTCAACGGACTCACCATGCAGGTGGAGTGCACAGCCCAAGCAGGCCGCCTCACTCTG
GCAGACGCACTGGTCTACGGACAGGCCAAGCATAGACCCTCGCTAGTCGTAGACCTGGCG
TCACTAACAAGAGGAGTGCAGCTAGCTACGGGCAGCGCGGCTTTCGGCGTGTTCAGTTCC
AGCGGCGAGGCGTGGGCGGCGCTCGCACAGTCCGCGGCACGAGCTGGGGACAGAGGCTGG
AGGCTGCCTCTCTGGAGCTATTACCGCGCTATGATCGATGATGACCCCTCTGTGGATCTG
AGGAACAGGGGTCCAGGAACGGCTGCACCATGCGTGGGAGCCGCGTTTCTCAAGAACTTC
GTGTGTGCACCGTGGCTTCACCTGGACGTGTCGGGCGTGTCCCGGGGCGGCACTCCCTAC
CTGCCCGCGCCCCGGGCCGCCGGTCGGCCTGCGAGGACACTCGCAGAATTCCTCACCGCC
GCCGGCACAGCAAGTGCAAATGTCAAGGACTCCGACTCACCAGCTACATCTTAA
Protein sequence:
MAFTKLFSSVNLWRISYSTISKRYCSAVSEDSPTCGKKSDEARQSSNDQPENKKGLVLGV
YEEGEKFELTPVAEEINQKSGGKICKHLNEMSCHLKHGKAFVVTDILEEFGPVAIASLGK
KNPGYNELEMLDETRRYCSAVSEDSPTCGKKSDEARQSGRSNDQPENKKGLVLGVYEEGE
KFELTPVAEEINQKSGGKICKHLNEMSCHLKHGKAFVVTDILEEFGPVAIASLGKKNPGY
NELEMLDETRENLRVGVGVGVRELVKRGCDHVYVDGGTEPDAAAEAAHLAAWRFEEFKSS
GAKSFQTDVFLQGSGEELWKRGTIFGSGQNWARHLTDMPPNKMTPVDFAQAVLDMLCPLG
VHVTAHDSAWIEAQRMEALLSVSRGSCEPAVFLECEYRAGGDRPPVLLAAKGITFDSGGL
CLKKADEMRENPDSRAGAAATVGALKILAEMKVPINVVAVIPLCESMVSGSCMKVGDVLR
ALNGLTMQVECTAQAGRLTLADALVYGQAKHRPSLVVDLASLTRGVQLATGSAAFGVFSS
SGEAWAALAQSAARAGDRGWRLPLWSYYRAMIDDDPSVDLRNRGPGTAAPCVGAAFLKNF
VCAPWLHLDVSGVSRGGTPYLPAPRAAGRPARTLAEFLTAAGTASANVKDSDSPATS