New model in OGS2.0 | DPOGS200548  |
---|---|
Genomic Position | scaffold3651:+ 3345-23081 |
See gene structure | |
CDS Length | 2898 |
Paired RNAseq reads   | 3993 |
Single RNAseq reads   | 9790 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010763 (0.0) |
Best Drosophila hit   | SP1029, isoform C (0.0) |
Best Human hit | aminopeptidase N precursor (1e-134) |
Best NR hit (blastp)   | aminopeptidase N-like protein [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to protease m1 zinc metalloprotease [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0004046 aminoacylase activity GO:0004177 aminopeptidase activity GO:0008237 metallopeptidase activity GO:0006508 proteolysis GO:0008270 zinc ion binding |
InterPro families    | IPR001930 Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase IPR014782 Peptidase M1, membrane alanine aminopeptidase, N-terminal |
Orthology group | MCL10093 |
Nucleotide sequence:
ATGATCTTTTTCCTCTGCACATCACAGATTCAACAAGAGCCACTATTTCTTGTCGCCAGT
CGGCATTTAGGTGTCGAAAGAGAACCTCGCGCCGACACAAGACACAACATGGAGTGCTTG
AAGGTGCTGTTCCTCCTGTCCTCCGTCCAGTTGAGCCGGCAGTACTTGCTGCCAGATCAC
ATCGCTCCCTCACACTACCAACTCAGACTCCTGTACGACATCGACCCCAGCACCAACTTC
AGCTTCTTCGGCGTCGCTGATATTCAGCTAACAGTAAAAAAGAGCACTTCGAAGATAATT
CTCCATGCGCAAGATTATATGATATCAGATGACAAAGTGAGTGTCGTTGGACAAAAAGAG
GTTCCCAAAGTGACGGGAGTAAAACTGAATGATACGTACAACTTCTTAGAAATATCACTT
GATAAGGATTTAGAGGAAAATGGGAAGTACAAACTCACGATACCCTTCTACGGCAACCTG
GTCAAAGGTTTGGACGGAGCCTACATAAGCTCCTACACGAACAGACAGACTCAGAAGACA
GAGTATTTAATTTCCACTCAGTTTGAGGCGATATCAGCTCGCAAGGGTTTCCCGTGTTTC
GACGAACCCATGTACAAAGCCACCTACTCTATCATCATCGGTCACAGCAAGGAGTACACG
GCCGTCTCCAACATGCCACTAGCGGCGTCCGCCTCTGAAAATGCCCTAGAAGATTACTGG
CCCTGGGACGTAGTCGGAAAGAGGTTTAGGAAGGAGAGATCTTCATTTGTCTGGGATCAG
TTCGCCAAGTCTGTGCCTATGTCTACATATCTGGTCGCGTTCGTGGTGTCCAAGTTCTCG
CACGTGGTCAGCCCTCCGGAACTATCGAAGACACAGTTCAGGATATGGGCCAGAGGAGAC
GCCATCGATCAGACATCCTACGCGGCTAAGATCGGTCCTCAAGTGTTGTCCTACTTTGAG
AAGTGGTTCAACGTGTCGTTTCCTCTGCCGAAGCAGGACATGATGGCCATACCAGACTTC
TCAGCGGGGGCTATGGAGAACTGGGGCCTCATCACGTACAGAGAGACGGCACTCCTGTAC
AGCGATAAGGAATCGTCGTTCTTGAACAAGGAGAGGATAGCTGAGGTGGTAGCTCATGAG
CTGGCCCATCAGTGGTTCGGTAACCTGGTGACCATGAAGTGGTGGTCGGACCTGTGGCTG
AACGAGGGGTTCGCGACCTTCGTGTCTAGTGTGGGCGTGTCGGCCGTGGAGCCGACCTGG
CGAGCTGATCGGTCCTACGCCGTGGAGAACACGCTCTCCGTGTTGAGTTTAGACGCCTTG
GAGTCATCTCATCCCGTGTCAGCGCCTCTCGATGATCCGAAGCGCATCTCGGAGATCTTC
GACGCGATCTCTTACAGGAAGGGCTCCACTCTCATCCGCATGATGCTGATGTTCCTCGGA
GAAGGTGTCTTCAGGCAGGCGCTGCACAACTACCTGATGAAGTATTCGTATTCAAACGCC
GAGCAGGATGATCTCTGGGCGGAGCTGACGGCAGCCAGCCTGAGGAGTGGAAGCCTTACG
AGGAACATCACCGTTAAAGAGGTGATGGACACCTGGACCACACAGACGGGATACCCGATC
CTCACCGTCACCAGGGACTACTCCGACAAGTCGCTTACAATCTCACAGAAGCGTTACCTG
TCTCTGGGCGTCGGTCGGACCTCCCAAGCGTGGTGGGTCCCTCTAAGCGTTCTCTGTGAG
AAAGACAGAAAAAGCGAGAGCGAGAGCGTCCAGTGGTTAGGAGATACGGAGGGAGTGACG
AACGAACATAGATACGAACACGGCTCTGGAGCGAGCGAGTGGGTTCTGTTCAACTACAAC
ATGATCGCTCCATACAGAGTCAACTACGATCAGAGAAATTGGAAGCTTCTCATACAGACT
CTGACGAGTGACCAGTACACCCTCATCCCGGTCGAAGGTCGAGTGCAGTTGCTGTCCGAC
GCTTTTGAGCTGGCGTGGAACAATCAGCTCGACTATGGAATGACTTTACAGTTGGCGAGC
TACCTGAAGAGGGAGACGGAATACTTGCCTCTCTACACGGGGCTGTCGGCTTTAGCTAAG
ATTGAGAACGTACTGAAACGAAGTTCCGAGTACGGAGCCTTCCAGAAGTTTATCAGAAGA
CTCCTCAACAACGTCTACCAGAAAGGAGGTTTGGCTCTGAAGAGGATCGTCGACGGCGAC
GACTTGAACAGCGTCAAGCTTCAGACGACTGTGAGCTCTTGGGCCTGCAGCGTGAAGATC
CCCGGCTGTGAGGAGAACGCTATAGACATGTTCAACGACTGGATGAGGACGGACAGACCC
GACGAAAACAATCCGATTCCCGTGGACCTCCGCCGCACTGTATATTGTTCGGCTATCCGT
CGTGGCGGGGTGTCGTTGTGGCGCTGGTCCCTCGCCCGCCGCCGGGCCTCCAACGTGGCG
ACTTCCCGGGACGCCCTGCAGCACGCCCTGGCCTGCAGCAGAGACGTCTGGGTTCTGGCG
CAGTACTTGGAGTGGACGGTGTCTGACGGCAGCGAGGTGCGTCGTCAGGATGCCGGCAAC
GTCATCGCAGCCGTCACCCGGTCTGCCACCGGATACTATGTGGCTAAGGACTTCATATAC
GGACGAATCCAGGAAATTAGCAAAGCGTTCAACGGCCAGGACAGGAGAATGGGCGGCATC
ATAAAGACCCTGTTGGGGCAGTTCACGACCAAGAAGGAACTCGATGAGTTCTTGGAGTGG
AAGAAGCTGAACGAAAAATATTTGTCGGCTTCAAAGATAGCGGTCGCTCAGGGGATAGAG
AACGCTAGAGTGAACATAGAGTGGATCCAGAGAAACAAACGTACCGTAGTGGATAAGATG
AGGGAGTACTCCATGTGA
Protein sequence:
MIFFLCTSQIQQEPLFLVASRHLGVEREPRADTRHNMECLKVLFLLSSVQLSRQYLLPDH
IAPSHYQLRLLYDIDPSTNFSFFGVADIQLTVKKSTSKIILHAQDYMISDDKVSVVGQKE
VPKVTGVKLNDTYNFLEISLDKDLEENGKYKLTIPFYGNLVKGLDGAYISSYTNRQTQKT
EYLISTQFEAISARKGFPCFDEPMYKATYSIIIGHSKEYTAVSNMPLAASASENALEDYW
PWDVVGKRFRKERSSFVWDQFAKSVPMSTYLVAFVVSKFSHVVSPPELSKTQFRIWARGD
AIDQTSYAAKIGPQVLSYFEKWFNVSFPLPKQDMMAIPDFSAGAMENWGLITYRETALLY
SDKESSFLNKERIAEVVAHELAHQWFGNLVTMKWWSDLWLNEGFATFVSSVGVSAVEPTW
RADRSYAVENTLSVLSLDALESSHPVSAPLDDPKRISEIFDAISYRKGSTLIRMMLMFLG
EGVFRQALHNYLMKYSYSNAEQDDLWAELTAASLRSGSLTRNITVKEVMDTWTTQTGYPI
LTVTRDYSDKSLTISQKRYLSLGVGRTSQAWWVPLSVLCEKDRKSESESVQWLGDTEGVT
NEHRYEHGSGASEWVLFNYNMIAPYRVNYDQRNWKLLIQTLTSDQYTLIPVEGRVQLLSD
AFELAWNNQLDYGMTLQLASYLKRETEYLPLYTGLSALAKIENVLKRSSEYGAFQKFIRR
LLNNVYQKGGLALKRIVDGDDLNSVKLQTTVSSWACSVKIPGCEENAIDMFNDWMRTDRP
DENNPIPVDLRRTVYCSAIRRGGVSLWRWSLARRRASNVATSRDALQHALACSRDVWVLA
QYLEWTVSDGSEVRRQDAGNVIAAVTRSATGYYVAKDFIYGRIQEISKAFNGQDRRMGGI
IKTLLGQFTTKKELDEFLEWKKLNEKYLSASKIAVAQGIENARVNIEWIQRNKRTVVDKM
REYSM