DPGLEAN01652 in OGS1.0

New model in OGS2.0DPOGS200548 
Genomic Positionscaffold3651:+ 3345-23081
See gene structure
CDS Length2898
Paired RNAseq reads  3993
Single RNAseq reads  9790
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010763 (0.0)
Best Drosophila hit  SP1029, isoform C (0.0)
Best Human hitaminopeptidase N precursor (1e-134)
Best NR hit (blastp)  aminopeptidase N-like protein [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to protease m1 zinc metalloprotease [Tribolium castaneum] (0.0)
GeneOntology terms



  
GO:0004046 aminoacylase activity
GO:0004177 aminopeptidase activity
GO:0008237 metallopeptidase activity
GO:0006508 proteolysis
GO:0008270 zinc ion binding
InterPro families
  
IPR001930 Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
IPR014782 Peptidase M1, membrane alanine aminopeptidase, N-terminal
Orthology groupMCL10093

Nucleotide sequence:

ATGATCTTTTTCCTCTGCACATCACAGATTCAACAAGAGCCACTATTTCTTGTCGCCAGT
CGGCATTTAGGTGTCGAAAGAGAACCTCGCGCCGACACAAGACACAACATGGAGTGCTTG
AAGGTGCTGTTCCTCCTGTCCTCCGTCCAGTTGAGCCGGCAGTACTTGCTGCCAGATCAC
ATCGCTCCCTCACACTACCAACTCAGACTCCTGTACGACATCGACCCCAGCACCAACTTC
AGCTTCTTCGGCGTCGCTGATATTCAGCTAACAGTAAAAAAGAGCACTTCGAAGATAATT
CTCCATGCGCAAGATTATATGATATCAGATGACAAAGTGAGTGTCGTTGGACAAAAAGAG
GTTCCCAAAGTGACGGGAGTAAAACTGAATGATACGTACAACTTCTTAGAAATATCACTT
GATAAGGATTTAGAGGAAAATGGGAAGTACAAACTCACGATACCCTTCTACGGCAACCTG
GTCAAAGGTTTGGACGGAGCCTACATAAGCTCCTACACGAACAGACAGACTCAGAAGACA
GAGTATTTAATTTCCACTCAGTTTGAGGCGATATCAGCTCGCAAGGGTTTCCCGTGTTTC
GACGAACCCATGTACAAAGCCACCTACTCTATCATCATCGGTCACAGCAAGGAGTACACG
GCCGTCTCCAACATGCCACTAGCGGCGTCCGCCTCTGAAAATGCCCTAGAAGATTACTGG
CCCTGGGACGTAGTCGGAAAGAGGTTTAGGAAGGAGAGATCTTCATTTGTCTGGGATCAG
TTCGCCAAGTCTGTGCCTATGTCTACATATCTGGTCGCGTTCGTGGTGTCCAAGTTCTCG
CACGTGGTCAGCCCTCCGGAACTATCGAAGACACAGTTCAGGATATGGGCCAGAGGAGAC
GCCATCGATCAGACATCCTACGCGGCTAAGATCGGTCCTCAAGTGTTGTCCTACTTTGAG
AAGTGGTTCAACGTGTCGTTTCCTCTGCCGAAGCAGGACATGATGGCCATACCAGACTTC
TCAGCGGGGGCTATGGAGAACTGGGGCCTCATCACGTACAGAGAGACGGCACTCCTGTAC
AGCGATAAGGAATCGTCGTTCTTGAACAAGGAGAGGATAGCTGAGGTGGTAGCTCATGAG
CTGGCCCATCAGTGGTTCGGTAACCTGGTGACCATGAAGTGGTGGTCGGACCTGTGGCTG
AACGAGGGGTTCGCGACCTTCGTGTCTAGTGTGGGCGTGTCGGCCGTGGAGCCGACCTGG
CGAGCTGATCGGTCCTACGCCGTGGAGAACACGCTCTCCGTGTTGAGTTTAGACGCCTTG
GAGTCATCTCATCCCGTGTCAGCGCCTCTCGATGATCCGAAGCGCATCTCGGAGATCTTC
GACGCGATCTCTTACAGGAAGGGCTCCACTCTCATCCGCATGATGCTGATGTTCCTCGGA
GAAGGTGTCTTCAGGCAGGCGCTGCACAACTACCTGATGAAGTATTCGTATTCAAACGCC
GAGCAGGATGATCTCTGGGCGGAGCTGACGGCAGCCAGCCTGAGGAGTGGAAGCCTTACG
AGGAACATCACCGTTAAAGAGGTGATGGACACCTGGACCACACAGACGGGATACCCGATC
CTCACCGTCACCAGGGACTACTCCGACAAGTCGCTTACAATCTCACAGAAGCGTTACCTG
TCTCTGGGCGTCGGTCGGACCTCCCAAGCGTGGTGGGTCCCTCTAAGCGTTCTCTGTGAG
AAAGACAGAAAAAGCGAGAGCGAGAGCGTCCAGTGGTTAGGAGATACGGAGGGAGTGACG
AACGAACATAGATACGAACACGGCTCTGGAGCGAGCGAGTGGGTTCTGTTCAACTACAAC
ATGATCGCTCCATACAGAGTCAACTACGATCAGAGAAATTGGAAGCTTCTCATACAGACT
CTGACGAGTGACCAGTACACCCTCATCCCGGTCGAAGGTCGAGTGCAGTTGCTGTCCGAC
GCTTTTGAGCTGGCGTGGAACAATCAGCTCGACTATGGAATGACTTTACAGTTGGCGAGC
TACCTGAAGAGGGAGACGGAATACTTGCCTCTCTACACGGGGCTGTCGGCTTTAGCTAAG
ATTGAGAACGTACTGAAACGAAGTTCCGAGTACGGAGCCTTCCAGAAGTTTATCAGAAGA
CTCCTCAACAACGTCTACCAGAAAGGAGGTTTGGCTCTGAAGAGGATCGTCGACGGCGAC
GACTTGAACAGCGTCAAGCTTCAGACGACTGTGAGCTCTTGGGCCTGCAGCGTGAAGATC
CCCGGCTGTGAGGAGAACGCTATAGACATGTTCAACGACTGGATGAGGACGGACAGACCC
GACGAAAACAATCCGATTCCCGTGGACCTCCGCCGCACTGTATATTGTTCGGCTATCCGT
CGTGGCGGGGTGTCGTTGTGGCGCTGGTCCCTCGCCCGCCGCCGGGCCTCCAACGTGGCG
ACTTCCCGGGACGCCCTGCAGCACGCCCTGGCCTGCAGCAGAGACGTCTGGGTTCTGGCG
CAGTACTTGGAGTGGACGGTGTCTGACGGCAGCGAGGTGCGTCGTCAGGATGCCGGCAAC
GTCATCGCAGCCGTCACCCGGTCTGCCACCGGATACTATGTGGCTAAGGACTTCATATAC
GGACGAATCCAGGAAATTAGCAAAGCGTTCAACGGCCAGGACAGGAGAATGGGCGGCATC
ATAAAGACCCTGTTGGGGCAGTTCACGACCAAGAAGGAACTCGATGAGTTCTTGGAGTGG
AAGAAGCTGAACGAAAAATATTTGTCGGCTTCAAAGATAGCGGTCGCTCAGGGGATAGAG
AACGCTAGAGTGAACATAGAGTGGATCCAGAGAAACAAACGTACCGTAGTGGATAAGATG
AGGGAGTACTCCATGTGA

Protein sequence:

MIFFLCTSQIQQEPLFLVASRHLGVEREPRADTRHNMECLKVLFLLSSVQLSRQYLLPDH
IAPSHYQLRLLYDIDPSTNFSFFGVADIQLTVKKSTSKIILHAQDYMISDDKVSVVGQKE
VPKVTGVKLNDTYNFLEISLDKDLEENGKYKLTIPFYGNLVKGLDGAYISSYTNRQTQKT
EYLISTQFEAISARKGFPCFDEPMYKATYSIIIGHSKEYTAVSNMPLAASASENALEDYW
PWDVVGKRFRKERSSFVWDQFAKSVPMSTYLVAFVVSKFSHVVSPPELSKTQFRIWARGD
AIDQTSYAAKIGPQVLSYFEKWFNVSFPLPKQDMMAIPDFSAGAMENWGLITYRETALLY
SDKESSFLNKERIAEVVAHELAHQWFGNLVTMKWWSDLWLNEGFATFVSSVGVSAVEPTW
RADRSYAVENTLSVLSLDALESSHPVSAPLDDPKRISEIFDAISYRKGSTLIRMMLMFLG
EGVFRQALHNYLMKYSYSNAEQDDLWAELTAASLRSGSLTRNITVKEVMDTWTTQTGYPI
LTVTRDYSDKSLTISQKRYLSLGVGRTSQAWWVPLSVLCEKDRKSESESVQWLGDTEGVT
NEHRYEHGSGASEWVLFNYNMIAPYRVNYDQRNWKLLIQTLTSDQYTLIPVEGRVQLLSD
AFELAWNNQLDYGMTLQLASYLKRETEYLPLYTGLSALAKIENVLKRSSEYGAFQKFIRR
LLNNVYQKGGLALKRIVDGDDLNSVKLQTTVSSWACSVKIPGCEENAIDMFNDWMRTDRP
DENNPIPVDLRRTVYCSAIRRGGVSLWRWSLARRRASNVATSRDALQHALACSRDVWVLA
QYLEWTVSDGSEVRRQDAGNVIAAVTRSATGYYVAKDFIYGRIQEISKAFNGQDRRMGGI
IKTLLGQFTTKKELDEFLEWKKLNEKYLSASKIAVAQGIENARVNIEWIQRNKRTVVDKM
REYSM