DPGLEAN01246 in OGS1.0

New model in OGS2.0DPOGS209841 
Genomic Positionscaffold6461:+ 3653-9458
See gene structure
CDS Length1872
Paired RNAseq reads  1452
Single RNAseq reads  3518
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008066 (0.0)
Best Drosophila hit  CG14516, isoform B (4e-75)
Best Human hitaminopeptidase N precursor (6e-56)
Best NR hit (blastp)  PREDICTED: similar to protease m1 zinc metalloprotease [Nasonia vitripennis] (1e-88)
Best NR hit (blastx)  PREDICTED: similar to protease m1 zinc metalloprotease [Nasonia vitripennis] (1e-91)
GeneOntology terms


  
GO:0004177 aminopeptidase activity
GO:0008270 zinc ion binding
GO:0008237 metallopeptidase activity
GO:0006508 proteolysis
InterPro families
  
IPR014782 Peptidase M1, membrane alanine aminopeptidase, N-terminal
IPR001930 Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase
Orthology groupMCL10093

Nucleotide sequence:

ATCACTTTCAGAGAGACAACCCTATTATATGATGAAGTTGAGGGTATTCCCCGTGAGAAA
CAAAACGTAGCTATCGACGTAGCTCATGAGTTAGCTCATCAATGGTTTGGCGATCTCGTC
ACTATGAAGTGGTGGACGGACCTGTGGCTCAACGAGGGTTTCGCCACATACATAGAGTAT
GTCGGCGTTGACCATATTGCACCTGAATGGGACATGTTTGAATCATTCACAAGGGACAAA
ATGAATCTTCTTAGAACAGATGCTCTGAAGAACACAGCTCCTGTATCGCAAAAGGTCATC
GACGCCTCAGAAATATCCCAGAAGTTTGATGAGATATCATATTCAAAGGGCGCCAACTTA
ATAAGAATGTTGAACCACACTATATCCAAGGAATTGTTCCACAAGGGATTGCTTATATAT
TTGAACCTTTGGAAATACCGGAACGCCGAGGAGAACGATCTATGGCAGGCGATGTCTCTA
GCTACTAAGGAGTCCCCGCGTCTGAAGGGCCTGTCTGTTGTCGATTTCATGAACACTTGG
ACTAAACAGCCGGGTTACCCTGTGGTCAGGGTCCTCCGAAACTATGAAAACGATTATGTT
ACCTTTGAGCAGAATCTCTTCACCAGCAATAAAAATAATAAGAAGGAACAAAAATGGCAA
ATACCCATAAGTTACAGTACTAACAGCAGCGACTGGAGTACCGAGGCGAAATTCTTTTTG
AACGACGACGCTATCACGACTCAAATTGATATCAACAGCTCGCAGGCGCTGTACGTTAAT
GTTGAAGCTATCGGATACTATCGGGTTAACTACGATCACAGGAACTGGGATCTGTTAAAT
AAGGCTCTGAAGAACGGCACAATCAAAAGCCCGATAGCGAAAGCCCAGCTTATAGATGAC
GCTTTCAATTTGGCCAAAACTAACCAATTGGAGTACAGCTACGCGCTCGGGCTAACCACG
TGCGTCATAGACGGAGAGGAGTCCAAGACTGTTTGGGACTTATTATTAAACAATATGGCG
TTTTTGAGTCACAACCTGAGAGCGACTTCCGGGTACATGTACTTCCAGGACTACATGAAG
ATAATACTGAAAAAACAACTGGAGCGTCTCAACTACGGTCTGAATAAACCCAAGGACGAC
AACGAAGCGTTCTTAATAGAGAACCTGGTGCTGTGGGAGTGTCTCGTGGAGTCGCCGCGG
TGTCTACAGTGGACCAGGGAACAATTTGAGACTTGGACCAGTAAACCAAACATGACTGAT
AATCCTATCCCGAGCTTCCTTCGGTCACTAGTCTACAACATGGCCATCAAAAACGGAGGT
AGACGGGAGTTTGAAATACTTTGGAACATCTTCTTAAACACCACCGACCCTAATATCAAG
AGCCTGATTATATCCAACTTGCCCAGCACCAAGGAGGAATCGTTGATAACTCTACTGCTC
GAGAAGAGTCTGTCGGAGATACCGACGCAGTACGCGATATCGGCTTGGAGCGTGGACGCG
CCAATCGGCACTAAGATAGCTCAGGACTTCCTCATAGACAACTTCGACAAGGTGTACAAG
AGATTCAACGAGATGGACTCCTTCATGTTCGCTGGAGTACTGAACGGAGCGTTCGGCTTC
ATCACTACCAACGACGAATTGAACAGGTTTAAAAAATTCGCTTTGGACCACAAATCTGAG
CTGCAGCCAATGTCTCACACGCTCCAGAAGATAGCTGACAGCGGAGCGGTCAGGATATCC
TGGATCAACACACACGCTAGGAACATTAACAACTGGTTCAAGACATATGTAGAAGAACAT
TCAACAAACAGCCAAACAACGGAGACGCCAAACGAAACCATCACAGACAGCAACGTCACT
TTAAGTTCTTAA

Protein sequence:

ITFRETTLLYDEVEGIPREKQNVAIDVAHELAHQWFGDLVTMKWWTDLWLNEGFATYIEY
VGVDHIAPEWDMFESFTRDKMNLLRTDALKNTAPVSQKVIDASEISQKFDEISYSKGANL
IRMLNHTISKELFHKGLLIYLNLWKYRNAEENDLWQAMSLATKESPRLKGLSVVDFMNTW
TKQPGYPVVRVLRNYENDYVTFEQNLFTSNKNNKKEQKWQIPISYSTNSSDWSTEAKFFL
NDDAITTQIDINSSQALYVNVEAIGYYRVNYDHRNWDLLNKALKNGTIKSPIAKAQLIDD
AFNLAKTNQLEYSYALGLTTCVIDGEESKTVWDLLLNNMAFLSHNLRATSGYMYFQDYMK
IILKKQLERLNYGLNKPKDDNEAFLIENLVLWECLVESPRCLQWTREQFETWTSKPNMTD
NPIPSFLRSLVYNMAIKNGGRREFEILWNIFLNTTDPNIKSLIISNLPSTKEESLITLLL
EKSLSEIPTQYAISAWSVDAPIGTKIAQDFLIDNFDKVYKRFNEMDSFMFAGVLNGAFGF
ITTNDELNRFKKFALDHKSELQPMSHTLQKIADSGAVRISWINTHARNINNWFKTYVEEH
STNSQTTETPNETITDSNVTLSS