DPGLEAN02561 in OGS1.0

New model in OGS2.0  DPOGS200905
Genomic Position  scaffold5:+ 111320-114691
CDS Length  1293
Paired RNAseq reads  271
Single RNAseq reads  813
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000673 (1e-43)
Best Drosophila hit  peptidoglycan recognition protein LF (3e-39)
Best Human hit  peptidoglycan recognition protein 3 precursor (1e-28)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC002790 [Tribolium castaneum] (2e-60)
Best NR hit (blastx)  PREDICTED: similar to AGAP005203-PC [Tribolium castaneum] (6e-47)
Gene Ontology terms
GO:0005887 integral to plasma membrane
GO:0006952 defense response
GO:0042834 peptidoglycan binding
GO:0045087 innate immune response
GO:0009253 peptidoglycan catabolic process
GO:0005515 protein binding
GO:0008745 N-acetylmuramoyl-L-alanine amidase activity
InterPro families
IPR002502 N-acetylmuramoyl-L-alanine amidase domain
IPR006619 Peptidoglycan recognition protein family domain, metazoa/bacteria
IPR015510 Peptidoglycan recognition protein
Orthology group  MCL10276
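
The Genomic Position field above packs the scaffold, strand, and coordinates into one string. The following is a minimal Python sketch of how that field could be split and the locus span compared with the CDS length. The helper name `parse_position` is hypothetical, and the treatment of the coordinates as inclusive is an assumption; this record does not state its coordinate convention.

```python
import re

def parse_position(field):
    """Split a 'scaffold:strand start-end' field into its parts (hypothetical helper)."""
    m = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", field.strip())
    if m is None:
        raise ValueError(f"unrecognized position field: {field!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("scaffold5:+ 111320-114691")

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end - start + 1
cds_length = 1293  # value of the CDS Length field in this record

print(scaffold, strand, span, span - cds_length)
```

Under that assumption the locus spans more bases than the CDS itself, which would be consistent with non-coding sequence such as introns inside the gene model, although this record does not list exon boundaries.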

Nucleotide sequence:

ATGGCTACAAAGGCAAGTAAATCCAATACAAATGGGATGACAATAATAGGAAATATCGAT
GATATGCAAATTATAAAAGAAATCAGTGATGAAGATAGTATTAATAGACCGAAAAAATCA
TGTTCTTTGAACACTCCTTTAAATCTTCCCAGTAATTTAGTTCCGAACACTGGAGCACCT
GTTTATGAAAGCGTCTCGGTTACAAATTCGGAAAATGTTCAGTTTGGTAACAATACATAT
TTTAATGGTCCAGTAATCATAAAACAAATAGTCCAACCTAAATCTGGCGTTGATAATGTT
TCCTATACTAAAACAGAAGAAACAGAAAACGAGGATTCCCAGAAGCCTCCTTGTGCCACA
TTGGATGAAACTTATATTAAAAAGAAAGAGATATTATTATGGCACAAGATAACATTCTCC
GCAGTATGTATTACAGTTGTTGGCTGTGTTGTGGCTCTCGTGCTTGTTTTACAAAACAAA
AGTGATGATCAAATTCATACAAATGCACAGTCAGATTATGACAAAGAAGACAAACATACT
CGTTTGGTGGTGGGAGTTTGTGATGAAAATTTTCAACTTGTTATAATATCGGTACTCCTG
TTATCATTAACATTTTTGTTAGTTTCTTACTCATTACTAACACCGAGTGTTAAGAAAATA
AATAATGGTGTGGACATAAACCATCATTTGAACTGTGTCAATAAAAATTGCATGTTCTGC
AAGAAGACAGCAAATTCATTGCTGATAGCACCAAATCATTTGCGGATAGTGTCCAGATCG
GACTGGTTGTCACAGACTGTTGAAGGCGAGGTGAACATGCTTCGTCAACCGGTACCATGG
GTCATTATATCTCACACTGCCACCGAAAGCTGCTATACACAGAGTGAATGTGTTCTACGT
GTAAGATTGATACAAATGTATCATGTTGAAGCTCGAAAGTGGTCAGATATTGGGTACAAC
TTTCTGGTGGGAGGAGATGGTTCGGTGTACCATGGTCGTGGCTGGAACATAGAGGGGGCT
CATACAAAAGGTTACAACAAATATAGTATCGGAATAGCTTTTATTGGTACATTCAACACC
ACTCCGCCTCCAAAACAGCAAGTTGAAGCCTGCGAGAAACTTCTCAAGCTGGGAGTTGAA
TTGGGTAAACTCGCCAAAGACTACAAACTGTTTGCCCACAGGCAATTCATGTCAACTCTC
AGTCCAGGTGATACAGTTTATGATATCATCAAGGAATGGCCACATTTTGTCAACAATTTG
ACAGACACCCAGTCATTGATACCCAATTATTAA

Protein sequence:

MATKASKSNTNGMTIIGNIDDMQIIKEISDEDSINRPKKSCSLNTPLNLPSNLVPNTGAP
VYESVSVTNSENVQFGNNTYFNGPVIIKQIVQPKSGVDNVSYTKTEETENEDSQKPPCAT
LDETYIKKKEILLWHKITFSAVCITVVGCVVALVLVLQNKSDDQIHTNAQSDYDKEDKHT
RLVVGVCDENFQLVIISVLLLSLTFLLVSYSLLTPSVKKINNGVDINHHLNCVNKNCMFC
KKTANSLLIAPNHLRIVSRSDWLSQTVEGEVNMLRQPVPWVIISHTATESCYTQSECVLR
VRLIQMYHVEARKWSDIGYNFLVGGDGSVYHGRGWNIEGAHTKGYNKYSIGIAFIGTFNT
TPPPKQQVEACEKLLKLGVELGKLAKDYKLFAHRQFMSTLSPGDTVYDIIKEWPHFVNNL
TDTQSLIPNY
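
The protein sequence above can be checked against the nucleotide sequence by translating the CDS codon by codon. The following is a minimal Python sketch that does this for the first and last lines of the nucleotide block. It assumes the standard genetic code, which this record does not state explicitly, and the function name `translate` is chosen here for illustration only.

```python
from itertools import product

# Standard genetic code (an assumption), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First and last lines of the nucleotide sequence in this record.
# Every full line holds 60 bases, so each line starts in frame.
first_line = "ATGGCTACAAAGGCAAGTAAATCCAATACAAATGGGATGACAATAATAGGAAATATCGAT"
last_line = "ACAGACACCCAGTCATTGATACCCAATTATTAA"

print(translate(first_line))  # MATKASKSNTNGMTIIGNID
print(translate(last_line))   # TDTQSLIPNY*
```

Both results agree with the start and end of the listed protein. The trailing `*` is the TAA stop codon, which is counted in the 1293-base CDS length but not shown in the 430-residue protein sequence.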