DPGLEAN00922 in OGS1.0

New model in OGS2.0  DPOGS204619
Genomic Position  scaffold3224:- 4914-12881
CDS Length  1854
Paired RNAseq reads  1915
Single RNAseq reads  4862
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012217 (8e-86)
Best Drosophila hit  CG31217 (9e-54)
Best Human hit  low-density lipoprotein receptor-related protein 2 precursor (1e-18)
Best NR hit (blastp)  pattern recognition serine proteinase precursor [Manduca sexta] (0.0)
Best NR hit (blastx)  pattern recognition serine proteinase precursor [Manduca sexta] (0.0)
GeneOntology terms
GO:0006508 proteolysis
GO:0004252 serine-type endopeptidase activity
GO:0045087 innate immune response
InterPro families
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR016060 Complement control module
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR000436 Sushi/SCR/CCP
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site
Orthology group  MCL10336

Nucleotide sequence:

ATGGCTTGCAACGGACTATCGGACCCGCTCTCCGATTTGATGATCCGCAGACCCAAACGT
CAGACGCAAAATTGTCGCAAGAACCAGTGGCAGTGTCGTGACGGCACCTGCATAGGGTTC
GACGGTAAATGTGACGGTGTGGTCGACTGTCCCGACTTCAGCGACGAGACCTTCGCGCTG
TGCAGGGACATGCAATGCCAGAGCAATTGGTTCCGCTGTACTTACGGCGCCTGCGTCGAC
GGCAGCGCCCCTTGTAATGGTGTGCAAGAGTGCGCTGATAACTCCGACGAGTTGCTGCCT
AGGTGCCGCAATCAAACAATTGGTTCCAGGGGTAAGCACACGTGCGACAATGGTCAGGTG
ATATCCTCGGTGGACATATGCGATGGGAAGAAGGACTGCGCTGATGGCTCTGACGAGACC
CTCGCCACCTGCGCCGGGAACAGCTGTCCGTCATACGTGTTCCAATGTGCGTATGGAGCC
TGTGTGGACCAGAACGCGAAGTGCAACAAGGTGGAAGAGTGTGCTGATGGTTCTGACGAA
ACAGACGAGCTCTGCAACAGGCTGGCGCCGGGTCAGCCGGTGACTCCAGCCACGAGACCA
CCACCTCAGGGGGGTAATTGTCTGTTGCCTCCATACCCTCAGTATGGGTCGTACAAGGTC
AGACAGTACCCCAACGCGGTCCCCGGCCAGAGGTATCCCAACGTGAGGCTGGACGTCACC
TGTAACCCTGGCTTCCAGACTGAAAACAATAACAGCATCTTCTGCGATAACGGAGAGTGG
TCAGGACCTATGCCAGCGTGTCTCCGTTTCTGCAGGCTTAACAAACACCCGAGCGTGGAG
TACCGCTGTCTGTTGTCTGGCAACTCGGTGACAGGGTCCAGAGAGTGTGGCTCATTGGAG
CCGTCTGGGACCGTCGTCACCCCCATCTGCCGCTCCCCCAATTACTACTCCTCGGGGGTA
ATGTCCAACATGCACTGCGTTGAAGGCAGTTGGGACTATATAGCTGTGTGCAAACCAGAG
TGCGGTACAATAACTCCTGAGGGTATCCAGCTGGTGATCGGCGGGCGGTCTGCCAAGCGC
GGGGAACTCCCGTGGCACGCGGGGATTTACAGCAAATTATTCACACCTTACATGCAGATA
TGTGGCGGGTCGCTCATCAGTACAACCACTATTATATCCGATTCTCGCGAGTTTTATTCA
TTTACCGCACATTGTTTCTGGAGCGACACCAAGAAGCTGCTGCCCGCGTCCGAATACGCG
GTGGCTGTTGGGAAGCTGTACCGACCTTACAACGAAAAACACGACGCTGACGCGGAGAAA
TCTGATGTGGCAGATATTATAATTCCGTCCCGCTTTCGAGGGTCTGGTGCCAACTTCCAG
GATGACATCGCGCTGGTTTTGGTCGTGACGCCCTTCATATACCAGGTCTTCATTAGACCT
GTCTGTCTGGACTTCGACGTCAACTTCGACAGAACCCAGCTCTCGGAAGGGAATATGGGC
AAGGTAGCCGGCTGGGGTCTGACTGACAAAAACGGTAAAGCGTCCCAAGTGCTGAAGGTG
GTAGATCTTCCTTACGTCAAAATTGAAGACTGCTACGCCATGTCCCCGCCGACGTTCCGC
GCTTACATCACAAGCGACAAGATCTGCGCCGGTTACACTAACGGCACGACGCTCTGCCAG
GGCGACAGCGGCGGCGGCCTGGCGTTCCCCGCCTACGAACTCAACACCCAGAGGTACTAC
CTGCGAGGCATCGTGTCCACAGCTCCCAGGAACGACGATCTTTGCAACGCCCACACCCTC
ACCACGTTTACGGCTGTATCGAAACACGAGCATTTCATCAAACAGTACCTCTAG

Protein sequence:

MACNGLSDPLSDLMIRRPKRQTQNCRKNQWQCRDGTCIGFDGKCDGVVDCPDFSDETFAL
CRDMQCQSNWFRCTYGACVDGSAPCNGVQECADNSDELLPRCRNQTIGSRGKHTCDNGQV
ISSVDICDGKKDCADGSDETLATCAGNSCPSYVFQCAYGACVDQNAKCNKVEECADGSDE
TDELCNRLAPGQPVTPATRPPPQGGNCLLPPYPQYGSYKVRQYPNAVPGQRYPNVRLDVT
CNPGFQTENNNSIFCDNGEWSGPMPACLRFCRLNKHPSVEYRCLLSGNSVTGSRECGSLE
PSGTVVTPICRSPNYYSSGVMSNMHCVEGSWDYIAVCKPECGTITPEGIQLVIGGRSAKR
GELPWHAGIYSKLFTPYMQICGGSLISTTTIISDSREFYSFTAHCFWSDTKKLLPASEYA
VAVGKLYRPYNEKHDADAEKSDVADIIIPSRFRGSGANFQDDIALVLVVTPFIYQVFIRP
VCLDFDVNFDRTQLSEGNMGKVAGWGLTDKNGKASQVLKVVDLPYVKIEDCYAMSPPTFR
AYITSDKICAGYTNGTTLCQGDSGGGLAFPAYELNTQRYYLRGIVSTAPRNDDLCNAHTL
TTFTAVSKHEHFIKQYL
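
Consistency check (illustrative sketch):

The CDS length, nucleotide sequence and protein sequence in this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model; the function names `translate` and `expected_protein_length` are hypothetical helpers introduced here and are not part of the database. Only the first and last lines of the nucleotide sequence are used, to keep the example short.

```python
# Sketch only: assumes the standard genetic code (NCBI table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Codons enumerated in TCAG order line up with the amino-acid string above.
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence, copied from the record.
first_line = "ATGGCTTGCAACGGACTATCGGACCCGCTCTCCGATTTGATGATCCGCAGACCCAAACGT"
last_line = "ACCACGTTTACGGCTGTATCGAAACACGAGCATTTCATCAAACAGTACCTCTAG"

print(translate(first_line))          # compare with the start of the protein sequence
print(translate(last_line))           # compare with the final protein line, plus '*'
print(expected_protein_length(1854))  # residues implied by the CDS length
```

The last line stays in frame because the 30 preceding full lines hold 1800 bases, a multiple of 3.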