DPGLEAN11180 in OGS1.0

New model in OGS2.0DPOGS201589 
Genomic Positionscaffold609:+ 131-9459
See gene structure
CDS Length2010
Paired RNAseq reads  645
Single RNAseq reads  1499
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012217 (2e-63)
Best Drosophila hit  CG31217 (3e-59)
Best Human hitsortilin-related receptor preproprotein (2e-19)
Best NR hit (blastp)  pattern recognition serine proteinase precursor [Manduca sexta] (5e-150)
Best NR hit (blastx)  pattern recognition serine proteinase precursor [Manduca sexta] (7e-155)
GeneOntology terms

  
GO:0006508 proteolysis
GO:0004252 serine-type endopeptidase activity
GO:0045087 innate immune response
InterPro families





  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
IPR016060 Complement control module
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR000436 Sushi/SCR/CCP
Orthology groupMCL10336

Nucleotide sequence:

ATGGAACCTGTATCAGCGCTGACAAGAAGTGTGATGGTGTGGGAGACTGCCCCGACGGTT
CAGACGAGAGGTTCCACATGTGCAGGAACGTGCCCATATTACCACTTTCGATGCACGTAC
GGAGCTTGCGTCGATGGTACAGCTTCCTGCAATGGTGTCAAGGAGTGTATGGACAACTCG
GATGAACTGCAACCGGCCTGCCAGAAGAAAAGCAACATCTTTGGTGAAAAATTTATTTGC
AAGAACGGTGAAATGATAGATGTTTATCAGATCTGTGATGGAACCACTGAGTGCAGTGAT
AATAGTGATGAGATCTTGGATAAGTCGATTTGGATAAAAAAATCGTTGTGTTGTAGCGGG
AATGGCTGCGAGTGCCCATATTACCACTTTCGATGCACGTACGGAGCTTGCGTCGATGGT
ACAGCTTCCTGCAATGGTGTCAAGGAGTGTATGGACAACTCGGATGAACTGCAACCGGCC
TGCCAGAAGAAAAGCAACATCTTTGGTGAAAAATTTATTTGCAAGAACGGTGAAATGATA
GATGTTTATCAGATCTGTGATGGAACCACTGAGTGCAGTGATAATAGTGATGAGATCTTG
GAGACCTGCGCGAGCACTGTTTGCCCATCACATCTGTTCCAATGTGCGTATGGCGCATGC
GTTGATGCTGGGGCCGAGTGCAATAATTTGCAGGAATGCGCTGATAATTCTGATGAATGG
GATCTTCTGTGCAATAAGACGTCTACCACAACAACAACCACCACGGAGATCACGGAGACA
AGTCGGTCGTCCTGCGTCTTGCCAGATCATCCAAAGTTCGGAATATACAGCCTAGCTGAC
GGTAGCAAATATGTTCCAAGGAGTGTTCAAGAAAATTTGGTGGTTCTGAGCCTCACATGC
TACCCAGGATATAAGAGTGTAGGGGAAATAGCCACTTACTGTCATGAAGGGTCATGGTCT
ACCGATTTACCTTATTGCGCCCGGACGTGTAAACTGGATAAATCACCAAGTATTGAGTAC
AGATGTATTACTAATGACGAAGGGACCAGGGCGTGCGAGGATTACGAAGTGGAGGGTACC
ATTGTCCAGCCTCAGTGCAGGGAGCCCAACTACTACAGTCTCAGCGACCTGTACTACATG
GTCTGCCGTGATGGACAATGGAATTACCAACCGAAGTGTGAAGCTGAATGTGGTACATTG
ACCCCACGCGCGACCCCTTTGGTGTTGGGCGGGCGGACTGCGGACGTTGGCGAAGTCCCC
TGGCACGCGGGGATCTACAGCAAGCTGACGGAGCCTCCAATACAGATATGCGGGGCTTCC
TTGGTCAGCGACACCGTACTGGTCTCTGCCGCTCATTGCTTTTGGTTCAACGAAAATACT
GAGCCAGCAGAAAACTATGCAGTGGCGGTTGGCAAGCTGCATAGAGACTGGGATCATCAT
CTAGACATGGATTATCAGCAGACTTCTGATGTGCAATCCATCTACGTCTCCCATTACTAT
CGAGGATCGTCCATGAACTACCAGCACGACCTGGCCATCGTAATCGTCACCCAGCCCTTC
TCCTACCGTCCGTACATAAGACCTATATGTGTGCATTTCCCTCATGATGCGACAGAAATG
GCGATCAAAAACGACGACCTCGGGAAGGTAGCTGGTTGGGGTCTCACGACGGTCCACACT
GGCTCCGAGTCCCCCACGCTAAAGGTCCTGGACGTGCCTTTTGTTGATTTTGACACCTGC
CTCCAGAACACACCAGAATACTACCGGGAATTCTTCAGCAGCGATAAGATCTGTGGTGGC
TACGCTAATGGTACAAGTCTCTGTAAGGGTGACAGCGGTGGTGGGTACGCCTTCCCCTTC
AAGCTCAACGGCCGCACCAGGTACTACCTCCGCGGTGTCGTGTCCACAAGCCCACCGCTG
CCTCTAGGATTGTCATGCAACATATACACGTACACGAGCTTCACGGATATCATGCAACAC
AAAAGAATCATCATGACGCATATGCATTGA

Protein sequence:

MEPVSALTRSVMVWETAPTVQTRGSTCAGTCPYYHFRCTYGACVDGTASCNGVKECMDNS
DELQPACQKKSNIFGEKFICKNGEMIDVYQICDGTTECSDNSDEILDKSIWIKKSLCCSG
NGCECPYYHFRCTYGACVDGTASCNGVKECMDNSDELQPACQKKSNIFGEKFICKNGEMI
DVYQICDGTTECSDNSDEILETCASTVCPSHLFQCAYGACVDAGAECNNLQECADNSDEW
DLLCNKTSTTTTTTTEITETSRSSCVLPDHPKFGIYSLADGSKYVPRSVQENLVVLSLTC
YPGYKSVGEIATYCHEGSWSTDLPYCARTCKLDKSPSIEYRCITNDEGTRACEDYEVEGT
IVQPQCREPNYYSLSDLYYMVCRDGQWNYQPKCEAECGTLTPRATPLVLGGRTADVGEVP
WHAGIYSKLTEPPIQICGASLVSDTVLVSAAHCFWFNENTEPAENYAVAVGKLHRDWDHH
LDMDYQQTSDVQSIYVSHYYRGSSMNYQHDLAIVIVTQPFSYRPYIRPICVHFPHDATEM
AIKNDDLGKVAGWGLTTVHTGSESPTLKVLDVPFVDFDTCLQNTPEYYREFFSSDKICGG
YANGTSLCKGDSGGGYAFPFKLNGRTRYYLRGVVSTSPPLPLGLSCNIYTYTSFTDIMQH
KRIIMTHMH