New model in OGS2.0 | DPOGS201589  |
---|---|
Genomic Position | scaffold609:+ 131-9459 |
See gene structure | |
CDS Length | 2010 |
Paired RNAseq reads   | 645 |
Single RNAseq reads   | 1499 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012217 (2e-63) |
Best Drosophila hit   | CG31217 (3e-59) |
Best Human hit | sortilin-related receptor preproprotein (2e-19) |
Best NR hit (blastp)   | pattern recognition serine proteinase precursor [Manduca sexta] (5e-150) |
Best NR hit (blastx)   | pattern recognition serine proteinase precursor [Manduca sexta] (7e-155) |
GeneOntology terms    | GO:0006508 proteolysis GO:0004252 serine-type endopeptidase activity GO:0045087 innate immune response |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR002172 Low-density lipoprotein (LDL) receptor class A repeat IPR016060 Complement control module IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR000436 Sushi/SCR/CCP |
Orthology group | MCL10336 |
Nucleotide sequence:
ATGGAACCTGTATCAGCGCTGACAAGAAGTGTGATGGTGTGGGAGACTGCCCCGACGGTT
CAGACGAGAGGTTCCACATGTGCAGGAACGTGCCCATATTACCACTTTCGATGCACGTAC
GGAGCTTGCGTCGATGGTACAGCTTCCTGCAATGGTGTCAAGGAGTGTATGGACAACTCG
GATGAACTGCAACCGGCCTGCCAGAAGAAAAGCAACATCTTTGGTGAAAAATTTATTTGC
AAGAACGGTGAAATGATAGATGTTTATCAGATCTGTGATGGAACCACTGAGTGCAGTGAT
AATAGTGATGAGATCTTGGATAAGTCGATTTGGATAAAAAAATCGTTGTGTTGTAGCGGG
AATGGCTGCGAGTGCCCATATTACCACTTTCGATGCACGTACGGAGCTTGCGTCGATGGT
ACAGCTTCCTGCAATGGTGTCAAGGAGTGTATGGACAACTCGGATGAACTGCAACCGGCC
TGCCAGAAGAAAAGCAACATCTTTGGTGAAAAATTTATTTGCAAGAACGGTGAAATGATA
GATGTTTATCAGATCTGTGATGGAACCACTGAGTGCAGTGATAATAGTGATGAGATCTTG
GAGACCTGCGCGAGCACTGTTTGCCCATCACATCTGTTCCAATGTGCGTATGGCGCATGC
GTTGATGCTGGGGCCGAGTGCAATAATTTGCAGGAATGCGCTGATAATTCTGATGAATGG
GATCTTCTGTGCAATAAGACGTCTACCACAACAACAACCACCACGGAGATCACGGAGACA
AGTCGGTCGTCCTGCGTCTTGCCAGATCATCCAAAGTTCGGAATATACAGCCTAGCTGAC
GGTAGCAAATATGTTCCAAGGAGTGTTCAAGAAAATTTGGTGGTTCTGAGCCTCACATGC
TACCCAGGATATAAGAGTGTAGGGGAAATAGCCACTTACTGTCATGAAGGGTCATGGTCT
ACCGATTTACCTTATTGCGCCCGGACGTGTAAACTGGATAAATCACCAAGTATTGAGTAC
AGATGTATTACTAATGACGAAGGGACCAGGGCGTGCGAGGATTACGAAGTGGAGGGTACC
ATTGTCCAGCCTCAGTGCAGGGAGCCCAACTACTACAGTCTCAGCGACCTGTACTACATG
GTCTGCCGTGATGGACAATGGAATTACCAACCGAAGTGTGAAGCTGAATGTGGTACATTG
ACCCCACGCGCGACCCCTTTGGTGTTGGGCGGGCGGACTGCGGACGTTGGCGAAGTCCCC
TGGCACGCGGGGATCTACAGCAAGCTGACGGAGCCTCCAATACAGATATGCGGGGCTTCC
TTGGTCAGCGACACCGTACTGGTCTCTGCCGCTCATTGCTTTTGGTTCAACGAAAATACT
GAGCCAGCAGAAAACTATGCAGTGGCGGTTGGCAAGCTGCATAGAGACTGGGATCATCAT
CTAGACATGGATTATCAGCAGACTTCTGATGTGCAATCCATCTACGTCTCCCATTACTAT
CGAGGATCGTCCATGAACTACCAGCACGACCTGGCCATCGTAATCGTCACCCAGCCCTTC
TCCTACCGTCCGTACATAAGACCTATATGTGTGCATTTCCCTCATGATGCGACAGAAATG
GCGATCAAAAACGACGACCTCGGGAAGGTAGCTGGTTGGGGTCTCACGACGGTCCACACT
GGCTCCGAGTCCCCCACGCTAAAGGTCCTGGACGTGCCTTTTGTTGATTTTGACACCTGC
CTCCAGAACACACCAGAATACTACCGGGAATTCTTCAGCAGCGATAAGATCTGTGGTGGC
TACGCTAATGGTACAAGTCTCTGTAAGGGTGACAGCGGTGGTGGGTACGCCTTCCCCTTC
AAGCTCAACGGCCGCACCAGGTACTACCTCCGCGGTGTCGTGTCCACAAGCCCACCGCTG
CCTCTAGGATTGTCATGCAACATATACACGTACACGAGCTTCACGGATATCATGCAACAC
AAAAGAATCATCATGACGCATATGCATTGA
Protein sequence:
MEPVSALTRSVMVWETAPTVQTRGSTCAGTCPYYHFRCTYGACVDGTASCNGVKECMDNS
DELQPACQKKSNIFGEKFICKNGEMIDVYQICDGTTECSDNSDEILDKSIWIKKSLCCSG
NGCECPYYHFRCTYGACVDGTASCNGVKECMDNSDELQPACQKKSNIFGEKFICKNGEMI
DVYQICDGTTECSDNSDEILETCASTVCPSHLFQCAYGACVDAGAECNNLQECADNSDEW
DLLCNKTSTTTTTTTEITETSRSSCVLPDHPKFGIYSLADGSKYVPRSVQENLVVLSLTC
YPGYKSVGEIATYCHEGSWSTDLPYCARTCKLDKSPSIEYRCITNDEGTRACEDYEVEGT
IVQPQCREPNYYSLSDLYYMVCRDGQWNYQPKCEAECGTLTPRATPLVLGGRTADVGEVP
WHAGIYSKLTEPPIQICGASLVSDTVLVSAAHCFWFNENTEPAENYAVAVGKLHRDWDHH
LDMDYQQTSDVQSIYVSHYYRGSSMNYQHDLAIVIVTQPFSYRPYIRPICVHFPHDATEM
AIKNDDLGKVAGWGLTTVHTGSESPTLKVLDVPFVDFDTCLQNTPEYYREFFSSDKICGG
YANGTSLCKGDSGGGYAFPFKLNGRTRYYLRGVVSTSPPLPLGLSCNIYTYTSFTDIMQH
KRIIMTHMH