New model in OGS2.0 | DPOGS204619  |
---|---|
Genomic Position | scaffold3224:4914-12881 (- strand) |
CDS Length | 1854 |
Paired RNAseq reads   | 1915 |
Single RNAseq reads   | 4862 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012217 (8e-86) |
Best Drosophila hit   | CG31217 (9e-54) |
Best Human hit | low-density lipoprotein receptor-related protein 2 precursor (1e-18) |
Best NR hit (blastp)   | pattern recognition serine proteinase precursor [Manduca sexta] (0.0) |
Best NR hit (blastx)   | pattern recognition serine proteinase precursor [Manduca sexta] (0.0) |
GeneOntology terms    | GO:0006508 proteolysis; GO:0004252 serine-type endopeptidase activity; GO:0045087 innate immune response |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like; IPR016060 Complement control module; IPR002172 Low-density lipoprotein (LDL) receptor class A repeat; IPR000436 Sushi/SCR/CCP; IPR001254 Peptidase S1/S6, chymotrypsin/Hap; IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site |
Orthology group | MCL10336 |

Nucleotide sequence:
ATGGCTTGCAACGGACTATCGGACCCGCTCTCCGATTTGATGATCCGCAGACCCAAACGT
CAGACGCAAAATTGTCGCAAGAACCAGTGGCAGTGTCGTGACGGCACCTGCATAGGGTTC
GACGGTAAATGTGACGGTGTGGTCGACTGTCCCGACTTCAGCGACGAGACCTTCGCGCTG
TGCAGGGACATGCAATGCCAGAGCAATTGGTTCCGCTGTACTTACGGCGCCTGCGTCGAC
GGCAGCGCCCCTTGTAATGGTGTGCAAGAGTGCGCTGATAACTCCGACGAGTTGCTGCCT
AGGTGCCGCAATCAAACAATTGGTTCCAGGGGTAAGCACACGTGCGACAATGGTCAGGTG
ATATCCTCGGTGGACATATGCGATGGGAAGAAGGACTGCGCTGATGGCTCTGACGAGACC
CTCGCCACCTGCGCCGGGAACAGCTGTCCGTCATACGTGTTCCAATGTGCGTATGGAGCC
TGTGTGGACCAGAACGCGAAGTGCAACAAGGTGGAAGAGTGTGCTGATGGTTCTGACGAA
ACAGACGAGCTCTGCAACAGGCTGGCGCCGGGTCAGCCGGTGACTCCAGCCACGAGACCA
CCACCTCAGGGGGGTAATTGTCTGTTGCCTCCATACCCTCAGTATGGGTCGTACAAGGTC
AGACAGTACCCCAACGCGGTCCCCGGCCAGAGGTATCCCAACGTGAGGCTGGACGTCACC
TGTAACCCTGGCTTCCAGACTGAAAACAATAACAGCATCTTCTGCGATAACGGAGAGTGG
TCAGGACCTATGCCAGCGTGTCTCCGTTTCTGCAGGCTTAACAAACACCCGAGCGTGGAG
TACCGCTGTCTGTTGTCTGGCAACTCGGTGACAGGGTCCAGAGAGTGTGGCTCATTGGAG
CCGTCTGGGACCGTCGTCACCCCCATCTGCCGCTCCCCCAATTACTACTCCTCGGGGGTA
ATGTCCAACATGCACTGCGTTGAAGGCAGTTGGGACTATATAGCTGTGTGCAAACCAGAG
TGCGGTACAATAACTCCTGAGGGTATCCAGCTGGTGATCGGCGGGCGGTCTGCCAAGCGC
GGGGAACTCCCGTGGCACGCGGGGATTTACAGCAAATTATTCACACCTTACATGCAGATA
TGTGGCGGGTCGCTCATCAGTACAACCACTATTATATCCGATTCTCGCGAGTTTTATTCA
TTTACCGCACATTGTTTCTGGAGCGACACCAAGAAGCTGCTGCCCGCGTCCGAATACGCG
GTGGCTGTTGGGAAGCTGTACCGACCTTACAACGAAAAACACGACGCTGACGCGGAGAAA
TCTGATGTGGCAGATATTATAATTCCGTCCCGCTTTCGAGGGTCTGGTGCCAACTTCCAG
GATGACATCGCGCTGGTTTTGGTCGTGACGCCCTTCATATACCAGGTCTTCATTAGACCT
GTCTGTCTGGACTTCGACGTCAACTTCGACAGAACCCAGCTCTCGGAAGGGAATATGGGC
AAGGTAGCCGGCTGGGGTCTGACTGACAAAAACGGTAAAGCGTCCCAAGTGCTGAAGGTG
GTAGATCTTCCTTACGTCAAAATTGAAGACTGCTACGCCATGTCCCCGCCGACGTTCCGC
GCTTACATCACAAGCGACAAGATCTGCGCCGGTTACACTAACGGCACGACGCTCTGCCAG
GGCGACAGCGGCGGCGGCCTGGCGTTCCCCGCCTACGAACTCAACACCCAGAGGTACTAC
CTGCGAGGCATCGTGTCCACAGCTCCCAGGAACGACGATCTTTGCAACGCCCACACCCTC
ACCACGTTTACGGCTGTATCGAAACACGAGCATTTCATCAAACAGTACCTCTAG
Protein sequence:
MACNGLSDPLSDLMIRRPKRQTQNCRKNQWQCRDGTCIGFDGKCDGVVDCPDFSDETFAL
CRDMQCQSNWFRCTYGACVDGSAPCNGVQECADNSDELLPRCRNQTIGSRGKHTCDNGQV
ISSVDICDGKKDCADGSDETLATCAGNSCPSYVFQCAYGACVDQNAKCNKVEECADGSDE
TDELCNRLAPGQPVTPATRPPPQGGNCLLPPYPQYGSYKVRQYPNAVPGQRYPNVRLDVT
CNPGFQTENNNSIFCDNGEWSGPMPACLRFCRLNKHPSVEYRCLLSGNSVTGSRECGSLE
PSGTVVTPICRSPNYYSSGVMSNMHCVEGSWDYIAVCKPECGTITPEGIQLVIGGRSAKR
GELPWHAGIYSKLFTPYMQICGGSLISTTTIISDSREFYSFTAHCFWSDTKKLLPASEYA
VAVGKLYRPYNEKHDADAEKSDVADIIIPSRFRGSGANFQDDIALVLVVTPFIYQVFIRP
VCLDFDVNFDRTQLSEGNMGKVAGWGLTDKNGKASQVLKVVDLPYVKIEDCYAMSPPTFR
AYITSDKICAGYTNGTTLCQGDSGGGLAFPAYELNTQRYYLRGIVSTAPRNDDLCNAHTL
TTFTAVSKHEHFIKQYL