New model in OGS2.0 | DPOGS209745  |
---|---|
Genomic Position | scaffold744:+ 58077-66483 |
See gene structure | |
CDS Length | 2820 |
Paired RNAseq reads   | 1055 |
Single RNAseq reads   | 2772 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008958 (0.0) |
Best Drosophila hit   | Reversion-inducing-cysteine-rich protein with kazal motifs (1e-128) |
Best Human hit | reversion-inducing cysteine-rich protein with Kazal motifs precursor (2e-112) |
Best NR hit (blastp)   | serine protease inhibitor [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to serine protease inhibitor [Tribolium castaneum] (0.0) |
GeneOntology terms   | GO:0004867 serine-type endopeptidase inhibitor activity |
InterPro families    | IPR002350 Proteinase inhibitor I1, Kazal IPR011497 Protease inhibitor, Kazal-type |
Orthology group | MCL15749 |
Nucleotide sequence:
ATGTCGCTGGTGGAGATGGCTTCCGATTCTGCAATGCGGGAAGAGAGAATAGAAAACATT
TACAAATTCTGCGCCCCTCATTTGATAGAGTTCTGGATATGCATGAATCAGACAATACAA
GAGGTGGTATCACGGTCTGGTTGGTGGGGGCGCGCGTGTTGTGCTTTAGGTCATTCGGCC
AAGTGTCGTAGAGCTTGCGCCACAGCCTCTGATGCGAGTGCTCTGTCTGAACCGTGCAGA
AGATCGGATGAAATAAACTTCTTCGACTGCGTGCAGAAGCAGCAAGAGGGACAGTGGTGC
TGTTCTCAAACACGGTCAATAAGCTGTCACGAAGCGTGTCAGAAAGCTGTGTGGAGGGTC
GGGCAGACCAGAGCTGACAGTGGCGCCAGGGAGAAGGCCGCTGAGTTGTGCGAGCAGTCA
CCAGCGTTGCTGCACTGCTTCCGAGATCTGACAGCATCCACTGTCCACACCGACACTTCT
AAATATTTACCATGCTGCCACGAATCTCCAAGTCAAGAGTGTCGTAGTACCTGCGAAACA
GTGCTCAGAAGAACGGGGGAGTCCCAGGAAATCGCGGAGGCGTTGTCCCAGGAATGTGGT
GCTCCCGCGATCCACGATGGTATGTGGCAGTGTTTCCTAAGAAAAGACGCGCCTCCAGAA
ACCAAAGACGTTATTCCTCATGATGTAGCAAAACTGCATTGTTGTCAGAAGGGCGCTACG
ATTAATTGTAGAAGGCTATGTTTCTATACATTTAACAACGGCTGGCATTTAAATTGGCAG
AAATTCTATTCTGAATGCCTCGGAGATCCACAGGAAACGGAAATGGCGGAATGCATAGAA
GAAGTCGAGGCTCCGTGTTCCTTGGGCTGCGCCGGCCTTACATACTGTAGTCAACTAAAC
AATCGCCCGACCAGTTTGTTCCGATCGTGTTCAGCTCAGGCTGATTTGGACGCACATTTG
GCCGTCGCCGAGCAAAAGGCTAGTGGGGACGTCACTGTGGCAGGGTTACGTTTACCGTTG
AAGAATTCTTCCCAGTGCACTACAGATATTTGGAAGAGCGTCGCTTGTGCTCTTCATGTG
AAGCCCTGTACACTAAAGGGTCACAGTAGCCTGTTGTGTGCTGAGGAGTGTCGTCGCCTG
GTGTCCTCATGTGTGGAGTGGTCTCGAGCCCCGCTGTCAGCCGCCGCGCTCTGTGCTAGA
CTAGCGCCAGCTCGCCCTGACGCTCCATGTGTGTCACTGGCCGAGTTCCTTACAGCCAGT
CCCGAACCGCCTTTACTATCAGCTACGGAAATGGTCACTTCCCCATGCGCTGGTTCTCCC
TGTAACAGTTCCCAAGTGTGTGTCATCAACAGAAGTTGTATTCGAGGAGGCTCTTGCTCC
AAGTATACATGCATTGACGGATGTCCGCTAGGTGATGGTAGCCCATATGTGGTTCCCGTC
GGTTCCTGGGTGAGGGTGCCCATGCCCTGTGCTGCACAGAAGGTCTGCATCAAGATTTGC
CGCTGCGGTAACAAGGGGCTGTCAGACTGCCAGCCGTTACCCAGCGTCGCCCTTGATAAT
TGTCGATTGCATGATAAAGTGGTCAAGCATGGTGACAAGTACTACATGGAGTGCAACCCA
TGTGTGTGTGTGTCTGGTGAGCGTGTGTGCGCTCGTCGAGCGTGTGGTCGCGCGGCGCTC
CTGACGGGTTTGCCCTGTAACTGTCCCCCTCACCATCTTCCAGTGCGATCGCCAGGAAGG
CTCTACCCTAATGCCTGCTTGGCCAAATGCGCGGGCGCGACGGACGCTGAGATCGAATTC
GGTTCGGGAGGTGTTTGTTCTGGCGGCGAATGTTCGCGGCTCGCGTGTCTCCCCGCTCGC
TCCGTCTGCCTCTCGCGCCTACAGACAGCGTGTCCGCAGCACGTATGCGCGGGCGCGACG
GACGCTGAGATCGAATTCGGTTCGGGAGGTGTTTGTTCTGGCGGCGAATGTTCGCGGCTC
GCGTGTCTCCCCGCTCGCTCCGTCTGCCTCTCGCGCTTACAGACAGCGTGTCCGCAGCAC
GTGTGCGTGAGTACAACCAACTGCCACACCCAGCCGCCGTCGCCGGTGTGTGACACGGAC
GGACATTCCCACTCCAAGCCCTGCCAGCTCATCATGAGTGGTCGCCGGTTTGCCTACTGG
GGACAATGTTTACAGGGATGTTCATCGAGCGGAACAGTTTGCGGAGTCAATGGCATCACC
TACAGCTCTGAATGCGCCGCGTGGTCGGAATACGTCAGTGTGGACTACAACGGAGCTTGT
TTTGCTGTTGGTCCCATATCAGACCTCATGGAACCAAAATGTCAGTTCGACAGAATAATG
TGCCCTCCATTGAAGAAACCGGATTGTCTCGGATATACTGCCCCTGGAGCGTGTTGCCCG
AAATGCGGTGGAGCTTTGAGAATATTATACTCTAAAAAACAAATCGATAGAGCGCTTTAT
GGGACCAATATATCGGCTTCCGTTATAAACCTGCACAATGTCCTGTCAGCTCTGGACAGG
CATGTCAAAGTGGCTGAATGCGCTTTAAGGGGCTATCTTACCATCGAAATGGAGATTTTC
GTCACCATCGAATCAGTATTGAAGAATCCGACCGACCTGCAGTTGAACGTCTGTGTTTTG
GAAGCGGAAAAACTTGCTGATCTCATAAACCGGGAGAGTGCTCTTATTTCTAGTGATTTA
GGCTTGAGTGCGTTGTCGTACGCCTTGATAGTGCACACACATCCGTCTCAAGGCGCGTCC
GTTGTGAGTTTGTCGTTGGCCATAATATTATCGTACACAGTAATATTCGTTCTGAGGTAG
Protein sequence:
MSLVEMASDSAMREERIENIYKFCAPHLIEFWICMNQTIQEVVSRSGWWGRACCALGHSA
KCRRACATASDASALSEPCRRSDEINFFDCVQKQQEGQWCCSQTRSISCHEACQKAVWRV
GQTRADSGAREKAAELCEQSPALLHCFRDLTASTVHTDTSKYLPCCHESPSQECRSTCET
VLRRTGESQEIAEALSQECGAPAIHDGMWQCFLRKDAPPETKDVIPHDVAKLHCCQKGAT
INCRRLCFYTFNNGWHLNWQKFYSECLGDPQETEMAECIEEVEAPCSLGCAGLTYCSQLN
NRPTSLFRSCSAQADLDAHLAVAEQKASGDVTVAGLRLPLKNSSQCTTDIWKSVACALHV
KPCTLKGHSSLLCAEECRRLVSSCVEWSRAPLSAAALCARLAPARPDAPCVSLAEFLTAS
PEPPLLSATEMVTSPCAGSPCNSSQVCVINRSCIRGGSCSKYTCIDGCPLGDGSPYVVPV
GSWVRVPMPCAAQKVCIKICRCGNKGLSDCQPLPSVALDNCRLHDKVVKHGDKYYMECNP
CVCVSGERVCARRACGRAALLTGLPCNCPPHHLPVRSPGRLYPNACLAKCAGATDAEIEF
GSGGVCSGGECSRLACLPARSVCLSRLQTACPQHVCAGATDAEIEFGSGGVCSGGECSRL
ACLPARSVCLSRLQTACPQHVCVSTTNCHTQPPSPVCDTDGHSHSKPCQLIMSGRRFAYW
GQCLQGCSSSGTVCGVNGITYSSECAAWSEYVSVDYNGACFAVGPISDLMEPKCQFDRIM
CPPLKKPDCLGYTAPGACCPKCGGALRILYSKKQIDRALYGTNISASVINLHNVLSALDR
HVKVAECALRGYLTIEMEIFVTIESVLKNPTDLQLNVCVLEAEKLADLINRESALISSDL
GLSALSYALIVHTHPSQGASVVSLSLAIILSYTVIFVLR