New model in OGS2.0 | DPOGS211236  |
---|---|
Genomic Position | scaffold1538:+ 4527-9848 |
See gene structure | |
CDS Length | 1269 |
Paired RNAseq reads   | 71 |
Single RNAseq reads   | 221 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA005008 (1e-37) |
Best Drosophila hit   | melanization protease 1, isoform C (1e-35) |
Best Human hit | coagulation factor IX preproprotein (4e-26) |
Best NR hit (blastp)   | seminal fluid protein HACP049 [Heliconius melpomene] (3e-88) |
Best NR hit (blastx)   | seminal fluid protein HACP049 [Heliconius melpomene] (5e-87) |
GeneOntology terms    | GO:0006952 defense response GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis GO:0008236 serine-type peptidase activity GO:0035006 melanization defense response |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL17068 |
Nucleotide sequence:
AATCATAAAACTTGGGAATTTTTGGATACCTTCGATTGTGGATTTAATTTCGTCGATCGT
ATTATTGGGGGATTAAATGCAGCACCAAAACAATTTCCTTGGATCACGAGACTGGGTTAT
TCCACCCGAGAAGAAAAAGAACTAGATTGGATGTGTGGTGGTGCGCTCCTATCTGACCGT
CATGTTATCACAGCAGCGCATTGCGTTGTGAGCTCAATCGAAGCTAAACTGGTAAAAATT
CGTATGGGAGAGTACGACATTAGGACAAACCCGGATTGTCAATTTAACAAATGCGCCCCT
CCAGTCCAGGATCGCGGTATAAAAACTATTATAAGTCACCCAAATTTTAACAAGCCAGCT
TTTCACAATGATATAGCAATCATCGTTCTGGATGAACCCGTAGAAATGAATGACTATGTT
ATACCAATTTGTTTGCCGCGGGAGGAGCAATTACGTCAGTACTTAGAACTAGGAGAAAAG
TTAATAGTAGCTGGCTGGGGTAAAATGAATATGACTACAGACGAAAGAGCTAAAATACTA
CAATATGTAACTGTACCTGTCCTGAAATTAGAAATGTGCAATACTTTTGGAAAGCGATTC
ACTTTAGCCGAATCGGAAATATGCGCGGGAGCACAAGAACACAAGGACGCATGTGGGGGC
GATTCAGGGGGTCCTCTAATGAAGGCAACCCGAGAAGAAAAAGAACTAGATTGGATGTGT
GGTGGTGCGCTCCTATCTGACCGTCATGTTATCACAGCAGCGCATTGCGTTGTGAGCTCA
ATCGAAGCTAAACTGGTAAAAATTCGTATGGGAGAGTACGACATTAGGACAAACCCGGAT
TGTCAATTTAACAAATGCGCCCCTCCAGTCCAGGATCGCGGTATAAAAACTATTATAAGT
CACCCAAATTTTAACAAGCCAGCTTTTCACAATGATATAGCAATCATCGTTCTGGATGAA
CCCGTAGAAATGAATGACTATGTTATACCAATTTGTTTGCCGCGGGAGGAGCAATTACGT
CAGTACTTAGAACTAGGAGAAAAGTTAATAGTAGCTGGCTGGGGTAAAATGAATATGACT
ACAGACGAAAGAGCTAAAATACTACAATATGTAACTGTACCTGTCCTGAAATTAGAAATG
TGCAATACTTTTGGAAAGCGATTCACTTTAGCCGAATCGGAAATATGCGCGGGAGCACAA
GAACACAAGGACGCATGTGGGGGCGATTCAGGGGGTCCTCTAATGAAGGCAAGTACTTTT
TTTAATTAG
Protein sequence:
NHKTWEFLDTFDCGFNFVDRIIGGLNAAPKQFPWITRLGYSTREEKELDWMCGGALLSDR
HVITAAHCVVSSIEAKLVKIRMGEYDIRTNPDCQFNKCAPPVQDRGIKTIISHPNFNKPA
FHNDIAIIVLDEPVEMNDYVIPICLPREEQLRQYLELGEKLIVAGWGKMNMTTDERAKIL
QYVTVPVLKLEMCNTFGKRFTLAESEICAGAQEHKDACGGDSGGPLMKATREEKELDWMC
GGALLSDRHVITAAHCVVSSIEAKLVKIRMGEYDIRTNPDCQFNKCAPPVQDRGIKTIIS
HPNFNKPAFHNDIAIIVLDEPVEMNDYVIPICLPREEQLRQYLELGEKLIVAGWGKMNMT
TDERAKILQYVTVPVLKLEMCNTFGKRFTLAESEICAGAQEHKDACGGDSGGPLMKASTF
FN