DPGLEAN14343 in OGS1.0

New model in OGS2.0DPOGS211236 
Genomic Positionscaffold1538:+ 4527-9848
See gene structure
CDS Length1269
Paired RNAseq reads  71
Single RNAseq reads  221
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005008 (1e-37)
Best Drosophila hit  melanization protease 1, isoform C (1e-35)
Best Human hitcoagulation factor IX preproprotein (4e-26)
Best NR hit (blastp)  seminal fluid protein HACP049 [Heliconius melpomene] (3e-88)
Best NR hit (blastx)  seminal fluid protein HACP049 [Heliconius melpomene] (5e-87)
GeneOntology terms



  
GO:0006952 defense response
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
GO:0008236 serine-type peptidase activity
GO:0035006 melanization defense response
InterPro families


  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology groupMCL17068

Nucleotide sequence:

AATCATAAAACTTGGGAATTTTTGGATACCTTCGATTGTGGATTTAATTTCGTCGATCGT
ATTATTGGGGGATTAAATGCAGCACCAAAACAATTTCCTTGGATCACGAGACTGGGTTAT
TCCACCCGAGAAGAAAAAGAACTAGATTGGATGTGTGGTGGTGCGCTCCTATCTGACCGT
CATGTTATCACAGCAGCGCATTGCGTTGTGAGCTCAATCGAAGCTAAACTGGTAAAAATT
CGTATGGGAGAGTACGACATTAGGACAAACCCGGATTGTCAATTTAACAAATGCGCCCCT
CCAGTCCAGGATCGCGGTATAAAAACTATTATAAGTCACCCAAATTTTAACAAGCCAGCT
TTTCACAATGATATAGCAATCATCGTTCTGGATGAACCCGTAGAAATGAATGACTATGTT
ATACCAATTTGTTTGCCGCGGGAGGAGCAATTACGTCAGTACTTAGAACTAGGAGAAAAG
TTAATAGTAGCTGGCTGGGGTAAAATGAATATGACTACAGACGAAAGAGCTAAAATACTA
CAATATGTAACTGTACCTGTCCTGAAATTAGAAATGTGCAATACTTTTGGAAAGCGATTC
ACTTTAGCCGAATCGGAAATATGCGCGGGAGCACAAGAACACAAGGACGCATGTGGGGGC
GATTCAGGGGGTCCTCTAATGAAGGCAACCCGAGAAGAAAAAGAACTAGATTGGATGTGT
GGTGGTGCGCTCCTATCTGACCGTCATGTTATCACAGCAGCGCATTGCGTTGTGAGCTCA
ATCGAAGCTAAACTGGTAAAAATTCGTATGGGAGAGTACGACATTAGGACAAACCCGGAT
TGTCAATTTAACAAATGCGCCCCTCCAGTCCAGGATCGCGGTATAAAAACTATTATAAGT
CACCCAAATTTTAACAAGCCAGCTTTTCACAATGATATAGCAATCATCGTTCTGGATGAA
CCCGTAGAAATGAATGACTATGTTATACCAATTTGTTTGCCGCGGGAGGAGCAATTACGT
CAGTACTTAGAACTAGGAGAAAAGTTAATAGTAGCTGGCTGGGGTAAAATGAATATGACT
ACAGACGAAAGAGCTAAAATACTACAATATGTAACTGTACCTGTCCTGAAATTAGAAATG
TGCAATACTTTTGGAAAGCGATTCACTTTAGCCGAATCGGAAATATGCGCGGGAGCACAA
GAACACAAGGACGCATGTGGGGGCGATTCAGGGGGTCCTCTAATGAAGGCAAGTACTTTT
TTTAATTAG

Protein sequence:

NHKTWEFLDTFDCGFNFVDRIIGGLNAAPKQFPWITRLGYSTREEKELDWMCGGALLSDR
HVITAAHCVVSSIEAKLVKIRMGEYDIRTNPDCQFNKCAPPVQDRGIKTIISHPNFNKPA
FHNDIAIIVLDEPVEMNDYVIPICLPREEQLRQYLELGEKLIVAGWGKMNMTTDERAKIL
QYVTVPVLKLEMCNTFGKRFTLAESEICAGAQEHKDACGGDSGGPLMKATREEKELDWMC
GGALLSDRHVITAAHCVVSSIEAKLVKIRMGEYDIRTNPDCQFNKCAPPVQDRGIKTIIS
HPNFNKPAFHNDIAIIVLDEPVEMNDYVIPICLPREEQLRQYLELGEKLIVAGWGKMNMT
TDERAKILQYVTVPVLKLEMCNTFGKRFTLAESEICAGAQEHKDACGGDSGGPLMKASTF
FN