DPGLEAN12839 in OGS1.0

New model in OGS2.0DPOGS201312 
Genomic Positionscaffold1094:+ 65833-70006
See gene structure
CDS Length1302
Paired RNAseq reads  13
Single RNAseq reads  124
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA003104 (1e-39)
Best Drosophila hit  CG3355, isoform A (8e-18)
Best Human hitvitamin K-dependent protein C preproprotein (1e-12)
Best NR hit (blastp)  GJ23363 [Drosophila virilis] (8e-21)
Best NR hit (blastx)  GK23752 [Drosophila willistoni] (1e-20)
GeneOntology terms
  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families


  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology groupMCL39723

Nucleotide sequence:

ATGAAACACTTAAAAGTTCGTAAATTCTTAGTACATCACAAATACTGCGACACCTCTCTG
GTAAATGACATCGGTTTGACATATTCAAACGCACCGGTAAAGTTTGGGGCGAACGTGAAG
CGTGTTGCGCTGCTGAGTATATTTCCAAGACGAGCAACTCACGGTTACGTTACAGGATGG
GGATTGGTTAATACAGACCCTGAAGAACTAGCTACGTCCATGAAATATGTTCAACAAGAC
ATAATAAAACCCAAAGAATGCAGTCGTGGAAACGTGCCTGCTGGAATCTTCTGTGGCCAA
TCCATGAATGATGGGGAAAATCCAGAAATTTCATCGAGGCACTGGATTTGCGTACTAGTA
AAAATTAGAAAAAATGTGAAGGGCTGGTCATCGTCTGCTTGGACACCTTTAACGAGACGA
AACTTCTTAGCATTCAATGGATCCATCGTGCGGGTGCCGTTCCGTCGCGTTGCAGGGAGA
GGATCTTCAACAGTGCCTGGGACTTGCGACCCAGGTGTAGGTTCAAGGGGCGCCTATCTA
ACATGCGTTGATCGCTCACCTCCACTAAAGGTTTTTTCATCTAAACCAAATGAATTAGAA
GGGAGGGTAGTCAGAGGAGACGTTGTGTCGATCGAGGACTTCCCATATTCAGCGTTTCTG
TTGATGGGTAGAGAGAGGGGCAGCTTTATATGTGGTTCATCCATCATCAATCAGAGAATC
TTATTGACGGCAGCACATTGTATCGAAATATGCAATCCCAAGTGCAAGAACGGAGCGGCA
TTTGTTGGAAATGAACAAAAGAGGATGGGAATCAAAATGACTATAACATTCGCAAAATAC
CACCCCAGATATAGAACAAATCGTGTGCACTTTGATATAGGTCTTGCATTGCTTTCTAGA
TCTATAAAGTTTGGTAAATTTGTTAAACGGGTTGCCATTTCAAGGCGTCCGAGGATAAAA
TCTGTCGCTGATATAGCTGGTTGGGGTTTAGTTGATGAAATAAACAAATTGTCGACAGAT
TACTTGCATCATATAACGCAAAAGGTGATAAGTCATAGTGATTGTAAGGCCTATATATCC
AATATTCCTCCAGGCTCTTTCTGCGCTGGTGAGATTAAGAGCAGGCAGTTTGCATCAGAA
GGGGACTCTGGCAGTGCTTTAATAATCAACAAGTACACGCAAATCGGTATCGTGTCTTAT
AAACGGCCGGACATATCGGCCAGTCTTATTGTATATACAAACGTCTCATTCTATTACGAC
TGGATAAAACAAACTTCGAGAAAATTGTACTGCGACTATTAA

Protein sequence:

MKHLKVRKFLVHHKYCDTSLVNDIGLTYSNAPVKFGANVKRVALLSIFPRRATHGYVTGW
GLVNTDPEELATSMKYVQQDIIKPKECSRGNVPAGIFCGQSMNDGENPEISSRHWICVLV
KIRKNVKGWSSSAWTPLTRRNFLAFNGSIVRVPFRRVAGRGSSTVPGTCDPGVGSRGAYL
TCVDRSPPLKVFSSKPNELEGRVVRGDVVSIEDFPYSAFLLMGRERGSFICGSSIINQRI
LLTAAHCIEICNPKCKNGAAFVGNEQKRMGIKMTITFAKYHPRYRTNRVHFDIGLALLSR
SIKFGKFVKRVAISRRPRIKSVADIAGWGLVDEINKLSTDYLHHITQKVISHSDCKAYIS
NIPPGSFCAGEIKSRQFASEGDSGSALIINKYTQIGIVSYKRPDISASLIVYTNVSFYYD
WIKQTSRKLYCDY