DPGLEAN07492 in OGS1.0

New model in OGS2.0DPOGS206698 
Genomic Positionscaffold725:- 6922-11691
See gene structure
CDS Length2076
Paired RNAseq reads  28
Single RNAseq reads  73
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008530 (2e-10)
Best Drosophila hit  scarface, isoform A (6e-14)
Best Human hitazurocidin preproprotein (2e-09)
Best NR hit (blastp)  GE22829 [Drosophila yakuba] (7e-12)
Best NR hit (blastx)  scarface, isoform B [Drosophila melanogaster] (1e-12)
GeneOntology terms


  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
GO:0005615 extracellular space
GO:0048803 imaginal disc-derived male genitalia morphogenesis
InterPro families
  
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR009003 Peptidase cysteine/serine, trypsin-like
Orthology groupMCL40287

Nucleotide sequence:

ATGACGTCCCCATTTTTAGTTTTGTTGATCGTTCTGAAAACGGTTTCATCGCAAATAAAT
GGCGAATTTTGGTGGCTTAATGAAAAATTCACGAAGTTGCAGCAGGTTGTGCCACCATCA
CCAACATTCGAAGACACGGGACATTTGGAAACCGATGAAAGTGTTAAAATTATTTTTAAA
GATGTCACAGAGGATATTGATAAAAATATAAACTTTTCTCTCAACGAAGGTGAGGTTTTT
CCCGAATTTATTCTCTATAACTTAACTAAATCCCCGATTGTGGGTGAAGGAAATGTCTCC
AGTGAGAAAAATACAACGAATAATACCGATGAAGACTTCACAGTTAATTCTATTCAATCC
AAGAAACAAATAAAGAAAGTAAGAAAAATTGCCATAAATGATAAAATTGACTTCGATGAA
GATAAACCGACAATAGAATCTGAAAGTATTTGCACGTTTATTACAAAACATGAATGCTTA
CGCAACAAGGGCACTGTTCACATGTCTGGATTATGTCCCTTGAATTCTTTTCATAATTAT
CACAGAATTTGCTGTATACTTCCTCTTTTTCCATATCCCAAACAATTACACCCAAGTGAC
ATCCTCAATGGAAGCCGGTACAAACGATCTAATGATGATGAAATCAGTCCAGCGCTCAAA
CAAAGAAACGCTTTGCTGCAGCGAAAAAATTTCTCCCAGGCGTTTAAAAATAACCACGAT
CCTACAACAGATCAATCTAGAAACCAAAAAATTGTAACTCCTAGTGATAACGTGGATCCG
TATTGGAATGTTAAAAACTTTAAATTTCGTCAACAAAATAATTTTAATGAAAACAAGGAT
CGTGAAAATACAAAAATCGACTCAAGTAGTAAAGATTATTCAGATGATTATACAGCAGAG
GTACCTAAACCTGGACTTTTAGGTGCTTATACAGAACGTGATGAACGTCTTACTACTTGG
AAAATGAGAAATAAAGCCTACTCATACGACGGCTATGACGAAATAAGCGAGGAAGATAGC
GGAGAAACTGACATGCCGTTCGGTTACTCAACGTTTGATCCAAGACAAGGTAATCGCAAG
AAAAATTCTAGAAGCAAAAAACGTAAACCTCAAAGGCTAATGTTAACATCAACAGAAGAG
AACAAAGGATCAGAGAGCCAGGCTATAAATTTTCATTCGAGGCCGGATTTTCATGTACTA
CATGGTTTTAAACTTGTTAATTTATCAGGGAATAAAAATAGATTTGTTAGAACTACCACA
GAAACTTTATGTGACAGCCAGGATACTTCCAACGAAAATACATTTCCAGAGGTCGAAAAT
CCGGATGATATTTACGACGACACGATTGATTTAGATCAGCAAGTTTACAAAGATTGTGGA
AATAGTGTTACCAACGCTTTCAATAGCGATTTTAGAAAATCACATGAAGAAAAAAACCCT
TGGCTCGCTCTGGTCGTGTTAACAAAAAGTCCACAGACAATATTATGTTATGCCACAATA
GTACATCCGCGCGCAGTCATTACAGCTGCCGAATGTGTCCAGGGTAAAATCCCTGGGGAC
GTAACTGTACTAACTGGTGTATGGCAACTAAGGAAAGACAAAGCAGTACCACAACACCGC
ATGGCCTCGGTCTACATTGTTTCTGACTATAAACCTGGAGAACTTGTTAATGATCTTGCT
CTTCTGTATTGGAAACGACCATTACAACTGGCAGAGAACGTTCAGCCCGCATGCCTCGCG
GATCCGCACGTCGGAGACGAGTGTTATTTCGTTGGGTGGGGTGGTTACGATCAAGGTTTA
AGTCATCACCCTGACTCTCAACAAGCGACTATACTCACGCCTCGTGTGTGTAACGAGAAA
TTATCATCACCAGAGCTGCTTCTACCCCCAGGCGCGTTCTGTGCTTCAGTTGAATCACGT
GGCACTGTAACCGGTATTGGAGGTGCTCTTCTATGTAAAGGCGCGGGTAGTCGAACATCT
GTCGTAGGAGTGGCGGTGTATCGTGACAGTATAGTTGTCTTACTACCCACATTCGAATGG
GTCGTCTCGGCGTTGCGACACCACCAAATAATTTAA

Protein sequence:

MTSPFLVLLIVLKTVSSQINGEFWWLNEKFTKLQQVVPPSPTFEDTGHLETDESVKIIFK
DVTEDIDKNINFSLNEGEVFPEFILYNLTKSPIVGEGNVSSEKNTTNNTDEDFTVNSIQS
KKQIKKVRKIAINDKIDFDEDKPTIESESICTFITKHECLRNKGTVHMSGLCPLNSFHNY
HRICCILPLFPYPKQLHPSDILNGSRYKRSNDDEISPALKQRNALLQRKNFSQAFKNNHD
PTTDQSRNQKIVTPSDNVDPYWNVKNFKFRQQNNFNENKDRENTKIDSSSKDYSDDYTAE
VPKPGLLGAYTERDERLTTWKMRNKAYSYDGYDEISEEDSGETDMPFGYSTFDPRQGNRK
KNSRSKKRKPQRLMLTSTEENKGSESQAINFHSRPDFHVLHGFKLVNLSGNKNRFVRTTT
ETLCDSQDTSNENTFPEVENPDDIYDDTIDLDQQVYKDCGNSVTNAFNSDFRKSHEEKNP
WLALVVLTKSPQTILCYATIVHPRAVITAAECVQGKIPGDVTVLTGVWQLRKDKAVPQHR
MASVYIVSDYKPGELVNDLALLYWKRPLQLAENVQPACLADPHVGDECYFVGWGGYDQGL
SHHPDSQQATILTPRVCNEKLSSPELLLPPGAFCASVESRGTVTGIGGALLCKGAGSRTS
VVGVAVYRDSIVVLLPTFEWVVSALRHHQII