DPGLEAN02677 in OGS1.0

New model in OGS2.0DPOGS214422 
Genomic Positionscaffold593:+ 20977-41171
See gene structure
CDS Length5055
Paired RNAseq reads  60804
Single RNAseq reads  146072
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011362 (2e-85)
Best Drosophila hit  scarface, isoform A (4e-39)
Best Human hitplasma kallikrein precursor (5e-18)
Best NR hit (blastp)  PREDICTED: similar to AGAP008091-PA [Tribolium castaneum] (3e-61)
Best NR hit (blastx)  serine protease H164 [Tribolium castaneum] (2e-59)
GeneOntology terms


  
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
GO:0005615 extracellular space
GO:0048803 imaginal disc-derived male genitalia morphogenesis
InterPro families
  
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR009003 Peptidase cysteine/serine, trypsin-like
Orthology groupMCL15891

Nucleotide sequence:

ATGAAGCCACTGCTATCAGTGGTGTCGCTGTGCCTTCTGCTGTGCGTACACGCGCTGCCG
GAAAACATCGAAGACGTCAAAGAATTACCTCAATCCAAAGAACCGATAGTGGAAGCAGAA
GAATCGAAGCCTCAAGCGAGAGCCGAAAGATGCACGACCTGTAGCACACTTAAACTAGGC
CTTAAGTCCCCAAAAGAAGTATTAGCAGCCTTGCATTCCTTGCCGGGAGCGGAAGTACAC
ACTCAGCAGTCCTTCGAAGGCTGCTCCAGCGATAAAGGATGTGCAGGACTGAAGCTCAAG
GACGGAAAAGTCATAGAACGTTTCGGGAACGTAGAGGCCTTCAAAGCCGCAGCTGCCTCC
GATGTGAACAACGAGTTCAACTTCCACGCCGGTTTCGGAAGCGATTTGTTCAAAAGCGCC
AACTCACCATTCTGGTGGATGAACCAAGACAGTCCTTTCAATGGCGGAGCTAACGGTGCT
AGCTTTGAGAAATTCAGCAAATCCTCAAGTTTCTCCTCCGGTAGTGGAGGTAACGCTGCA
TTCACTGGAATGGACCTATCAGCTAACCCGTTCTTAAACGGACAGTTTTCCAACCTGGGT
CTTGCTGGTGATGCTAGCTCTGTAAATCCCTTCCAATCATCAACCTTCGAGTCATCGGCA
TTCAGTGCCTCTAGCAAAACCGGTCAGGCTGGACTCCAAGGTTTCGGAGCTCAAAACGCT
GCTTCGAACTTCGGAGCTAGTGCCTTCAACTCTGGTTTCGCTGGCAATAAATTATCTGGC
TTCAGTGGTTCAAGCCCAGCACCTTTCGGATCTGGAGCTAACGTCAACCTCATTCAGAAC
GCCCAAAAGAACGACTTCGATTTCGAGCAACAGCAAACTCAACAGAACATCGACGAAATC
TTCCAAAACGCTGGAAACCTCGGCGTGGACGCGGGGGTTACAGCTGGGGAGTTGCAGCAG
ACCTGCTCCGGCTTGGGATACGCCTGCGTGCTGAAAACACAATGCAACAACGGCGTCGTA
AACATCAACGGAGCGGCTGCGCTACAAGCTAAAACTAAGAAGCAATACTGCAACCTTGCG
ACGGAAATATGCTGCAGAATCGAGACCGCTCAAGGAGCCGTTGGATCTACCGCCGGCCAG
GGTTCTGGTCTCTTCGCTGGACAAATTGGTTCCGTATCCGGAGGTTACGGCAGTCAGACA
ACCCAAAGTGGATTTTCAAACGGATTTGGTGCTAAAGGCACCTTTGCAACCGGAGCTTCT
TTCGGATCTGGCATCGGATCTGCCAACAGAGGCATCACGGTCGAATCCACTAAATTCGGA
TCTGGATATGGCTCGACTCTCGCACCCACCACATCCAGATTTGGATCCAACGGCTTCAAA
TCCACGAGCCAAACGAACTTCATCGACGCTGATTCACTAACCGCTGGCAGTGAAGCTGCT
GGTGTTTACCGACCTGGTGCTGTCGGATCTGGTTTGAAACCTGGTATCCCCTACCTCCCA
CCCATCGACGTCACAGGCAGTGGCAGTAATGTCGTCTCCACAACCGTCTTCCCTACCCCT
ACCATAATCACGACTCCTAGACCATTCACAACCCCAAAACCGACCTATCTGCCCCCTATT
TCATCAACCTCAGCCCCAGGTTACTTACCACCTATCGGGGAACCAACCAACAACAGAGAG
ACCATCGTCCCTAAACCTGATTATCAAGATGGTTCTATAATCCTGGACGAAAACAGATTC
CCCACAGCTAGACCTACCCCCGTGCCTGCACCGAGTGAAATCCCCGCTGGATGTGCCGCC
GCCCTAAAGTGTACTGCCGTCGAGTTCTGCACAGCTGAAGGTGTGATCTCAAACGTTACT
GTCTTTTTGACCAGAGATCAAGAGGCTTACAGAGTACCTCTCACGGATTGCCGTGACTTG
GAGACTGGACGCATTGGTAAATGCTGCCGGGATCCTTACTACACCGACCCCTGGCCTGTG
AACCAGCTGGGTAAGTGGGTGCCCGGGGTATTCGGGGGTAACGACGGTAAATACGTTCCG
GATAGCAGAGTTAGTCCAAACAATATCAGACCCAGTGTCACGGTCCGCCCTCCTGTCACC
GGTTCCGTCATATCACCAGCCTTCCTGACTAAACCCACGCCTACACCATTTGGGCCCAAC
CAAGTTTCTCCTGGTTTTGGCTCCACTGTAACTCCATTGAATCAGAGAGGTCAGGGTCAG
TTCCCTATCGGAGGTCAAGGACAATACAATAAAGGTGGTCTGGGACAATTCTCTCAAGGG
GGACAGGGGCAATTCACATCAGCTGGACAAGGACAACTTGGCATCATAGGACAAGGTCAA
ATTGGATCTGGATCTGCAATCAACACAGCGTTCGCCCAAGGACAGGTTGCACAAAAGGGA
CAAGGGTCGTTTGTGTCCCAAGGACAGGGAGTGGTTGCATCCAGGGGCCAAGGTCAAGTG
GTAAACAGAGGTCAAGTTAGCAAGGGACAAGGTTTCTTGGTGAATCAGGGTGCGGGGGTT
GGAATCAATAAACAGCAGGGACAGTTCGTCAGTCAGGGTCAAGGACAAATAGTGTCCCAA
GGTCAAGGACAAATTGTTTCCCAAGGAGTGGGACAGGGAGTCAGACAAGGAGTCGGGCAA
TACGGCCAAGGACAGCTTGGTATCCAAGGACAAGGTGTCCAGTCGCAATTTGGTGCTGGG
CAAAATGGCTTAGGCGTAGCAGCAATTGGAGCACAAGGAGTGAACGGTCAGGGACAGCTC
GTAAATCAAGGACAGGGCCAATTCGTATCAAAAGGTCAAGGCAGCGCTATCAATCAAGGA
TTTGGTACTGGCATCCGTCAGGGAAGCGGCGTGGTTGCGTCTCAAGGATTCGGGCAGGGA
GTGCGACAAGGACAAGGCACGGTTGTGTCGCAAGGATTCGGTCAGGGAGTCCGTCAAGGA
CAAGGACTCCTGGTCAATCAGGGAGAGGGACAAGTATCTTCGCAAGGACAGGGACAATTT
GTAAGCCAGGGTCAAGGACAACTCCTAAATCAGGGACAGGGACAATATGTATCGCAAGGT
GAAGGACAGCTAGTCTCTCAAGGACAGGGTGCTCTTGTGTCCCAGGGTCAAGGACAGCTG
GTCTCTCAAGGACAGGGTGCTTTTGTATCCCAGGGTCAAGGATCTCTTGTTTCCCAAGGA
TTCGGACAAGCCATCCGTCCAGGACAAGGCGCTTTCCTGACTAATGGCCAGGGACAAATA
GTCTCTCAAGGAGGAGGAGCTCTGATCAATCAGGGTGAGGGAGCATACGTCACAAATGGC
TTCGATCAAATCCGCCGAGCTCAAGCCCAACTCGTATCTACAAAGGAAGGGCAGTTGGTT
ACGCAAGGGGAAGGAGAGCTTGTTTCACAAGGCCAGGGGCAGAGAGTGTCGCAAGGATTC
GGTCAGGGTGTCCGCCAGGGGCAAGGATTCTCTGTGACGCAGGGCGGAGGGTATGGTGTT
GAAAACGAGTACGGTGAATCAGTGCAGAGGGTTTTCCTTCAACAGTACAACGCTGGAGGA
CAATGTGGTGTTCTGAATGGCCAACGTCCTTTTGGCAACCGCAATGAATTGGAAGCCGAT
TTCGCTGAGATACCCTGGCAGGCGATGGTGCTGTTGCAAACTAACAGAAGCCTGCTGTGC
GGCGGAGTCATCACCAGACCTGATGTGGTCGTAACCTCAGCCGCCTGTGTTGAAGGCCTG
GATGCCAAGAACGTGCTGATTAAAGGAGGTGAATGGAAGCTCGGGATAGACGACGAGCCT
CTGCCGTTCCAGATCGTCCAGGTCAAGACGATTCTCCGCCATCCGCTGTACAAACACAGC
AACCTCCACTACGACGCTGCTATCCTGGTACTCGCTGAGAACTTGAGATTCGCTAAAAAC
ATCTATCCCATCTGTCTCCCTGACAAGGATGACAGTTTGGACAAATACTACAACGGCGTC
GGAGAGTGTATCGTAACGGGATGGGGCAAGCAAGTCCTCCAAGCTCACCTTCAAGGCAGT
ATAATGCACAGCATCAACGTCTCGCTCATCAGCCCAGGTGAATGCCAGTCCAAATTATCA
TCAGAATACCCTCACCTCCTGGACCTGTACGATGAAGACAGCTGCGTCTGTGGCCAACCT
TCGAACCCTCTAAATAATATTTGCAGGGTTGACATTGGCAGTGCTCTTGCCTGCACGACT
GGCGACGGTCATTACACCTTCCGAGGAGTGTACTCCTGGGATTCCGGATGTCAAGTCGGA
AACCAAGTGGCTGGTTTCTATAGATTCGACCTGGAATGGTACCAGTGGGCCATCGGTCTC
ATCGAAAGCGTCAGATTCGCTCAATACAGTACAGTTACCAAGGTCACCACGGGGATATAC
ACTGGTCAAATAAAGGGTGGAGTGAAGGGCTTCTCTGGAGTCAAAGGAGTCAAGGGTTCG
TCAAACTCTGGCTCATCCATCAGAGCTGGTGCTGTAGCTTCAGTTTCATCTGGAGCAGTC
TCTTCGGGATCATCAGGAGTCATAAGTGGCCTTAATAGCTTCAACTTTGGAAAAGGTCAA
TTCGGATTTGGACAAAGTCAAGGCCAGCTATCTGGTAACCAAGGGCTGGTCAGTCAGGGA
CAGTTCGCTGGTAAAGTGAACCAGTTCCAGGAAAAAATAAACAGTGGTAGCTCAAGCCAA
GCCGGTTTTGGAGATGGATTCAACTTCAGCGAAATCAAACCGATCACTAACGGCTTCAGC
GCCACCTTCTCCGAGAAGAAGGTCTTCAAGACCGAACCGAAATTCGTGACATTCACAACG
AAACCAGAGATCGTGACGTATACAACTAAACCAGAAATCTTCACATTTACAACCAAACCC
AAAATTATTACTTACACAACCAAACCCAAAATCATAACCTACACAACCAAACCCCAGATC
ATCAGATACGAGACATCCGGCAGTGGGACCAACCCCCAATACGTAGCCCCAGGGGTGACC
TTCAACCCCTCCTTTTCAGAATTAGTGGGTAAGCACGAACACACAGCCAAATGCAAATGT
TTAGAAGGTAAATGA

Protein sequence:

MKPLLSVVSLCLLLCVHALPENIEDVKELPQSKEPIVEAEESKPQARAERCTTCSTLKLG
LKSPKEVLAALHSLPGAEVHTQQSFEGCSSDKGCAGLKLKDGKVIERFGNVEAFKAAAAS
DVNNEFNFHAGFGSDLFKSANSPFWWMNQDSPFNGGANGASFEKFSKSSSFSSGSGGNAA
FTGMDLSANPFLNGQFSNLGLAGDASSVNPFQSSTFESSAFSASSKTGQAGLQGFGAQNA
ASNFGASAFNSGFAGNKLSGFSGSSPAPFGSGANVNLIQNAQKNDFDFEQQQTQQNIDEI
FQNAGNLGVDAGVTAGELQQTCSGLGYACVLKTQCNNGVVNINGAAALQAKTKKQYCNLA
TEICCRIETAQGAVGSTAGQGSGLFAGQIGSVSGGYGSQTTQSGFSNGFGAKGTFATGAS
FGSGIGSANRGITVESTKFGSGYGSTLAPTTSRFGSNGFKSTSQTNFIDADSLTAGSEAA
GVYRPGAVGSGLKPGIPYLPPIDVTGSGSNVVSTTVFPTPTIITTPRPFTTPKPTYLPPI
SSTSAPGYLPPIGEPTNNRETIVPKPDYQDGSIILDENRFPTARPTPVPAPSEIPAGCAA
ALKCTAVEFCTAEGVISNVTVFLTRDQEAYRVPLTDCRDLETGRIGKCCRDPYYTDPWPV
NQLGKWVPGVFGGNDGKYVPDSRVSPNNIRPSVTVRPPVTGSVISPAFLTKPTPTPFGPN
QVSPGFGSTVTPLNQRGQGQFPIGGQGQYNKGGLGQFSQGGQGQFTSAGQGQLGIIGQGQ
IGSGSAINTAFAQGQVAQKGQGSFVSQGQGVVASRGQGQVVNRGQVSKGQGFLVNQGAGV
GINKQQGQFVSQGQGQIVSQGQGQIVSQGVGQGVRQGVGQYGQGQLGIQGQGVQSQFGAG
QNGLGVAAIGAQGVNGQGQLVNQGQGQFVSKGQGSAINQGFGTGIRQGSGVVASQGFGQG
VRQGQGTVVSQGFGQGVRQGQGLLVNQGEGQVSSQGQGQFVSQGQGQLLNQGQGQYVSQG
EGQLVSQGQGALVSQGQGQLVSQGQGAFVSQGQGSLVSQGFGQAIRPGQGAFLTNGQGQI
VSQGGGALINQGEGAYVTNGFDQIRRAQAQLVSTKEGQLVTQGEGELVSQGQGQRVSQGF
GQGVRQGQGFSVTQGGGYGVENEYGESVQRVFLQQYNAGGQCGVLNGQRPFGNRNELEAD
FAEIPWQAMVLLQTNRSLLCGGVITRPDVVVTSAACVEGLDAKNVLIKGGEWKLGIDDEP
LPFQIVQVKTILRHPLYKHSNLHYDAAILVLAENLRFAKNIYPICLPDKDDSLDKYYNGV
GECIVTGWGKQVLQAHLQGSIMHSINVSLISPGECQSKLSSEYPHLLDLYDEDSCVCGQP
SNPLNNICRVDIGSALACTTGDGHYTFRGVYSWDSGCQVGNQVAGFYRFDLEWYQWAIGL
IESVRFAQYSTVTKVTTGIYTGQIKGGVKGFSGVKGVKGSSNSGSSIRAGAVASVSSGAV
SSGSSGVISGLNSFNFGKGQFGFGQSQGQLSGNQGLVSQGQFAGKVNQFQEKINSGSSSQ
AGFGDGFNFSEIKPITNGFSATFSEKKVFKTEPKFVTFTTKPEIVTYTTKPEIFTFTTKP
KIITYTTKPKIITYTTKPQIIRYETSGSGTNPQYVAPGVTFNPSFSELVGKHEHTAKCKC
LEGK