DPGLEAN01925 in OGS1.0

Genomic Positionscaffold1362:- 5828-8899
See gene structure
CDS Length1803
Paired RNAseq reads  514
Single RNAseq reads  2947
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000840 (2e-49)
Best Drosophila hit  CG11192 (8e-18)
Best Human hittrypsin-2 preproprotein (2e-18)
Best NR hit (blastp)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (2e-68)
Best NR hit (blastx)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (3e-68)
GeneOntology terms







  
GO:0016787 hydrolase activity
GO:0007586 digestion
GO:0005576 extracellular region
GO:0008233 peptidase activity
GO:0046872 metal ion binding
GO:0008236 serine-type peptidase activity
GO:0004252 serine-type endopeptidase activity
GO:0003824 catalytic activity
GO:0006508 proteolysis
InterPro families

  
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR000477 Reverse transcriptase
Orthology groupMCL10014

Nucleotide sequence:

ATGGTCAGTTTCGATGTACAGTCATTATTCACTAGTATACCTGTTCTTGACTGCATTGAG
ATTGTAAGAGGTAAGTTAAAGGATAACAATATGCCTATAGAATATGCAGAGCTATTAAAG
CATTGCCTAACATCTGGCTACCTCATGTGGAAGGATGAATTCTACATACAAGTAGATGGA
GTTGCAATGGGTTCACCGGTTTCCCCCGTTGTCGCTGACATATTCATGGAGGACTTCGAG
GTGCGAGCCCTTTGCTCTCCTCCTATAAGACCTTTAATTTATAAACGGTATGTAGATGAC
ACCTTCACAATATTAAATAAAAATAAAACATCTGCTTTTCTGAACCATCTCAATTCTATC
AATAGTAAGATTCAGTGTACTATAGAATTGGAGGCAAATAATTCTTTAGCTTTCCTTGAT
ATACTTGTTGTTAGGAATCCTGACAATACTTTGGGACATACTGTTTATAGGAAACCCACA
CATACGGACAGGTACCTCAATGGTTACTCACACCACCACCCTATCCAGTTAGCTACCGTT
GGCAAATCTTTGTTACAGAGAGCCCAACATCTTTGTGATGCTGACCACCTAGAGGCCGAG
CTGCAGCATGTAAAACATGCTCTCACTATCAACAACCTGCCCGTGCCTCGCCAGCATCGC
AAGAAGCACCTGAAGCCACCCACAGTTGAACGACAACCTGCGATACTACCATATGTGAAG
GGAGTTACTGACAGAATAGGCAACATCTTGAAGAAGGTTTCCATTAAAACTATTTACAAA
CCACATAAGAAAGTGAGCCAATTCTTGAGACCAATCAAGAGTAACATTCCTTTACAACAA
GCGGGTGTATACAAACTCGACTGTGACTGTGTCTTGTCATACATTGGACAGACGAAGAGG
AGCATCGGTACAAGGGTTAAGGAACACATCTCAGATATCAAAAACAGGCGCGCGTCGAAG
TCAGCAGTGTGTGAACACACAATGGACAAACCAGGCCACTACATTCGTTTTGATAAACCT
CAAATCCTCGCTCGGGAAGACAAGTATATACCGAGATTAATTCGCGAGGCTATTGAAATT
AAAAAACATCCCAATTTCAATAGAGAAGATGGCTGGAATCTTTCAAACACCTGGGACCCC
GTTCTTAAAAATATAAAATCCCATGTCCGTAACCACACCGCAGGACCTCAAGACACCGTG
AGCGCATTCTGCCGGCATCCAGAGCGGTACGCCAGAAAATTAAGAAATCGATGGCGTATT
GATCCTACCACAGTGATTCTGAGAGCTGGTAGCACATATCGGGGCAATGGTACTATTATA
CCGATAGATGAGATAGTTGCACACCCAGAATATAACGATTCACCCTTTGATAAGGATGTT
GGCTATATACGAACTTCTAATCCAATACAGTTTACTGGCGCTATGAAGCCCATTCCCCTC
GTAAATGAATCTGAACCGTGCAGTAATAGAGTGAACGTCAGCGGATGGGGTAGACTGATG
GAAGGACAAAATCCCTTGCCTCTAAGACTAAGAGCGGTGAATGTGCCTGTTGTTGATTAT
TTTAGATGTAAGATGGCGTATCCCAGAATATTAACTCGCAACATGGTATGTGTTGGGAAT
TTCGTCTTAGGAGGTCAGGGTACTTGTCAGGGGGATTCAGGAGACGCTGGGGTTGATAAT
GGGAGGGCTTGTGGTATTGTGTCATTTGCAAGAGGTTGTGCACGCCCTATGTCTCCGAAT
GTCTTCACAAATATAGCAGCTGGACCAGTTAGAAGATTTATCACAGATAATACAGGTGTC
TAA

Protein sequence:

MVSFDVQSLFTSIPVLDCIEIVRGKLKDNNMPIEYAELLKHCLTSGYLMWKDEFYIQVDG
VAMGSPVSPVVADIFMEDFEVRALCSPPIRPLIYKRYVDDTFTILNKNKTSAFLNHLNSI
NSKIQCTIELEANNSLAFLDILVVRNPDNTLGHTVYRKPTHTDRYLNGYSHHHPIQLATV
GKSLLQRAQHLCDADHLEAELQHVKHALTINNLPVPRQHRKKHLKPPTVERQPAILPYVK
GVTDRIGNILKKVSIKTIYKPHKKVSQFLRPIKSNIPLQQAGVYKLDCDCVLSYIGQTKR
SIGTRVKEHISDIKNRRASKSAVCEHTMDKPGHYIRFDKPQILAREDKYIPRLIREAIEI
KKHPNFNREDGWNLSNTWDPVLKNIKSHVRNHTAGPQDTVSAFCRHPERYARKLRNRWRI
DPTTVILRAGSTYRGNGTIIPIDEIVAHPEYNDSPFDKDVGYIRTSNPIQFTGAMKPIPL
VNESEPCSNRVNVSGWGRLMEGQNPLPLRLRAVNVPVVDYFRCKMAYPRILTRNMVCVGN
FVLGGQGTCQGDSGDAGVDNGRACGIVSFARGCARPMSPNVFTNIAAGPVRRFITDNTGV