DPGLEAN11622 in OGS1.0

New model in OGS2.0  DPOGS204737
Genomic Position  scaffold405:+ 73005-79377
CDS Length  1866
Paired RNAseq reads  206
Single RNAseq reads  648
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012478 (4e-34)
Best Drosophila hit  CG10405 (2e-38)
Best Human hit  transmembrane protease serine 2 isoform 2 (3e-32)
Best NR hit (blastp)  seminal fluid protein HACP002 [Heliconius erato] (6e-123)
Best NR hit (blastx)  seminal fluid protein HACP002 [Heliconius erato] (2e-119)
GeneOntology terms
GO:0004252 serine-type endopeptidase activity
GO:0006508 proteolysis
InterPro families
IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site
IPR002350 Proteinase inhibitor I1, Kazal
IPR009003 Peptidase cysteine/serine, trypsin-like
IPR011497 Protease inhibitor, Kazal-type
IPR001254 Peptidase S1/S6, chymotrypsin/Hap
IPR001314 Peptidase S1A, chymotrypsin-type
Orthology group  MCL23897

Nucleotide sequence:

ATGAAGAACGAAATATGGATTTTTACTATAACATTATTGATCCAAAGTTTTAAATCGAGA
GCTGAAAACGGTATCATAACTGTTAAAAAAGTTACTGTAAGCCTTCCACAAGTATGTCCC
TGTGAATACAATTATAATCCAGTCTGCGGGACGGATGGAGTTACATTTATAAATAAATGT
CTTCTTGATTGTACGAATGACAGAAATGCTCGTCTGACCGATGGATTGCCACATATCTCA
ATCGCCCCTGACGATCAATGCAGTGGATGTGTATGTCTGAATCTCAACGTTCCCGTCTGC
TCATTAAATGGAGTGACTTACGACGATGAATGTGCATTATCCTGTGAAAATAGAAATCGT
ATTCGTGACAATCAAACAATTGTTTATCTCGCATATAGAGCGGCTTGTAATGGACCGCCA
TGTCCATGCACCAACGTTGCGGCTCCGGTGTGCGGCACTGACAATATTTTATATAGAAAT
CAATGTGTTCTTGAATGCGCTAGTAGCAACGCCCAAGCGAAGAATCTTCCTGCAATCGAA
TTACAAAATAATGGTGCCTGTCTAGACGGTTGCCTCTGCCCAAAAACAGTGGAACCTGTT
TGCGGAACAGATGGAAGGATTTACGATAATCTTTGCACTTTCGAATGCCAAAATAAAAAA
CTATCTAACTCTCAAAACGAACCCCTTAAAATTGCTAACCCATTTACTTGTAGAGAATGC
GCTTGTAGAAAAGTATTTGAACCAGTTTGTGGTACCGATGGCAGAACGTATCCCAATAAG
TGTGAAATTCAGTGTGCTTCATTTAGAAGAAGGGATCCCTTCCTCCAGATAGTCTCTCAA
GGACCATGTCCCGAATGTTTCTGTAAAGACGAGTTTTATCCAGTTTGCGGCACAGATCAC
AAGACATACAAAAATGATTGCGAACTGCGATGTGCCAACAATAAGCTTTCAGCTGGTGAA
CAACTGATTAGCATTTTTTATCAAGGGCAATGCATGGAATACAATTGTGACTGCAATTGT
GACTCAGAGTACCAGCCCGTTTGTGGGATCGACAACAGGTCCTATTGGAATTTATGCTTC
CTGAACTGTAACAGTGTATGCAGACAAAGACAAAAACTATCACCAATATTGTTTGTGTCG
AACGGCGTCTGTCCAACACGTAGAATAGTGGCGGGTGTTAATACATCAATAGCGGCGGTT
CCCTGGCAAGTCTCGTTGAGGGAAAAGACGTATCCCATATGTGGAGGGTCCGTAGTTACT
ACATTGTGGCTACTCACAGCAGCGCACTGCCTCTTACGGCCACGAGCCAGTGAGTTAAGT
GTTCGCCTCGGCTCCTCGTGGAAGACTCATGGGGGTGAGATGTATGACGTCAAACAGTCC
TATGTCCACCCGCAGTATGTGAGAAACACAAAAGTCAACGACGTCGGTCTCATCAAACTT
TACTCCCCACTGAGATTCTCTTCAAGAGTTCTTCCTATTAAGATGGTGGGGAAGGGAACT
CGCTTGCCGGCCGACAAAGCAGCTGTGGTCTCTGGATGGGGAAAGTTAAAGGAAGGTGGA
CCCAGTGCTACATTTCTTCAATCATCCACCATAAATACAATTGCGATGAAACTCTGTCGG
AATTCCGGCTTAGACAGAAACCCTATAGATCCAGGGTCCATGTTCTGTGCAGGAGCCTTC
AGCCAGCCCTCGCCCGATGCTTGCCAGGGTGACAGTGGTGGTCCCATAGTGAGTGAAGGT
GTGTTGATCGGAGTGGTATCCTGGGGACTCGGCTGCGCCCGCGGCAACTTTCCCGGCGTC
TACACTCGACTGGCCGCCCCTGTGATATGGGACTGGGTCCATGAACACATTTCACAGGAC
TCTTAA

Protein sequence:

MKNEIWIFTITLLIQSFKSRAENGIITVKKVTVSLPQVCPCEYNYNPVCGTDGVTFINKC
LLDCTNDRNARLTDGLPHISIAPDDQCSGCVCLNLNVPVCSLNGVTYDDECALSCENRNR
IRDNQTIVYLAYRAACNGPPCPCTNVAAPVCGTDNILYRNQCVLECASSNAQAKNLPAIE
LQNNGACLDGCLCPKTVEPVCGTDGRIYDNLCTFECQNKKLSNSQNEPLKIANPFTCREC
ACRKVFEPVCGTDGRTYPNKCEIQCASFRRRDPFLQIVSQGPCPECFCKDEFYPVCGTDH
KTYKNDCELRCANNKLSAGEQLISIFYQGQCMEYNCDCNCDSEYQPVCGIDNRSYWNLCF
LNCNSVCRQRQKLSPILFVSNGVCPTRRIVAGVNTSIAAVPWQVSLREKTYPICGGSVVT
TLWLLTAAHCLLRPRASELSVRLGSSWKTHGGEMYDVKQSYVHPQYVRNTKVNDVGLIKL
YSPLRFSSRVLPIKMVGKGTRLPADKAAVVSGWGKLKEGGPSATFLQSSTINTIAMKLCR
NSGLDRNPIDPGSMFCAGAFSQPSPDACQGDSGGPIVSEGVLIGVVSWGLGCARGNFPGV
YTRLAAPVIWDWVHEHISQDS