New model in OGS2.0 | DPOGS204737  |
---|---|
Genomic Position | scaffold405:+ 73005-79377 |
See gene structure | |
CDS Length | 1866 |
Paired RNAseq reads   | 206 |
Single RNAseq reads   | 648 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012478 (4e-34) |
Best Drosophila hit   | CG10405 (2e-38) |
Best Human hit | transmembrane protease serine 2 isoform 2 (3e-32) |
Best NR hit (blastp)   | seminal fluid protein HACP002 [Heliconius erato] (6e-123) |
Best NR hit (blastx)   | seminal fluid protein HACP002 [Heliconius erato] (2e-119) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR002350 Proteinase inhibitor I1, Kazal IPR009003 Peptidase cysteine/serine, trypsin-like IPR011497 Protease inhibitor, Kazal-type IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL23897 |
Nucleotide sequence:
ATGAAGAACGAAATATGGATTTTTACTATAACATTATTGATCCAAAGTTTTAAATCGAGA
GCTGAAAACGGTATCATAACTGTTAAAAAAGTTACTGTAAGCCTTCCACAAGTATGTCCC
TGTGAATACAATTATAATCCAGTCTGCGGGACGGATGGAGTTACATTTATAAATAAATGT
CTTCTTGATTGTACGAATGACAGAAATGCTCGTCTGACCGATGGATTGCCACATATCTCA
ATCGCCCCTGACGATCAATGCAGTGGATGTGTATGTCTGAATCTCAACGTTCCCGTCTGC
TCATTAAATGGAGTGACTTACGACGATGAATGTGCATTATCCTGTGAAAATAGAAATCGT
ATTCGTGACAATCAAACAATTGTTTATCTCGCATATAGAGCGGCTTGTAATGGACCGCCA
TGTCCATGCACCAACGTTGCGGCTCCGGTGTGCGGCACTGACAATATTTTATATAGAAAT
CAATGTGTTCTTGAATGCGCTAGTAGCAACGCCCAAGCGAAGAATCTTCCTGCAATCGAA
TTACAAAATAATGGTGCCTGTCTAGACGGTTGCCTCTGCCCAAAAACAGTGGAACCTGTT
TGCGGAACAGATGGAAGGATTTACGATAATCTTTGCACTTTCGAATGCCAAAATAAAAAA
CTATCTAACTCTCAAAACGAACCCCTTAAAATTGCTAACCCATTTACTTGTAGAGAATGC
GCTTGTAGAAAAGTATTTGAACCAGTTTGTGGTACCGATGGCAGAACGTATCCCAATAAG
TGTGAAATTCAGTGTGCTTCATTTAGAAGAAGGGATCCCTTCCTCCAGATAGTCTCTCAA
GGACCATGTCCCGAATGTTTCTGTAAAGACGAGTTTTATCCAGTTTGCGGCACAGATCAC
AAGACATACAAAAATGATTGCGAACTGCGATGTGCCAACAATAAGCTTTCAGCTGGTGAA
CAACTGATTAGCATTTTTTATCAAGGGCAATGCATGGAATACAATTGTGACTGCAATTGT
GACTCAGAGTACCAGCCCGTTTGTGGGATCGACAACAGGTCCTATTGGAATTTATGCTTC
CTGAACTGTAACAGTGTATGCAGACAAAGACAAAAACTATCACCAATATTGTTTGTGTCG
AACGGCGTCTGTCCAACACGTAGAATAGTGGCGGGTGTTAATACATCAATAGCGGCGGTT
CCCTGGCAAGTCTCGTTGAGGGAAAAGACGTATCCCATATGTGGAGGGTCCGTAGTTACT
ACATTGTGGCTACTCACAGCAGCGCACTGCCTCTTACGGCCACGAGCCAGTGAGTTAAGT
GTTCGCCTCGGCTCCTCGTGGAAGACTCATGGGGGTGAGATGTATGACGTCAAACAGTCC
TATGTCCACCCGCAGTATGTGAGAAACACAAAAGTCAACGACGTCGGTCTCATCAAACTT
TACTCCCCACTGAGATTCTCTTCAAGAGTTCTTCCTATTAAGATGGTGGGGAAGGGAACT
CGCTTGCCGGCCGACAAAGCAGCTGTGGTCTCTGGATGGGGAAAGTTAAAGGAAGGTGGA
CCCAGTGCTACATTTCTTCAATCATCCACCATAAATACAATTGCGATGAAACTCTGTCGG
AATTCCGGCTTAGACAGAAACCCTATAGATCCAGGGTCCATGTTCTGTGCAGGAGCCTTC
AGCCAGCCCTCGCCCGATGCTTGCCAGGGTGACAGTGGTGGTCCCATAGTGAGTGAAGGT
GTGTTGATCGGAGTGGTATCCTGGGGACTCGGCTGCGCCCGCGGCAACTTTCCCGGCGTC
TACACTCGACTGGCCGCCCCTGTGATATGGGACTGGGTCCATGAACACATTTCACAGGAC
TCTTAA
Protein sequence:
MKNEIWIFTITLLIQSFKSRAENGIITVKKVTVSLPQVCPCEYNYNPVCGTDGVTFINKC
LLDCTNDRNARLTDGLPHISIAPDDQCSGCVCLNLNVPVCSLNGVTYDDECALSCENRNR
IRDNQTIVYLAYRAACNGPPCPCTNVAAPVCGTDNILYRNQCVLECASSNAQAKNLPAIE
LQNNGACLDGCLCPKTVEPVCGTDGRIYDNLCTFECQNKKLSNSQNEPLKIANPFTCREC
ACRKVFEPVCGTDGRTYPNKCEIQCASFRRRDPFLQIVSQGPCPECFCKDEFYPVCGTDH
KTYKNDCELRCANNKLSAGEQLISIFYQGQCMEYNCDCNCDSEYQPVCGIDNRSYWNLCF
LNCNSVCRQRQKLSPILFVSNGVCPTRRIVAGVNTSIAAVPWQVSLREKTYPICGGSVVT
TLWLLTAAHCLLRPRASELSVRLGSSWKTHGGEMYDVKQSYVHPQYVRNTKVNDVGLIKL
YSPLRFSSRVLPIKMVGKGTRLPADKAAVVSGWGKLKEGGPSATFLQSSTINTIAMKLCR
NSGLDRNPIDPGSMFCAGAFSQPSPDACQGDSGGPIVSEGVLIGVVSWGLGCARGNFPGV
YTRLAAPVIWDWVHEHISQDS