New model in OGS2.0 | DPOGS213461  |
---|---|
Genomic Position | scaffold4038:+ 7370-11324 |
See gene structure | |
CDS Length | 1251 |
Paired RNAseq reads   | 246 |
Single RNAseq reads   | 727 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004487 (2e-103) |
Best Drosophila hit   | CG13430, isoform B (9e-18) |
Best Human hit | kallikrein-13 precursor (2e-15) |
Best NR hit (blastp)   | GE15163 [Drosophila yakuba] (1e-21) |
Best NR hit (blastx)   | trypsin 2 [Culex quinquefasciatus] (3e-18) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR009003 Peptidase cysteine/serine, trypsin-like IPR001314 Peptidase S1A, chymotrypsin-type IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site |
Orthology group | MCL39860 |
Nucleotide sequence:
ATGAACTGGGTTCTCGTTTTTATGACTCTATCAATGTTTTATAGCTATGTTCTGAGCTAT
GGAGACCAGGCCTCAGTAGTTAAATTTAAATTCAATCTGGACCCGTATGGAGAGGCTCCG
CTCCAAGATGGCCGACGGAGACATCGAGATAACAAAAAATCATTAAGAGTCCGAAACGAC
TTCCTGTTCAGTTTAAACAAAGACGCTCTCAGGATCCGGGGGGGGAATGCCACGGATACG
ACCAACTATCCGTACATAGCGGCCATTATAATCAACGGCAGGTTATGGTGCGCCGGCACC
ATCGTCGACGTCAACTGGGTACTGACAGCGGCGCATTGTCTGAATTACGTGCTTCACGTA
GCGCCAATGAAGACCCTGGGGCAGTACGTGAAGGTCAGGGTCGGCAGCGCCCAGGCTCAC
GAAGGAGGTTTGCTGGTAGACGTCGCGGGGGCCGTGCGACACCCGAAATTCGAAGAGGAA
CCCGTGCCTCATGCTGATGTAGCTTTATTGAAACTGACTGAAAACCTTGAATTCTCAACT
CACATCAATCTGATTAAAATAAACGAAGATATGAGAGAGCCTTACGCGCAGAGTTTCGTG
TCTGTAACCGGCTGGGGAGCGACCCGTGGCACAGACACAGCCTTCAGAGAACACACGCCC
GACCTGATGACGGCTCGTCTCAAGGTTCGCACGGTCAACTACTGCAGAGACGCGTACCAA
CTGGTTAGCGGGTTTCAGTTCACCGCAGACTTCTTCTGCGCTTCGTTGAGAAACGGCACC
AGAGACGCGTGTTTGGGCACAGACACAGCCTTCAGAGAACACACGCCCGACCTGATGACG
GCTCGTCTGAAGGTTCGCACGGTCAACTACTGCAGAGACGCGTACCAACTGGTTAGCGGG
TTTCAGTTCACCGCAGACTTCTTCTGCGCTTCGTTAAGAAACGGCACCAGAGACGCGTGT
TTGTTCGACGCGGGCGCGCCAGCCACCCAACACAACAAATTAATGGGCGTCATGAGCTTC
GGGCCCGAGCGTTGCGGACACGAATACCAACCAGCGGTGTTCATTAAGGCTTTTTATTTC
AGGGATTTCGTGAAGCACACTATATCCTCATATAAGACTACAGCTGAACTTATAGAAGCC
ATGAAAGATATCGACAAAGTTATCAGACCACCCGTTCATGTGAAACAGGAACACGTGGTC
GTCGAGAAAGATGAGCAAGAGGTCACGGAACCAGATTATAAACACGATTGA
Protein sequence:
MNWVLVFMTLSMFYSYVLSYGDQASVVKFKFNLDPYGEAPLQDGRRRHRDNKKSLRVRND
FLFSLNKDALRIRGGNATDTTNYPYIAAIIINGRLWCAGTIVDVNWVLTAAHCLNYVLHV
APMKTLGQYVKVRVGSAQAHEGGLLVDVAGAVRHPKFEEEPVPHADVALLKLTENLEFST
HINLIKINEDMREPYAQSFVSVTGWGATRGTDTAFREHTPDLMTARLKVRTVNYCRDAYQ
LVSGFQFTADFFCASLRNGTRDACLGTDTAFREHTPDLMTARLKVRTVNYCRDAYQLVSG
FQFTADFFCASLRNGTRDACLFDAGAPATQHNKLMGVMSFGPERCGHEYQPAVFIKAFYF
RDFVKHTISSYKTTAELIEAMKDIDKVIRPPVHVKQEHVVVEKDEQEVTEPDYKHD