New model in OGS2.0 | DPOGS201312  |
---|---|
Genomic Position | scaffold1094:+ 65833-70006 |
See gene structure | |
CDS Length | 1302 |
Paired RNAseq reads   | 13 |
Single RNAseq reads   | 124 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003104 (1e-39) |
Best Drosophila hit   | CG3355, isoform A (8e-18) |
Best Human hit | vitamin K-dependent protein C preproprotein (1e-12) |
Best NR hit (blastp)   | GJ23363 [Drosophila virilis] (8e-21) |
Best NR hit (blastx)   | GK23752 [Drosophila willistoni] (1e-20) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL39723 |
Nucleotide sequence:
ATGAAACACTTAAAAGTTCGTAAATTCTTAGTACATCACAAATACTGCGACACCTCTCTG
GTAAATGACATCGGTTTGACATATTCAAACGCACCGGTAAAGTTTGGGGCGAACGTGAAG
CGTGTTGCGCTGCTGAGTATATTTCCAAGACGAGCAACTCACGGTTACGTTACAGGATGG
GGATTGGTTAATACAGACCCTGAAGAACTAGCTACGTCCATGAAATATGTTCAACAAGAC
ATAATAAAACCCAAAGAATGCAGTCGTGGAAACGTGCCTGCTGGAATCTTCTGTGGCCAA
TCCATGAATGATGGGGAAAATCCAGAAATTTCATCGAGGCACTGGATTTGCGTACTAGTA
AAAATTAGAAAAAATGTGAAGGGCTGGTCATCGTCTGCTTGGACACCTTTAACGAGACGA
AACTTCTTAGCATTCAATGGATCCATCGTGCGGGTGCCGTTCCGTCGCGTTGCAGGGAGA
GGATCTTCAACAGTGCCTGGGACTTGCGACCCAGGTGTAGGTTCAAGGGGCGCCTATCTA
ACATGCGTTGATCGCTCACCTCCACTAAAGGTTTTTTCATCTAAACCAAATGAATTAGAA
GGGAGGGTAGTCAGAGGAGACGTTGTGTCGATCGAGGACTTCCCATATTCAGCGTTTCTG
TTGATGGGTAGAGAGAGGGGCAGCTTTATATGTGGTTCATCCATCATCAATCAGAGAATC
TTATTGACGGCAGCACATTGTATCGAAATATGCAATCCCAAGTGCAAGAACGGAGCGGCA
TTTGTTGGAAATGAACAAAAGAGGATGGGAATCAAAATGACTATAACATTCGCAAAATAC
CACCCCAGATATAGAACAAATCGTGTGCACTTTGATATAGGTCTTGCATTGCTTTCTAGA
TCTATAAAGTTTGGTAAATTTGTTAAACGGGTTGCCATTTCAAGGCGTCCGAGGATAAAA
TCTGTCGCTGATATAGCTGGTTGGGGTTTAGTTGATGAAATAAACAAATTGTCGACAGAT
TACTTGCATCATATAACGCAAAAGGTGATAAGTCATAGTGATTGTAAGGCCTATATATCC
AATATTCCTCCAGGCTCTTTCTGCGCTGGTGAGATTAAGAGCAGGCAGTTTGCATCAGAA
GGGGACTCTGGCAGTGCTTTAATAATCAACAAGTACACGCAAATCGGTATCGTGTCTTAT
AAACGGCCGGACATATCGGCCAGTCTTATTGTATATACAAACGTCTCATTCTATTACGAC
TGGATAAAACAAACTTCGAGAAAATTGTACTGCGACTATTAA
Protein sequence:
MKHLKVRKFLVHHKYCDTSLVNDIGLTYSNAPVKFGANVKRVALLSIFPRRATHGYVTGW
GLVNTDPEELATSMKYVQQDIIKPKECSRGNVPAGIFCGQSMNDGENPEISSRHWICVLV
KIRKNVKGWSSSAWTPLTRRNFLAFNGSIVRVPFRRVAGRGSSTVPGTCDPGVGSRGAYL
TCVDRSPPLKVFSSKPNELEGRVVRGDVVSIEDFPYSAFLLMGRERGSFICGSSIINQRI
LLTAAHCIEICNPKCKNGAAFVGNEQKRMGIKMTITFAKYHPRYRTNRVHFDIGLALLSR
SIKFGKFVKRVAISRRPRIKSVADIAGWGLVDEINKLSTDYLHHITQKVISHSDCKAYIS
NIPPGSFCAGEIKSRQFASEGDSGSALIINKYTQIGIVSYKRPDISASLIVYTNVSFYYD
WIKQTSRKLYCDY