New model in OGS2.0 | DPOGS215100  |
---|---|
Genomic Position | scaffold4222:+ 121-4616 |
See gene structure | |
CDS Length | 1128 |
Paired RNAseq reads   | 689 |
Single RNAseq reads   | 2486 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009610 (2e-35) |
Best Drosophila hit   | CG3700, isoform A (2e-33) |
Best Human hit | transmembrane protease serine 2 isoform 2 (2e-28) |
Best NR hit (blastp)   | PREDICTED: similar to snake CG7996-PA [Apis mellifera] (3e-41) |
Best NR hit (blastx)   | PREDICTED: similar to snake CG7996-PA [Apis mellifera] (4e-43) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR009003 Peptidase cysteine/serine, trypsin-like IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL23866 |
Nucleotide sequence:
ACATGTAAATTTCAAGGTGATCAAAGAATAGTCTGCTGTCCTGAGACCGACCTGTTCCAT
GAAACGGGAGTCTTCAAGCATCACTTCATTGGGCTTGCCAAAGGAGTCAAGAAATCTAAG
TACATGACCTGTCGCTACGATGGCTACCAGCCCTTGCAATGTTGTGAGAACGCTAAACCC
GTCACCATTCCACCAGAACCGGCAACTTGTCCAAGCCTCCCAAGACCCCTGCTGGCCAAA
AACCACATCGCCTGGACTAAATGCGTTGACTACCAGCGTTACATCCACAAGTGCGTGCCA
GTTGATCCAATCAACCAACCATACAAAATGCAAAGGGTAAACACTTGCGGCATCAGCAAC
TCCAATTTTAGGATATCCGGTGGCGTTGAAGCTAAACCCAGAGAGTTTCCGTTCATGGCC
GTCATCGGCTGCCACAATTCCCTGGACGTGGACGCCGACATCAAGTGGGTAGGCGGAGGC
TCGCTGATCAGTGAGAAGTTTATACTCACGGCTACTCACATATTGAGTGAACCGACTTAT
GGCCGCGTACGGTACGCCTTGCTTGGCACTTTGAATAAGACAGACATAAGGTCCGGAGTC
CTTTACAATATCGTGTCTATGATCGCGCACCCTGAATACGACATTCCCGTTAAAGCGAAT
GACATAGCGCTCCTGGAGCTAGACAGACAGGTCTTTTTCAATGAATTCATTCACCCCGTC
TGTCTCCCGGTGCCGGGCAGATATATTACAAATGACTATATTGTTGCCGGTTGGGGCGAA
AACAACAACAGATACAGCAGTGACGTGCTGTTAACTGCGAGACTGCGACCCAGCGATGAA
TGCAAGAGCAGAATAGTAAGAAAAGACTTCGTTTATTCGAATGAGAAGTATATCTGTGCT
AAAGGAGAGCTGGAAAAGGGCGTCTATCAAGACACCTGCAAGGGCGACAGCGGAGGCCCG
TTGTTGGCTCTGATGTTTAATATAAACTGCTCCTACTCCTTGGAGGGTATCGTCAGTTTT
GGACCCGAATGCGGCAAAGGCTTTCCAGCGGTTTACACCAAAGTATCCAACTATTTGGAT
TGGATAGTTGAAAACGTATGGCCCGATAAGGTCAACAAAAAGCAATAA
Protein sequence:
TCKFQGDQRIVCCPETDLFHETGVFKHHFIGLAKGVKKSKYMTCRYDGYQPLQCCENAKP
VTIPPEPATCPSLPRPLLAKNHIAWTKCVDYQRYIHKCVPVDPINQPYKMQRVNTCGISN
SNFRISGGVEAKPREFPFMAVIGCHNSLDVDADIKWVGGGSLISEKFILTATHILSEPTY
GRVRYALLGTLNKTDIRSGVLYNIVSMIAHPEYDIPVKANDIALLELDRQVFFNEFIHPV
CLPVPGRYITNDYIVAGWGENNNRYSSDVLLTARLRPSDECKSRIVRKDFVYSNEKYICA
KGELEKGVYQDTCKGDSGGPLLALMFNINCSYSLEGIVSFGPECGKGFPAVYTKVSNYLD
WIVENVWPDKVNKKQ