New model in OGS2.0 | DPOGS206620  |
---|---|
Genomic Position | scaffold551:+ 89171-98661 |
See gene structure | |
CDS Length | 1191 |
Paired RNAseq reads   | 785 |
Single RNAseq reads   | 2142 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA008515 (2e-27) |
Best Drosophila hit   | jonah 74E (3e-46) |
Best Human hit | transmembrane protease serine 3 isoform 1 (4e-28) |
Best NR hit (blastp)   | chymotrypsin [Helicoverpa armigera] (2e-71) |
Best NR hit (blastx)   | chymotrypsin [Helicoverpa armigera] (4e-73) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL16739 |
Nucleotide sequence:
ATGGGTTACAGTTCCTATACATCCATCTGGTGTGTTATTTTTACCACATTATATTTGCAG
GCTGGTCTCATAAGCGATATTCTTGGTAGCTCAGGACAAGAGGTATGTGGAGGTTCTCTA
CTTAGCACAAACCGAGTGTTAATGGCTGCTCATTGCTGGGACGATGGTAGAAGCAAAGCT
TGGAGATTCACTGTTGTTCTTGGTTCGGAAACGCTTTTCACAGGCGGTACTAGAATTCTT
ACCAGCGATGTGGTCACCCATCCACTGTGGAATCCTGCATTTGTCCATAATGACGTTGCC
ATGATTCGTTTACCCGAACATGTTGTTCTATCCGTTGACAGTATCAATAAAATATATTTT
GTGAATATGGCTGTCATCACTAACGAAATATGCAACGTCGCTGTCTTCGGTGACATTCAA
TCATCGAACATTTGCACGAACACCCTCGGTGATAAGAGCACTTGCCGTGGTGACTCTGGT
GGCCCACTTGTTGTCCAAAGACGTGTCAGGACTGTTCTGGCTGGTCTCATCAGTGATATA
ACGGGAATAGAAGGAAGAGGTGTTTGCGGAGGATCTCTCGTCTCGACAAACCGGGTAATT
ACAGCTGCTCATTGCTGGGACGATGGTAGAAACAAAGCCTGGAGATTCACTGTTGTTCTT
GGATCGGAAACGCTTTTCACAGGCGGTACTAGAATCCTTACCAGTGACGTCGTGATGCAC
CCACTGTGGATCCCAATACTGATCCTCAATGACATCGCTGTCATCCGTTTGCCAGAACCT
GTCGCTTTGTCCGATACCATCGGAACCGTTTCTTTGCCCACTGGATTAGAAATCTTTGAC
GACTTCAACGGTCAAGTCGCTATTGCCTCTGGCTATGGACTTACACAAGACGGCGGCAGT
ATCAGCAACAGCCAATTCTTAAGTTACGTCAATATGTCAGTCATTACAAATGAAGTTTGC
AACATCGCTTTCTTCGGTAACATTCGGCCTTCGAACATTTGCACCAGCACCCAAGGTGGT
AAGAGCACTTGCCGTGGTGACTCTGGTGGTCCACTTGTTGTCCAAAGACGTGACAGAACT
GTTCTGGTTGGCGTTACCTCATTTGGAATTGCTTTTGGTTGCGAAATCGGATGGCCAGCA
GCCTTTTCAAGAATTACATCATTCCTTGGATTTATTAATGACAATTTATAA
Protein sequence:
MGYSSYTSIWCVIFTTLYLQAGLISDILGSSGQEVCGGSLLSTNRVLMAAHCWDDGRSKA
WRFTVVLGSETLFTGGTRILTSDVVTHPLWNPAFVHNDVAMIRLPEHVVLSVDSINKIYF
VNMAVITNEICNVAVFGDIQSSNICTNTLGDKSTCRGDSGGPLVVQRRVRTVLAGLISDI
TGIEGRGVCGGSLVSTNRVITAAHCWDDGRNKAWRFTVVLGSETLFTGGTRILTSDVVMH
PLWIPILILNDIAVIRLPEPVALSDTIGTVSLPTGLEIFDDFNGQVAIASGYGLTQDGGS
ISNSQFLSYVNMSVITNEVCNIAFFGNIRPSNICTSTQGGKSTCRGDSGGPLVVQRRDRT
VLVGVTSFGIAFGCEIGWPAAFSRITSFLGFINDNL