Genomic Position | scaffold401:- 10963-16320 |
---|---|
See gene structure | |
CDS Length | 1500 |
Paired RNAseq reads   | 14121 |
Single RNAseq reads   | 39709 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003569 (5e-78) |
Best Drosophila hit   | trypsin 29F (7e-28) |
Best Human hit | transmembrane protease serine 11F (2e-24) |
Best NR hit (blastp)   | trypsin [Choristoneura fumiferana] (1e-62) |
Best NR hit (blastx)   | trypsin [Choristoneura fumiferana] (4e-63) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR001314 Peptidase S1A, chymotrypsin-type |
Orthology group | MCL18146 |
Nucleotide sequence:
ATGACTCGTGTACTCTTGCTTCTTACGGTTTGTCTGGCTTCAGCGTCAGCTGAATCAGCG
GCCGGAAGAATTGCGATTGGATCAACAGGTTATATCAGCTCCTACCCGTTTGCTGCCAGC
CTGTTGTATTCTAGAATTGGCGTTGGAGTATTTACCCAGGCGTGTGGTGGCTCTATTATA
ACTTACAAAACAGTTCTAACATCAGCTTATTGCTTACATGGTGAATCGCTAAATGCCTGG
CGTGTGAGGGTTGGTTCTTCCAGCAGCAGCAGTGGAGGTGTGGTACATGATTTGTCGCGA
ACTATTGTGCACCCTTTTTACAATACACGCACCTTGGACAATGATATAGGGTTACTCCAT
GTTACTACGCCATTTGATGTTAATAGCAATGTAAAACCGGGGACTATTGCCGGTGCTAAC
TTCTACCTACCTGACAATAGACTTATTGAAGCTATAGGATGGGGACTAAGTAGTAATGGT
GGGCAACCTTCCAATCAGTTACGTTACGATAGAATGTATGTTATTAACAGGTCCACATGT
CAGTCAAGATATTCTGAAATTGGACGTAAGGTTACTAATAATATGGTTTGCGTCGGCAAT
TTGGACGTTAACGGCTATGGACAATGCGTTGGTGATTTTGGCAGTCCAGTTCTATACAAC
TCTGTCATATTTGGCGTCTTCTCGTTTAGCCAGCAATGTGGGCTTGGCCGATACCCCAGC
GTTAATACTTACATACCTAACTACATCGATTGGATAGTTGAAAATACGCTGCCACCACCT
GTGAGAATCGTCGGTGGATCAAACGCAGCAATAACATCATACAGATTTGCTGCCAGTCTT
TTACATTCTAGAATTGGCGTTGGTACTTTTATATATGGCTGCGGTGGGTCAATCATTACC
AACAGAGTGATTCTGACTGCTGCTTATTGCCTTTACAATGAACCTGTATATCGTTGGCGT
GTTCGTGTTGGTTCAGCCAGGTCCAGCACTGGGGGAGTCGTTCATAATACTCTGAGAACA
GTAGTTCATCCAAATTATAATCCACGGACTGCTGACAGTGACATTGCTTTATTGCACTCA
ATGACAGTTTTCGTTTTCAACAATAACGTTAATTTGGTTGGAATTGCTAGCGCAAATTAT
AACCTTCCTGACAATCAGCCTGTTACAGCTATTGGATGGGGAGCTACCAGTCACGGTGGT
CAACTCTCTGATAGGCTCCGTCATGTTGACATTTGGACAGTCAATAGAAACGTTTGCCGG
ACGCGTCATTCTGAGTTGGGATACAGCATTACCGACAACATGCTATGTGCTGGTTGGCTG
GATGTCGGAGGTCGTGGCGCTTGCATTGGTGATACTGGCAGCGCTCTCATTCACCTTACC
GGCAATGTTCAAACTATTGTTGGAGTGTACTCGTGGAGTTACAATTGTGCACTTCCTCGA
TACCCTAGCGTTAACACATTTATTCCCAGATATACTAATTGGATTCTAGCCAATGCATAA
Protein sequence:
MTRVLLLLTVCLASASAESAAGRIAIGSTGYISSYPFAASLLYSRIGVGVFTQACGGSII
TYKTVLTSAYCLHGESLNAWRVRVGSSSSSSGGVVHDLSRTIVHPFYNTRTLDNDIGLLH
VTTPFDVNSNVKPGTIAGANFYLPDNRLIEAIGWGLSSNGGQPSNQLRYDRMYVINRSTC
QSRYSEIGRKVTNNMVCVGNLDVNGYGQCVGDFGSPVLYNSVIFGVFSFSQQCGLGRYPS
VNTYIPNYIDWIVENTLPPPVRIVGGSNAAITSYRFAASLLHSRIGVGTFIYGCGGSIIT
NRVILTAAYCLYNEPVYRWRVRVGSARSSTGGVVHNTLRTVVHPNYNPRTADSDIALLHS
MTVFVFNNNVNLVGIASANYNLPDNQPVTAIGWGATSHGGQLSDRLRHVDIWTVNRNVCR
TRHSELGYSITDNMLCAGWLDVGGRGACIGDTGSALIHLTGNVQTIVGVYSWSYNCALPR
YPSVNTFIPRYTNWILANA