Genomic Position | scaffold1362:- 5828-8899 |
---|---|
See gene structure | |
CDS Length | 1803 |
Paired RNAseq reads   | 514 |
Single RNAseq reads   | 2947 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000840 (2e-49) |
Best Drosophila hit   | CG11192 (8e-18) |
Best Human hit | trypsin-2 preproprotein (2e-18) |
Best NR hit (blastp)   | PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (2e-68) |
Best NR hit (blastx)   | PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (3e-68) |
GeneOntology terms    | GO:0016787 hydrolase activity GO:0007586 digestion GO:0005576 extracellular region GO:0008233 peptidase activity GO:0046872 metal ion binding GO:0008236 serine-type peptidase activity GO:0004252 serine-type endopeptidase activity GO:0003824 catalytic activity GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001254 Peptidase S1/S6, chymotrypsin/Hap IPR000477 Reverse transcriptase |
Orthology group | MCL10014 |
Nucleotide sequence:
ATGGTCAGTTTCGATGTACAGTCATTATTCACTAGTATACCTGTTCTTGACTGCATTGAG
ATTGTAAGAGGTAAGTTAAAGGATAACAATATGCCTATAGAATATGCAGAGCTATTAAAG
CATTGCCTAACATCTGGCTACCTCATGTGGAAGGATGAATTCTACATACAAGTAGATGGA
GTTGCAATGGGTTCACCGGTTTCCCCCGTTGTCGCTGACATATTCATGGAGGACTTCGAG
GTGCGAGCCCTTTGCTCTCCTCCTATAAGACCTTTAATTTATAAACGGTATGTAGATGAC
ACCTTCACAATATTAAATAAAAATAAAACATCTGCTTTTCTGAACCATCTCAATTCTATC
AATAGTAAGATTCAGTGTACTATAGAATTGGAGGCAAATAATTCTTTAGCTTTCCTTGAT
ATACTTGTTGTTAGGAATCCTGACAATACTTTGGGACATACTGTTTATAGGAAACCCACA
CATACGGACAGGTACCTCAATGGTTACTCACACCACCACCCTATCCAGTTAGCTACCGTT
GGCAAATCTTTGTTACAGAGAGCCCAACATCTTTGTGATGCTGACCACCTAGAGGCCGAG
CTGCAGCATGTAAAACATGCTCTCACTATCAACAACCTGCCCGTGCCTCGCCAGCATCGC
AAGAAGCACCTGAAGCCACCCACAGTTGAACGACAACCTGCGATACTACCATATGTGAAG
GGAGTTACTGACAGAATAGGCAACATCTTGAAGAAGGTTTCCATTAAAACTATTTACAAA
CCACATAAGAAAGTGAGCCAATTCTTGAGACCAATCAAGAGTAACATTCCTTTACAACAA
GCGGGTGTATACAAACTCGACTGTGACTGTGTCTTGTCATACATTGGACAGACGAAGAGG
AGCATCGGTACAAGGGTTAAGGAACACATCTCAGATATCAAAAACAGGCGCGCGTCGAAG
TCAGCAGTGTGTGAACACACAATGGACAAACCAGGCCACTACATTCGTTTTGATAAACCT
CAAATCCTCGCTCGGGAAGACAAGTATATACCGAGATTAATTCGCGAGGCTATTGAAATT
AAAAAACATCCCAATTTCAATAGAGAAGATGGCTGGAATCTTTCAAACACCTGGGACCCC
GTTCTTAAAAATATAAAATCCCATGTCCGTAACCACACCGCAGGACCTCAAGACACCGTG
AGCGCATTCTGCCGGCATCCAGAGCGGTACGCCAGAAAATTAAGAAATCGATGGCGTATT
GATCCTACCACAGTGATTCTGAGAGCTGGTAGCACATATCGGGGCAATGGTACTATTATA
CCGATAGATGAGATAGTTGCACACCCAGAATATAACGATTCACCCTTTGATAAGGATGTT
GGCTATATACGAACTTCTAATCCAATACAGTTTACTGGCGCTATGAAGCCCATTCCCCTC
GTAAATGAATCTGAACCGTGCAGTAATAGAGTGAACGTCAGCGGATGGGGTAGACTGATG
GAAGGACAAAATCCCTTGCCTCTAAGACTAAGAGCGGTGAATGTGCCTGTTGTTGATTAT
TTTAGATGTAAGATGGCGTATCCCAGAATATTAACTCGCAACATGGTATGTGTTGGGAAT
TTCGTCTTAGGAGGTCAGGGTACTTGTCAGGGGGATTCAGGAGACGCTGGGGTTGATAAT
GGGAGGGCTTGTGGTATTGTGTCATTTGCAAGAGGTTGTGCACGCCCTATGTCTCCGAAT
GTCTTCACAAATATAGCAGCTGGACCAGTTAGAAGATTTATCACAGATAATACAGGTGTC
TAA
Protein sequence:
MVSFDVQSLFTSIPVLDCIEIVRGKLKDNNMPIEYAELLKHCLTSGYLMWKDEFYIQVDG
VAMGSPVSPVVADIFMEDFEVRALCSPPIRPLIYKRYVDDTFTILNKNKTSAFLNHLNSI
NSKIQCTIELEANNSLAFLDILVVRNPDNTLGHTVYRKPTHTDRYLNGYSHHHPIQLATV
GKSLLQRAQHLCDADHLEAELQHVKHALTINNLPVPRQHRKKHLKPPTVERQPAILPYVK
GVTDRIGNILKKVSIKTIYKPHKKVSQFLRPIKSNIPLQQAGVYKLDCDCVLSYIGQTKR
SIGTRVKEHISDIKNRRASKSAVCEHTMDKPGHYIRFDKPQILAREDKYIPRLIREAIEI
KKHPNFNREDGWNLSNTWDPVLKNIKSHVRNHTAGPQDTVSAFCRHPERYARKLRNRWRI
DPTTVILRAGSTYRGNGTIIPIDEIVAHPEYNDSPFDKDVGYIRTSNPIQFTGAMKPIPL
VNESEPCSNRVNVSGWGRLMEGQNPLPLRLRAVNVPVVDYFRCKMAYPRILTRNMVCVGN
FVLGGQGTCQGDSGDAGVDNGRACGIVSFARGCARPMSPNVFTNIAAGPVRRFITDNTGV