New model in OGS2.0 | DPOGS201112  |
---|---|
Genomic Position | scaffold258:- 38376-40402 |
See gene structure | |
CDS Length | 1491 |
Paired RNAseq reads   | 80 |
Single RNAseq reads   | 196 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013679 (2e-114) |
Best Drosophila hit   | CG7829, isoform A (1e-21) |
Best Human hit | kallikrein-14 preproprotein (3e-16) |
Best NR hit (blastp)   | PREDICTED: similar to Trypsin alpha [Tribolium castaneum] (3e-24) |
Best NR hit (blastx)   | trypsin-like serine protease [Ctenocephalides felis] (2e-24) |
GeneOntology terms    | GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001254 Peptidase S1/S6, chymotrypsin/Hap |
Orthology group | MCL40886 |
Nucleotide sequence:
ATGGCAACAATTCAAATATTTAATAACTTTCATTGCGCCGGATCCATTATTAAGTCGGAT
CTTATAATCACAGCCTCTTCTTGTTTACAATTGGCTTATAATAATCGTCTGTTCCGAGAA
AATCCAGCGTTTCTGTCAGCTCGAGTCGGCAGTAGCTTTTATAACGGTGGAGGTGAAGTC
ATATCTGTGCAGGAGGTCTACTTCCATCCTTCCTACGATCCAAAGACTTTGAGGAACAAT
ATCTGTCTCCTCCGACTAGCACGCCATCTGAAATTTAGGAGAAAAATCAGAAGCGTAAAA
AAAATTGATTTTGATAGACACGAGTCCACTCTCTCTATGACTACATCTGGAATTACTATA
GTGGGTTGGGGTGCCAAAGAGCACAGTCCGATAATTGGCAGTCCATGGAAAAACATATTG
TCTTTCGCTGAATTACATGTGTATCCTTTAGAAGATTGTCAAGATGTTTATTCCAAAGCT
TACGTTACGAAAAAAAACTTTTGTGCTGGTTTTATATCTAGAGGAGGAGGAGCCTGCAAT
CGTGACGTAGGTGGTCCTGGTATAGTTGAAAATAAGCTGATGGGAATCATAAGCTTCGGA
TCTCCGGTTTGTGGCTCTCCCGATATGCCAACAGTGTTCACGAAAGTGGGGTATTATACT
GATTGGATTGAAGAAATTATGGAACAGCCGGTAATTATTTCGAAGAAAAGGACTACTCTA
AAATCAGACTTCAACCCATTTTTAGCTCAACCAATTCATATTGAACCGGATCAAACCACA
TTTAAGATACCACCTTTGACTGGTGAAAAAATGAAGCCAATACCTATTACAGAAATAGAT
GGTCAGCTTAGAATATCTGATGAGAAACTGTTTAAAGAATTCCTAGCCACTATGTTCAAT
AGCCAGGAAATTGCTGAATATGAGGACATAATAAATCCAGACAATGGTGACATCGAGATT
AATGATATGATACTCAATGATGATAAGGTCTTAGAGGAAACGGCAACTGAGAACATTCCA
GCAGATCAAACCATGATAATTCCCGTTGAAGAGGATACAGAAGTACAGGAAGAAGTAGAG
AATCAAACACAAATAAATGAAGTTTCAAACAAAAGCTTAGAAGAAAGTGAAAAAGAAGGC
AATAAATACGAAATGAATACACCTGCTATAGAAGATGAACCAGCGGGCGTTGATAATTTA
AATAAGGATTTAGCCAACTTACTTGAAAATGTACAAGATGATGGCGGGTTAGGACCTAAG
AAAGATGAAGATAACAACGTTCAGACTGACGATCAAAAGGTTTTGACACTTTTATATTTA
TCTGACGAGGACAAAAAGAGTAATGGAGGATTGAGCATACCAACGGAACATTTTGAGGAT
TTAACGAGAAGTAAACAAAATGTTCTGAATATTTTACCAGAGAATGAACTATATGCTCTT
TTATCGGAAGTTATACAAGACGAGGTTGAGAAAATAAACGCTGGGACTTGA
Protein sequence:
MATIQIFNNFHCAGSIIKSDLIITASSCLQLAYNNRLFRENPAFLSARVGSSFYNGGGEV
ISVQEVYFHPSYDPKTLRNNICLLRLARHLKFRRKIRSVKKIDFDRHESTLSMTTSGITI
VGWGAKEHSPIIGSPWKNILSFAELHVYPLEDCQDVYSKAYVTKKNFCAGFISRGGGACN
RDVGGPGIVENKLMGIISFGSPVCGSPDMPTVFTKVGYYTDWIEEIMEQPVIISKKRTTL
KSDFNPFLAQPIHIEPDQTTFKIPPLTGEKMKPIPITEIDGQLRISDEKLFKEFLATMFN
SQEIAEYEDIINPDNGDIEINDMILNDDKVLEETATENIPADQTMIIPVEEDTEVQEEVE
NQTQINEVSNKSLEESEKEGNKYEMNTPAIEDEPAGVDNLNKDLANLLENVQDDGGLGPK
KDEDNNVQTDDQKVLTLLYLSDEDKKSNGGLSIPTEHFEDLTRSKQNVLNILPENELYAL
LSEVIQDEVEKINAGT