New model in OGS2.0 | DPOGS209266  |
---|---|
Genomic Position | scaffold610:+ 63639-68691 |
See gene structure | |
CDS Length | 1863 |
Paired RNAseq reads   | 1225 |
Single RNAseq reads   | 4390 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA007046 (3e-98) |
Best Drosophila hit   | Ser7 (4e-35) |
Best Human hit | tissue-type plasminogen activator isoform 1 preproprotein (4e-21) |
Best NR hit (blastp)   | prophenoloxidase activating proteinase-2 [Manduca sexta] (6e-41) |
Best NR hit (blastx)   | prophenoloxidase activating proteinase-2 [Manduca sexta] (5e-41) |
GeneOntology terms    | GO:0008236 serine-type peptidase activity GO:0004252 serine-type endopeptidase activity GO:0006508 proteolysis |
InterPro families    | IPR009003 Peptidase cysteine/serine, trypsin-like IPR001314 Peptidase S1A, chymotrypsin-type IPR018114 Peptidase S1/S6, chymotrypsin/Hap, active site IPR001254 Peptidase S1/S6, chymotrypsin/Hap |
Orthology group | MCL40123 |
Nucleotide sequence:
ATGTTTGAATTCTGGTTTAAATATGACGTCTGGACGCACGACAAGGAACTGCAGAGTTCA
GCGAGTGCGGATCATAAAGAATATATTTGTGACAAATGTGTCCGTATTTCTGATTGCCCA
GCCTTTGCGAAAATGAATTCCCGACAACAACAGGCATGGCTTCAACAATTTCCTTGTAAA
GGCCCAAGTGATAGCGAAAGACCCTCGATTTTTGGATTCTCACCCGTTGCGAAAGGAGAC
TATGTATGTTGTCCAAATTCTAATATTTGGGGAATAGATAATGGGTACCAAAACCAACAT
CAGAAGCCTGTTCCAATAAGACCTAATGAAAGGGGACATCGTAACTATCAAAGTCCTGAT
TTTAATAATCCAGGTGCATTTACTGGACAGCAAAACAATTTACCAGGAAATCCTGATTTT
GAAAATGGTATGGGAGTTGATACAAAAATACCCCATCCATTTGGAAAACATCCAGGAACT
TTTGGTGGAGACTTTAATGGAAACCCACAAAATGGACAAAATAATCAAGGCATATTTGAC
AATATGCCAAACTTAAATCTTCCGCAATTCCCTAACAATGGAAATAACTTTGGTGGCCAA
CCGCAAAACGGGCAATATCCAAATGGTCAATTTCCAAATAATAATCAGTTTCTCAATAGT
AATTTCCCAAATAGTCAATTTCCAAATGGTCAATATCCAAGTTATCAATTTCCAAGCAGT
CAATCTCCAAACAGCCATTTTCCGAATAATCAGTTCCCAAATGATCAATTTCCTAGCTTT
TCTGGACAAACACAATATCCCAATAGTGGGGAACAAAATAGCGGGATTTTCAATCAAGGT
GGATACCAACAATGTCCGTCTCATACAAACATGATTCCAGATCCTTCTGCAGGTTGTTGC
GGAAAAGATGACTCTGATTCTGTAAGAATAACAGATTTACAAAAAGTCCTTAGCATGTAT
GCACCTGATAACTCCAATAGATATCCAAGGCCTAACTATTCTCCACGCCAAAAACCGCAA
AGATATCCATACTATCAAAATAGGCAAAAACGATCCTTTGATCAAAATAACACATCGGAT
GATAGTCTAGAAGATAGAATAGCAGGTGGAAAAGAAACCGAATTAGATCAGTTTCCATGG
ACTGCTCTATTGAAGGTAACCTTCGATTATGGTAACAGAGAAGCTGCTTTTAGTTGCGGT
GGTTCTCTGATAAGCCAACGATTTATCCTAACTGCTGGTCACTGCGTTTATGAATCTGGA
GCAAAAGTATCAAGCGTTGAAATTACACTAGCTGAGTATGACAAAAGAACCTTTCCCAAA
GACTGCATATCGGAAATGGGCGGAAGGCGAGAATGTATTGAAAATATAAGAATGTATTCG
GAAAATATTATACATCATCCTGAATATGATGACGATCAGCTACATAATGATATTGCACTT
ATAAAAATTCGTGGATATGCTCCCTATACGCGTTTTATAAGGCCTATCTGCCTTCCGCCG
TTAAATATCGATGACCCTGATTTATCAAACCTTCCCCTCTCTGTGGCGGGATGGGGTCGC
AACGGTGCTTATGAAACTAATATCAAACAATCGACTGTAGTTCATTTGGTGCCCCATGAC
AAATGTTTGAAGTCATATCCTCAATTGACGTCTTCTCACCTATGTGCAGCCGGTCGCACC
GGTGAAGATACTTGTAAAGGCGACTCAGGAGGTCCTTTAATGATGTTATATCGAGGAAAT
TATTATATTATTGGTGTTGTTAGTGGCAAAAGAGCTGACAGTCCATGTGGAACGTCAGTA
CCTTCACTTTACACGAATGTCTATCAATATGTACCTTGGATAACAAGTAGTTTAAGAAAT
TGA
Protein sequence:
MFEFWFKYDVWTHDKELQSSASADHKEYICDKCVRISDCPAFAKMNSRQQQAWLQQFPCK
GPSDSERPSIFGFSPVAKGDYVCCPNSNIWGIDNGYQNQHQKPVPIRPNERGHRNYQSPD
FNNPGAFTGQQNNLPGNPDFENGMGVDTKIPHPFGKHPGTFGGDFNGNPQNGQNNQGIFD
NMPNLNLPQFPNNGNNFGGQPQNGQYPNGQFPNNNQFLNSNFPNSQFPNGQYPSYQFPSS
QSPNSHFPNNQFPNDQFPSFSGQTQYPNSGEQNSGIFNQGGYQQCPSHTNMIPDPSAGCC
GKDDSDSVRITDLQKVLSMYAPDNSNRYPRPNYSPRQKPQRYPYYQNRQKRSFDQNNTSD
DSLEDRIAGGKETELDQFPWTALLKVTFDYGNREAAFSCGGSLISQRFILTAGHCVYESG
AKVSSVEITLAEYDKRTFPKDCISEMGGRRECIENIRMYSENIIHHPEYDDDQLHNDIAL
IKIRGYAPYTRFIRPICLPPLNIDDPDLSNLPLSVAGWGRNGAYETNIKQSTVVHLVPHD
KCLKSYPQLTSSHLCAAGRTGEDTCKGDSGGPLMMLYRGNYYIIGVVSGKRADSPCGTSV
PSLYTNVYQYVPWITSSLRN