New model in OGS2.0 | DPOGS211874  |
---|---|
Genomic Position | scaffold10791:+ 896-3543 |
See gene structure | |
CDS Length | 1395 |
Paired RNAseq reads   | 32 |
Single RNAseq reads   | 106 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA001245 (2e-76) |
Best Drosophila hit   | CG17739 (5e-19) |
Best Human hit | kunitz-type protease inhibitor 3 precursor (3e-09) |
Best NR hit (blastp)   | AGAP012307-PA [Anopheles gambiae str. PEST] (4e-40) |
Best NR hit (blastx)   | AGAP012307-PA [Anopheles gambiae str. PEST] (1e-40) |
GeneOntology terms    | GO:0005576 extracellular region GO:0004867 serine-type endopeptidase inhibitor activity |
InterPro families    | IPR002223 Proteinase inhibitor I2, Kunitz metazoa IPR000884 Thrombospondin, type 1 repeat IPR020901 Proteinase inhibitor I2, Kunitz, conserved site |
Orthology group | ND |
Nucleotide sequence:
GCTCGTCGAAATCCGACAAACCCAAAAGCTCCAGTCCGAGCTCTCAGACAGGAGTGGCCC
AGAGACACTCGCTCGGGGCCCTTCTTGAGCTCGTTCGGTGAATTGCGACCATTTGCTCGG
CTGAGGCTGACGAGATTGAGACTTTACGAGAAAAGCTGTGACGTCACAGAGCCTACTGGC
GTGCAAGTCCCCGAGGGGGCTGCCGTTCCCGGCGGTTCATGTGCTACGTACGGTTGGGGC
GGTTGGAGCCCCTGCAGCGTGTCGTGCGGCACGGGGCGAAGCACCCGCCAGCGCCGGTAC
ATGTGGCCACTTCGCGCCCAGCACGACGCCTGCCGCACTACACTCACGGAATACAGATCC
TGTCATGGACCGAGACTGCACTGCAGAGTGAAGTCTGACTACGAGCCTGAGGCGGCGGAC
TCATCAGGTCCTTGCGCCATGTCTCCCTGGTCGGAGTGGTCTCCGTGTGTAGGGTGCGGA
GTGCGAGCTCGCACCAGGCACTACCTCGCTCCGCGCGCACACAAGCGGTGCCATGTGGGG
TTCCGCGCAAGAACCGTCATGAGCCAGGCCATGCCGTGCGACGCGGGACCCTGTTACAAA
CCCTTCCACGGGGTGGGCCAACTAACGTCGTATATACTCGGAGGATACAATGTTGTTGTA
AAATATTGCACTTACCAGGTTTTACTTGATTACAGACCTCCCAAAGAACGCGCGAATGCA
ACAAACTTTGATTGGTTTTTTGAGCTCATGATGGCAGGCTATGGTCACGGTACAATGACG
TCACCTGATGTCCAGGAAAGCCCCAAGTCGGACTGTCCGGTGACTCCGTGGTCCACGTGG
TCGCCGTGTTCCTCGCGCTGCGGCCGAGGCCGGCGACTCCGCACCAGGATGTACGTGGTG
CGGGAGACGAACCTTCAGCGAGAGATCACTAAGAGGCTGTTAAGGGACTGGAACCAACGG
TTCGCTGAACTACAGAATTTGGAACTGCCTCATGAGAACATAACCAGCGAGGACCCGTCC
CTGGACGCCGCGGTCCAGGAACATCTCGACAGATGTCAGTTCACGATGACGCAGCAGGAG
GCGCTGTGTGACGGAGGCGACGGAGGCTGCTCCGACAACCCTTCACCCAATGAGATCTGC
GCGCTGCCGGTGTCGGTGGGGCCGTGTCGGGGCTACGACGAGCGCTGGTTCTTCGACCAC
CCGCGAGTCGCGTGCGAGCCCTTCGGGTACACCGGCTGCGGCGGGAACGAGAACAACTTC
AGGACCCGGCAGATCTGCGAGCAGACTTGTCTGTCGCAAGGAAACAAACACGACAACGGT
AGTGGGGCTTCTGTAGGGAATTTATCCCAGTCGTTCTCATCTCGTTCTAAGCGATCGCGG
TTACCGCAAAGTTAA
Protein sequence:
ARRNPTNPKAPVRALRQEWPRDTRSGPFLSSFGELRPFARLRLTRLRLYEKSCDVTEPTG
VQVPEGAAVPGGSCATYGWGGWSPCSVSCGTGRSTRQRRYMWPLRAQHDACRTTLTEYRS
CHGPRLHCRVKSDYEPEAADSSGPCAMSPWSEWSPCVGCGVRARTRHYLAPRAHKRCHVG
FRARTVMSQAMPCDAGPCYKPFHGVGQLTSYILGGYNVVVKYCTYQVLLDYRPPKERANA
TNFDWFFELMMAGYGHGTMTSPDVQESPKSDCPVTPWSTWSPCSSRCGRGRRLRTRMYVV
RETNLQREITKRLLRDWNQRFAELQNLELPHENITSEDPSLDAAVQEHLDRCQFTMTQQE
ALCDGGDGGCSDNPSPNEICALPVSVGPCRGYDERWFFDHPRVACEPFGYTGCGGNENNF
RTRQICEQTCLSQGNKHDNGSGASVGNLSQSFSSRSKRSRLPQS