New model in OGS2.0 | DPOGS204736  |
---|---|
Genomic Position | scaffold405:+ 67754-70645 |
See gene structure | |
CDS Length | 2655 |
Paired RNAseq reads   | 351 |
Single RNAseq reads   | 930 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013655 (8e-20) |
Best Drosophila hit   | CG32354 (6e-24) |
Best Human hit | agrin precursor (7e-22) |
Best NR hit (blastp)   | serine protease inhibitor dipetalogastin precursor, putative [Toxoplasma gondii ME49] (3e-77) |
Best NR hit (blastx)   | follistatin, putative [Toxoplasma gondii GT1] (1e-99) |
GeneOntology terms    | GO:0004867 serine-type endopeptidase inhibitor activity GO:0005575 cellular_component GO:0050819 negative regulation of coagulation |
InterPro families    | IPR002350 Proteinase inhibitor I1, Kazal IPR011497 Protease inhibitor, Kazal-type |
Orthology group | MCL19010 |
Nucleotide sequence:
ATGAAGACTGTAATAGGTCTACTGTTTATCTTCGCATCGATATGTTATTTGGATGCGAAA
AGAATAAAAAAAAGGTCATGCATCTGTACAGAACTATACAGTCCTATATGTGGTACTGAT
GGAACCACTTACACGAATAAATGTTTTTTCAATTGCGCCAAAAATACCCACAAAAAACAT
GGTTCCACAAAAGACATATATATAGCCTATGAAGGAAAATGTAGTGACTCATGTATTTGC
AAGGATAACTATTCTCCGGTATGTGGCAGTGACGGCAAAACGTACCCTAATAGCTGTTAT
CTTAATTTTAAAAGTAAAGAAATAGAAAATGACTGTAAAAATAACGGAGACGATCCTGAT
GAAAACAAATTAATAGAAGCATATAAAGGTGAATGTTCCGACGAATGTTTTTGCACAGAT
GAATATGCACCAATTTGCGCTAACAACAATAAAACTTACTCGAATTCCTGCCAACTAGAG
TGTGAGAATAAAAAAAGAAAAAATAATAATTTACCGCCTCTTGTGGTTAAAAGTGACGGT
CAATGTCCCAAACCATGTATCTGCGAAGGAATGTATCAACCGATATGCGGTGACGATGGG
AAAACTTATGCCAATGTTTGTAGTTTAGGATGTATTAATGAAGAAAGACAAAATAATAAC
CTTCCACCAATAAGTAAAAGGAGTGACGGAAAATGTCCGAACATATGTAAATGTCCAAAA
ATATATAAACCAGTATGTGGAAATGATGGCAAAACATATCCAAGTAATTGCAATTTAAAA
TGTATAAACAAAGAGAGAGAAGGAAACAAGCTGTCACCCATTAGAGAAATCAGTAAGGGC
GAGTGTCCAAAAACATGTGTATGCCCTTTTAATTATTTACCTGTATGCGGTTCTGATGGA
GTTACTTATTCTAATGAATGTTTACTTAAATGTGCAAGTAAGGACAATGAAAAGAAAAAC
TTACCACCTATAACTGTTGTAAATGAAACGTCATGCCCAGAATCATGTCTGTGTCCATTA
ATCTATGAGCCAATATGCGGTGACGACGGCAAAACATATTCCAGTAGTTGTGAACTGAGA
TGTAAAAATAAAGAAAGAGAAATAAATAAAGAACTACCAATTAAAAAAGTCAGTGATGGG
GAATGCTCAAAACCATGTCGTTGTCCAAAAATTTATAGTCCCGTGTGTGGTGATAATGGT
GAAACATTTTCTAATAACTGTGAATTAGAATGTGAAAACAAAAAACGCCAAGCTAAAAAT
GAATCACCAATAGCTGTGGTAAGTAAGGGAAAGTGTCCGGAACCTTGTAGTTGCCCAAAA
ATATTCGAACCTGTATGTGGTGATGACGGAATAACTTATTCCAGCAGTTGTGATTTAGGT
TGTGTTAATAAAGAAAAAGAAAAAAATAATGAAGCACCCATCCTTGAGGTTTCCAAAGGT
GCATGCCCAGGTTCCTGTATATGTCCATTAATAATTTCAGAGCCTGTTTGTGGAAGCGAC
GGTCAAACTTATCGTAGTGAATGTGAATTAGACTGTGAAAATAAAATAAGAATAGCAAAA
GATGAATCACCTCTCTCTGTTATTAGCAAGGGTGAATGTCCAAAAGCTTGCGCGTGTCCT
TTAATAGATCTTCCTGTTTGCGGTTCGGATGACGTCACTTACCCTAACGAATGTTCACTT
AACTGTACAAGTGCAGATAATGTAAGAAAAAGTTTACCTGCTATTACTGTGAAAAGCCAA
GGAGAATGTGAAGAGTCATGCATATGTTCAACAAATTATGATCCTATATGTGGTTCAGAC
GGTGTAACTTACTCCAACGAGTGTCAACTAGAATGCAAAAATAAAAAGCGAATCAAAAAC
TCCCTAGATAGAATAGATATTGTAAAAAAAGGAAAATGTAATGGATCCTGCAGCTGTCCT
GCAGATGTCAATCCAGTATGTGGCAGTGACGGACAAAGTTATCCCAATGAATGTCAATTA
GTATGCGAGAGCGATGATTTGGTACGACAGGGGCTTTCAGCTTTAGAAGTCATCGAAAGT
GATCTTTGTGAAGAATCATGCGAATGTTATAACGCAATTATACCAGTTTGCGGGTCAAAT
AATAAATCTTACAGAAATGCTTGCTATTTAGATTGTGCCAACAGAAACAGAAGAGGCAAT
GAAACATCAATTACGATAAAATATAGTGGTGCATGCAGAAGTTGCACTTGCACCCGAGAA
CTTAACCAAGTGTGTGGTAGCGACGGTAATACGTATAATAATCCTTGTCTTTTAGATTGT
GAAAGTGAAAGACTAAAAGGAATAGGAAAATCACCTCTGTATATTATTCACTATGGCGAC
TGTCAAGGATGTGATTGTTCAAATGAATACGAACCTGTCTGTGGAACTGATAACAATACA
TACACAAACTTATGTCAATTACAGTGTGAAAGTAACATTAGACAACGTGAAAATCAGAAA
GAGATAGCTCTCCTCAGCAAAGGAACATGCCCAGAGAGTGATTATGATTGTGAAAATTGC
CCTCTTACGTACCAACCAGTTTGTGGTAAAGATCTTGTAAGCTACTGGAACGACTGCTGG
TTTAAATGTAGTAATAAATGTAAACTGAGTCGTGGGGAAAAACCTATCCCGATGGCTAAA
ACTGGATGCTGTTAA
Protein sequence:
MKTVIGLLFIFASICYLDAKRIKKRSCICTELYSPICGTDGTTYTNKCFFNCAKNTHKKH
GSTKDIYIAYEGKCSDSCICKDNYSPVCGSDGKTYPNSCYLNFKSKEIENDCKNNGDDPD
ENKLIEAYKGECSDECFCTDEYAPICANNNKTYSNSCQLECENKKRKNNNLPPLVVKSDG
QCPKPCICEGMYQPICGDDGKTYANVCSLGCINEERQNNNLPPISKRSDGKCPNICKCPK
IYKPVCGNDGKTYPSNCNLKCINKEREGNKLSPIREISKGECPKTCVCPFNYLPVCGSDG
VTYSNECLLKCASKDNEKKNLPPITVVNETSCPESCLCPLIYEPICGDDGKTYSSSCELR
CKNKEREINKELPIKKVSDGECSKPCRCPKIYSPVCGDNGETFSNNCELECENKKRQAKN
ESPIAVVSKGKCPEPCSCPKIFEPVCGDDGITYSSSCDLGCVNKEKEKNNEAPILEVSKG
ACPGSCICPLIISEPVCGSDGQTYRSECELDCENKIRIAKDESPLSVISKGECPKACACP
LIDLPVCGSDDVTYPNECSLNCTSADNVRKSLPAITVKSQGECEESCICSTNYDPICGSD
GVTYSNECQLECKNKKRIKNSLDRIDIVKKGKCNGSCSCPADVNPVCGSDGQSYPNECQL
VCESDDLVRQGLSALEVIESDLCEESCECYNAIIPVCGSNNKSYRNACYLDCANRNRRGN
ETSITIKYSGACRSCTCTRELNQVCGSDGNTYNNPCLLDCESERLKGIGKSPLYIIHYGD
CQGCDCSNEYEPVCGTDNNTYTNLCQLQCESNIRQRENQKEIALLSKGTCPESDYDCENC
PLTYQPVCGKDLVSYWNDCWFKCSNKCKLSRGEKPIPMAKTGCC