DPGLEAN11621 in OGS1.0

New model in OGS2.0DPOGS204736 
Genomic Positionscaffold405:+ 67754-70645
See gene structure
CDS Length2655
Paired RNAseq reads  351
Single RNAseq reads  930
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013655 (8e-20)
Best Drosophila hit  CG32354 (6e-24)
Best Human hitagrin precursor (7e-22)
Best NR hit (blastp)  serine protease inhibitor dipetalogastin precursor, putative [Toxoplasma gondii ME49] (3e-77)
Best NR hit (blastx)  follistatin, putative [Toxoplasma gondii GT1] (1e-99)
GeneOntology terms

  
GO:0004867 serine-type endopeptidase inhibitor activity
GO:0005575 cellular_component
GO:0050819 negative regulation of coagulation
InterPro families
  
IPR002350 Proteinase inhibitor I1, Kazal
IPR011497 Protease inhibitor, Kazal-type
Orthology groupMCL19010

Nucleotide sequence:

ATGAAGACTGTAATAGGTCTACTGTTTATCTTCGCATCGATATGTTATTTGGATGCGAAA
AGAATAAAAAAAAGGTCATGCATCTGTACAGAACTATACAGTCCTATATGTGGTACTGAT
GGAACCACTTACACGAATAAATGTTTTTTCAATTGCGCCAAAAATACCCACAAAAAACAT
GGTTCCACAAAAGACATATATATAGCCTATGAAGGAAAATGTAGTGACTCATGTATTTGC
AAGGATAACTATTCTCCGGTATGTGGCAGTGACGGCAAAACGTACCCTAATAGCTGTTAT
CTTAATTTTAAAAGTAAAGAAATAGAAAATGACTGTAAAAATAACGGAGACGATCCTGAT
GAAAACAAATTAATAGAAGCATATAAAGGTGAATGTTCCGACGAATGTTTTTGCACAGAT
GAATATGCACCAATTTGCGCTAACAACAATAAAACTTACTCGAATTCCTGCCAACTAGAG
TGTGAGAATAAAAAAAGAAAAAATAATAATTTACCGCCTCTTGTGGTTAAAAGTGACGGT
CAATGTCCCAAACCATGTATCTGCGAAGGAATGTATCAACCGATATGCGGTGACGATGGG
AAAACTTATGCCAATGTTTGTAGTTTAGGATGTATTAATGAAGAAAGACAAAATAATAAC
CTTCCACCAATAAGTAAAAGGAGTGACGGAAAATGTCCGAACATATGTAAATGTCCAAAA
ATATATAAACCAGTATGTGGAAATGATGGCAAAACATATCCAAGTAATTGCAATTTAAAA
TGTATAAACAAAGAGAGAGAAGGAAACAAGCTGTCACCCATTAGAGAAATCAGTAAGGGC
GAGTGTCCAAAAACATGTGTATGCCCTTTTAATTATTTACCTGTATGCGGTTCTGATGGA
GTTACTTATTCTAATGAATGTTTACTTAAATGTGCAAGTAAGGACAATGAAAAGAAAAAC
TTACCACCTATAACTGTTGTAAATGAAACGTCATGCCCAGAATCATGTCTGTGTCCATTA
ATCTATGAGCCAATATGCGGTGACGACGGCAAAACATATTCCAGTAGTTGTGAACTGAGA
TGTAAAAATAAAGAAAGAGAAATAAATAAAGAACTACCAATTAAAAAAGTCAGTGATGGG
GAATGCTCAAAACCATGTCGTTGTCCAAAAATTTATAGTCCCGTGTGTGGTGATAATGGT
GAAACATTTTCTAATAACTGTGAATTAGAATGTGAAAACAAAAAACGCCAAGCTAAAAAT
GAATCACCAATAGCTGTGGTAAGTAAGGGAAAGTGTCCGGAACCTTGTAGTTGCCCAAAA
ATATTCGAACCTGTATGTGGTGATGACGGAATAACTTATTCCAGCAGTTGTGATTTAGGT
TGTGTTAATAAAGAAAAAGAAAAAAATAATGAAGCACCCATCCTTGAGGTTTCCAAAGGT
GCATGCCCAGGTTCCTGTATATGTCCATTAATAATTTCAGAGCCTGTTTGTGGAAGCGAC
GGTCAAACTTATCGTAGTGAATGTGAATTAGACTGTGAAAATAAAATAAGAATAGCAAAA
GATGAATCACCTCTCTCTGTTATTAGCAAGGGTGAATGTCCAAAAGCTTGCGCGTGTCCT
TTAATAGATCTTCCTGTTTGCGGTTCGGATGACGTCACTTACCCTAACGAATGTTCACTT
AACTGTACAAGTGCAGATAATGTAAGAAAAAGTTTACCTGCTATTACTGTGAAAAGCCAA
GGAGAATGTGAAGAGTCATGCATATGTTCAACAAATTATGATCCTATATGTGGTTCAGAC
GGTGTAACTTACTCCAACGAGTGTCAACTAGAATGCAAAAATAAAAAGCGAATCAAAAAC
TCCCTAGATAGAATAGATATTGTAAAAAAAGGAAAATGTAATGGATCCTGCAGCTGTCCT
GCAGATGTCAATCCAGTATGTGGCAGTGACGGACAAAGTTATCCCAATGAATGTCAATTA
GTATGCGAGAGCGATGATTTGGTACGACAGGGGCTTTCAGCTTTAGAAGTCATCGAAAGT
GATCTTTGTGAAGAATCATGCGAATGTTATAACGCAATTATACCAGTTTGCGGGTCAAAT
AATAAATCTTACAGAAATGCTTGCTATTTAGATTGTGCCAACAGAAACAGAAGAGGCAAT
GAAACATCAATTACGATAAAATATAGTGGTGCATGCAGAAGTTGCACTTGCACCCGAGAA
CTTAACCAAGTGTGTGGTAGCGACGGTAATACGTATAATAATCCTTGTCTTTTAGATTGT
GAAAGTGAAAGACTAAAAGGAATAGGAAAATCACCTCTGTATATTATTCACTATGGCGAC
TGTCAAGGATGTGATTGTTCAAATGAATACGAACCTGTCTGTGGAACTGATAACAATACA
TACACAAACTTATGTCAATTACAGTGTGAAAGTAACATTAGACAACGTGAAAATCAGAAA
GAGATAGCTCTCCTCAGCAAAGGAACATGCCCAGAGAGTGATTATGATTGTGAAAATTGC
CCTCTTACGTACCAACCAGTTTGTGGTAAAGATCTTGTAAGCTACTGGAACGACTGCTGG
TTTAAATGTAGTAATAAATGTAAACTGAGTCGTGGGGAAAAACCTATCCCGATGGCTAAA
ACTGGATGCTGTTAA

Protein sequence:

MKTVIGLLFIFASICYLDAKRIKKRSCICTELYSPICGTDGTTYTNKCFFNCAKNTHKKH
GSTKDIYIAYEGKCSDSCICKDNYSPVCGSDGKTYPNSCYLNFKSKEIENDCKNNGDDPD
ENKLIEAYKGECSDECFCTDEYAPICANNNKTYSNSCQLECENKKRKNNNLPPLVVKSDG
QCPKPCICEGMYQPICGDDGKTYANVCSLGCINEERQNNNLPPISKRSDGKCPNICKCPK
IYKPVCGNDGKTYPSNCNLKCINKEREGNKLSPIREISKGECPKTCVCPFNYLPVCGSDG
VTYSNECLLKCASKDNEKKNLPPITVVNETSCPESCLCPLIYEPICGDDGKTYSSSCELR
CKNKEREINKELPIKKVSDGECSKPCRCPKIYSPVCGDNGETFSNNCELECENKKRQAKN
ESPIAVVSKGKCPEPCSCPKIFEPVCGDDGITYSSSCDLGCVNKEKEKNNEAPILEVSKG
ACPGSCICPLIISEPVCGSDGQTYRSECELDCENKIRIAKDESPLSVISKGECPKACACP
LIDLPVCGSDDVTYPNECSLNCTSADNVRKSLPAITVKSQGECEESCICSTNYDPICGSD
GVTYSNECQLECKNKKRIKNSLDRIDIVKKGKCNGSCSCPADVNPVCGSDGQSYPNECQL
VCESDDLVRQGLSALEVIESDLCEESCECYNAIIPVCGSNNKSYRNACYLDCANRNRRGN
ETSITIKYSGACRSCTCTRELNQVCGSDGNTYNNPCLLDCESERLKGIGKSPLYIIHYGD
CQGCDCSNEYEPVCGTDNNTYTNLCQLQCESNIRQRENQKEIALLSKGTCPESDYDCENC
PLTYQPVCGKDLVSYWNDCWFKCSNKCKLSRGEKPIPMAKTGCC