DPGLEAN22201 in OGS1.0

New model in OGS2.0DPOGS209745 
Genomic Positionscaffold744:+ 58077-66483
See gene structure
CDS Length2820
Paired RNAseq reads  1055
Single RNAseq reads  2772
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008958 (0.0)
Best Drosophila hit  Reversion-inducing-cysteine-rich protein with kazal motifs (1e-128)
Best Human hitreversion-inducing cysteine-rich protein with Kazal motifs precursor (2e-112)
Best NR hit (blastp)  serine protease inhibitor [Aedes aegypti] (0.0)
Best NR hit (blastx)  PREDICTED: similar to serine protease inhibitor [Tribolium castaneum] (0.0)
GeneOntology terms  GO:0004867 serine-type endopeptidase inhibitor activity
InterPro families
  
IPR002350 Proteinase inhibitor I1, Kazal
IPR011497 Protease inhibitor, Kazal-type
Orthology groupMCL15749

Nucleotide sequence:

ATGTCGCTGGTGGAGATGGCTTCCGATTCTGCAATGCGGGAAGAGAGAATAGAAAACATT
TACAAATTCTGCGCCCCTCATTTGATAGAGTTCTGGATATGCATGAATCAGACAATACAA
GAGGTGGTATCACGGTCTGGTTGGTGGGGGCGCGCGTGTTGTGCTTTAGGTCATTCGGCC
AAGTGTCGTAGAGCTTGCGCCACAGCCTCTGATGCGAGTGCTCTGTCTGAACCGTGCAGA
AGATCGGATGAAATAAACTTCTTCGACTGCGTGCAGAAGCAGCAAGAGGGACAGTGGTGC
TGTTCTCAAACACGGTCAATAAGCTGTCACGAAGCGTGTCAGAAAGCTGTGTGGAGGGTC
GGGCAGACCAGAGCTGACAGTGGCGCCAGGGAGAAGGCCGCTGAGTTGTGCGAGCAGTCA
CCAGCGTTGCTGCACTGCTTCCGAGATCTGACAGCATCCACTGTCCACACCGACACTTCT
AAATATTTACCATGCTGCCACGAATCTCCAAGTCAAGAGTGTCGTAGTACCTGCGAAACA
GTGCTCAGAAGAACGGGGGAGTCCCAGGAAATCGCGGAGGCGTTGTCCCAGGAATGTGGT
GCTCCCGCGATCCACGATGGTATGTGGCAGTGTTTCCTAAGAAAAGACGCGCCTCCAGAA
ACCAAAGACGTTATTCCTCATGATGTAGCAAAACTGCATTGTTGTCAGAAGGGCGCTACG
ATTAATTGTAGAAGGCTATGTTTCTATACATTTAACAACGGCTGGCATTTAAATTGGCAG
AAATTCTATTCTGAATGCCTCGGAGATCCACAGGAAACGGAAATGGCGGAATGCATAGAA
GAAGTCGAGGCTCCGTGTTCCTTGGGCTGCGCCGGCCTTACATACTGTAGTCAACTAAAC
AATCGCCCGACCAGTTTGTTCCGATCGTGTTCAGCTCAGGCTGATTTGGACGCACATTTG
GCCGTCGCCGAGCAAAAGGCTAGTGGGGACGTCACTGTGGCAGGGTTACGTTTACCGTTG
AAGAATTCTTCCCAGTGCACTACAGATATTTGGAAGAGCGTCGCTTGTGCTCTTCATGTG
AAGCCCTGTACACTAAAGGGTCACAGTAGCCTGTTGTGTGCTGAGGAGTGTCGTCGCCTG
GTGTCCTCATGTGTGGAGTGGTCTCGAGCCCCGCTGTCAGCCGCCGCGCTCTGTGCTAGA
CTAGCGCCAGCTCGCCCTGACGCTCCATGTGTGTCACTGGCCGAGTTCCTTACAGCCAGT
CCCGAACCGCCTTTACTATCAGCTACGGAAATGGTCACTTCCCCATGCGCTGGTTCTCCC
TGTAACAGTTCCCAAGTGTGTGTCATCAACAGAAGTTGTATTCGAGGAGGCTCTTGCTCC
AAGTATACATGCATTGACGGATGTCCGCTAGGTGATGGTAGCCCATATGTGGTTCCCGTC
GGTTCCTGGGTGAGGGTGCCCATGCCCTGTGCTGCACAGAAGGTCTGCATCAAGATTTGC
CGCTGCGGTAACAAGGGGCTGTCAGACTGCCAGCCGTTACCCAGCGTCGCCCTTGATAAT
TGTCGATTGCATGATAAAGTGGTCAAGCATGGTGACAAGTACTACATGGAGTGCAACCCA
TGTGTGTGTGTGTCTGGTGAGCGTGTGTGCGCTCGTCGAGCGTGTGGTCGCGCGGCGCTC
CTGACGGGTTTGCCCTGTAACTGTCCCCCTCACCATCTTCCAGTGCGATCGCCAGGAAGG
CTCTACCCTAATGCCTGCTTGGCCAAATGCGCGGGCGCGACGGACGCTGAGATCGAATTC
GGTTCGGGAGGTGTTTGTTCTGGCGGCGAATGTTCGCGGCTCGCGTGTCTCCCCGCTCGC
TCCGTCTGCCTCTCGCGCCTACAGACAGCGTGTCCGCAGCACGTATGCGCGGGCGCGACG
GACGCTGAGATCGAATTCGGTTCGGGAGGTGTTTGTTCTGGCGGCGAATGTTCGCGGCTC
GCGTGTCTCCCCGCTCGCTCCGTCTGCCTCTCGCGCTTACAGACAGCGTGTCCGCAGCAC
GTGTGCGTGAGTACAACCAACTGCCACACCCAGCCGCCGTCGCCGGTGTGTGACACGGAC
GGACATTCCCACTCCAAGCCCTGCCAGCTCATCATGAGTGGTCGCCGGTTTGCCTACTGG
GGACAATGTTTACAGGGATGTTCATCGAGCGGAACAGTTTGCGGAGTCAATGGCATCACC
TACAGCTCTGAATGCGCCGCGTGGTCGGAATACGTCAGTGTGGACTACAACGGAGCTTGT
TTTGCTGTTGGTCCCATATCAGACCTCATGGAACCAAAATGTCAGTTCGACAGAATAATG
TGCCCTCCATTGAAGAAACCGGATTGTCTCGGATATACTGCCCCTGGAGCGTGTTGCCCG
AAATGCGGTGGAGCTTTGAGAATATTATACTCTAAAAAACAAATCGATAGAGCGCTTTAT
GGGACCAATATATCGGCTTCCGTTATAAACCTGCACAATGTCCTGTCAGCTCTGGACAGG
CATGTCAAAGTGGCTGAATGCGCTTTAAGGGGCTATCTTACCATCGAAATGGAGATTTTC
GTCACCATCGAATCAGTATTGAAGAATCCGACCGACCTGCAGTTGAACGTCTGTGTTTTG
GAAGCGGAAAAACTTGCTGATCTCATAAACCGGGAGAGTGCTCTTATTTCTAGTGATTTA
GGCTTGAGTGCGTTGTCGTACGCCTTGATAGTGCACACACATCCGTCTCAAGGCGCGTCC
GTTGTGAGTTTGTCGTTGGCCATAATATTATCGTACACAGTAATATTCGTTCTGAGGTAG

Protein sequence:

MSLVEMASDSAMREERIENIYKFCAPHLIEFWICMNQTIQEVVSRSGWWGRACCALGHSA
KCRRACATASDASALSEPCRRSDEINFFDCVQKQQEGQWCCSQTRSISCHEACQKAVWRV
GQTRADSGAREKAAELCEQSPALLHCFRDLTASTVHTDTSKYLPCCHESPSQECRSTCET
VLRRTGESQEIAEALSQECGAPAIHDGMWQCFLRKDAPPETKDVIPHDVAKLHCCQKGAT
INCRRLCFYTFNNGWHLNWQKFYSECLGDPQETEMAECIEEVEAPCSLGCAGLTYCSQLN
NRPTSLFRSCSAQADLDAHLAVAEQKASGDVTVAGLRLPLKNSSQCTTDIWKSVACALHV
KPCTLKGHSSLLCAEECRRLVSSCVEWSRAPLSAAALCARLAPARPDAPCVSLAEFLTAS
PEPPLLSATEMVTSPCAGSPCNSSQVCVINRSCIRGGSCSKYTCIDGCPLGDGSPYVVPV
GSWVRVPMPCAAQKVCIKICRCGNKGLSDCQPLPSVALDNCRLHDKVVKHGDKYYMECNP
CVCVSGERVCARRACGRAALLTGLPCNCPPHHLPVRSPGRLYPNACLAKCAGATDAEIEF
GSGGVCSGGECSRLACLPARSVCLSRLQTACPQHVCAGATDAEIEFGSGGVCSGGECSRL
ACLPARSVCLSRLQTACPQHVCVSTTNCHTQPPSPVCDTDGHSHSKPCQLIMSGRRFAYW
GQCLQGCSSSGTVCGVNGITYSSECAAWSEYVSVDYNGACFAVGPISDLMEPKCQFDRIM
CPPLKKPDCLGYTAPGACCPKCGGALRILYSKKQIDRALYGTNISASVINLHNVLSALDR
HVKVAECALRGYLTIEMEIFVTIESVLKNPTDLQLNVCVLEAEKLADLINRESALISSDL
GLSALSYALIVHTHPSQGASVVSLSLAIILSYTVIFVLR