DPGLEAN14959 in OGS1.0

New model in OGS2.0  DPOGS211874
Genomic Position  scaffold10791:+ 896-3543
CDS Length  1395
Paired RNAseq reads  32
Single RNAseq reads  106
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001245 (2e-76)
Best Drosophila hit  CG17739 (5e-19)
Best Human hit  kunitz-type protease inhibitor 3 precursor (3e-09)
Best NR hit (blastp)  AGAP012307-PA [Anopheles gambiae str. PEST] (4e-40)
Best NR hit (blastx)  AGAP012307-PA [Anopheles gambiae str. PEST] (1e-40)
GeneOntology terms
GO:0005576 extracellular region
GO:0004867 serine-type endopeptidase inhibitor activity
InterPro families
IPR002223 Proteinase inhibitor I2, Kunitz metazoa
IPR000884 Thrombospondin, type 1 repeat
IPR020901 Proteinase inhibitor I2, Kunitz, conserved site
Orthology group  ND

Nucleotide sequence:

GCTCGTCGAAATCCGACAAACCCAAAAGCTCCAGTCCGAGCTCTCAGACAGGAGTGGCCC
AGAGACACTCGCTCGGGGCCCTTCTTGAGCTCGTTCGGTGAATTGCGACCATTTGCTCGG
CTGAGGCTGACGAGATTGAGACTTTACGAGAAAAGCTGTGACGTCACAGAGCCTACTGGC
GTGCAAGTCCCCGAGGGGGCTGCCGTTCCCGGCGGTTCATGTGCTACGTACGGTTGGGGC
GGTTGGAGCCCCTGCAGCGTGTCGTGCGGCACGGGGCGAAGCACCCGCCAGCGCCGGTAC
ATGTGGCCACTTCGCGCCCAGCACGACGCCTGCCGCACTACACTCACGGAATACAGATCC
TGTCATGGACCGAGACTGCACTGCAGAGTGAAGTCTGACTACGAGCCTGAGGCGGCGGAC
TCATCAGGTCCTTGCGCCATGTCTCCCTGGTCGGAGTGGTCTCCGTGTGTAGGGTGCGGA
GTGCGAGCTCGCACCAGGCACTACCTCGCTCCGCGCGCACACAAGCGGTGCCATGTGGGG
TTCCGCGCAAGAACCGTCATGAGCCAGGCCATGCCGTGCGACGCGGGACCCTGTTACAAA
CCCTTCCACGGGGTGGGCCAACTAACGTCGTATATACTCGGAGGATACAATGTTGTTGTA
AAATATTGCACTTACCAGGTTTTACTTGATTACAGACCTCCCAAAGAACGCGCGAATGCA
ACAAACTTTGATTGGTTTTTTGAGCTCATGATGGCAGGCTATGGTCACGGTACAATGACG
TCACCTGATGTCCAGGAAAGCCCCAAGTCGGACTGTCCGGTGACTCCGTGGTCCACGTGG
TCGCCGTGTTCCTCGCGCTGCGGCCGAGGCCGGCGACTCCGCACCAGGATGTACGTGGTG
CGGGAGACGAACCTTCAGCGAGAGATCACTAAGAGGCTGTTAAGGGACTGGAACCAACGG
TTCGCTGAACTACAGAATTTGGAACTGCCTCATGAGAACATAACCAGCGAGGACCCGTCC
CTGGACGCCGCGGTCCAGGAACATCTCGACAGATGTCAGTTCACGATGACGCAGCAGGAG
GCGCTGTGTGACGGAGGCGACGGAGGCTGCTCCGACAACCCTTCACCCAATGAGATCTGC
GCGCTGCCGGTGTCGGTGGGGCCGTGTCGGGGCTACGACGAGCGCTGGTTCTTCGACCAC
CCGCGAGTCGCGTGCGAGCCCTTCGGGTACACCGGCTGCGGCGGGAACGAGAACAACTTC
AGGACCCGGCAGATCTGCGAGCAGACTTGTCTGTCGCAAGGAAACAAACACGACAACGGT
AGTGGGGCTTCTGTAGGGAATTTATCCCAGTCGTTCTCATCTCGTTCTAAGCGATCGCGG
TTACCGCAAAGTTAA

Protein sequence:

ARRNPTNPKAPVRALRQEWPRDTRSGPFLSSFGELRPFARLRLTRLRLYEKSCDVTEPTG
VQVPEGAAVPGGSCATYGWGGWSPCSVSCGTGRSTRQRRYMWPLRAQHDACRTTLTEYRS
CHGPRLHCRVKSDYEPEAADSSGPCAMSPWSEWSPCVGCGVRARTRHYLAPRAHKRCHVG
FRARTVMSQAMPCDAGPCYKPFHGVGQLTSYILGGYNVVVKYCTYQVLLDYRPPKERANA
TNFDWFFELMMAGYGHGTMTSPDVQESPKSDCPVTPWSTWSPCSSRCGRGRRLRTRMYVV
RETNLQREITKRLLRDWNQRFAELQNLELPHENITSEDPSLDAAVQEHLDRCQFTMTQQE
ALCDGGDGGCSDNPSPNEICALPVSVGPCRGYDERWFFDHPRVACEPFGYTGCGGNENNF
RTRQICEQTCLSQGNKHDNGSGASVGNLSQSFSSRSKRSRLPQS
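
The nucleotide and protein sequences above can be cross-checked against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the names `translate`, `first_codons`, and `last_codons` are illustrative and not part of the record. It translates the first and last codons of the listed CDS and compares the expected protein length with the 464 residues shown.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the sequence listed in this record.
first_codons = "GCTCGTCGAAATCCGACAAACCCAAAAGCT"
last_codons = "TTACCGCAAAGTTAA"

print(translate(first_codons))  # ARRNPTNPKA
print(translate(last_codons))   # LPQS

# A 1395 nt CDS holds 465 codons; minus the terminal stop, 464 residues.
cds_length = 1395
print(cds_length // 3 - 1)      # 464
```

Pasting the full nucleotide sequence into `translate` should reproduce the protein sequence listed above, provided the assumptions about the genetic code and frame hold.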