DPGLEAN08870 in OGS1.0

New model in OGS2.0DPOGS210680 
Genomic Positionscaffold321:- 13256-19749
See gene structure
CDS Length1389
Paired RNAseq reads  137
Single RNAseq reads  388
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA006297 (2e-96)
Best Drosophila hit  kal-1 (9e-33)
Best Human hitanosmin-1 precursor (2e-33)
Best NR hit (blastp)  Kal-1 protein [Bombyx mori] (8e-172)
Best NR hit (blastx)  Kal-1 protein [Bombyx mori] (3e-172)
GeneOntology terms



  
GO:0004867 serine-type endopeptidase inhibitor activity
GO:0007155 cell adhesion
GO:0030414 peptidase inhibitor activity
GO:0005576 extracellular region
GO:0009986 cell surface
InterPro families



  
IPR015874 4-disulphide core
IPR008957 Fibronectin type III domain
IPR008197 Whey acidic protein, 4-disulphide core
IPR013783 Immunoglobulin-like fold
IPR003961 Fibronectin, type III
Orthology groupMCL15174

Nucleotide sequence:

ATGTGGATGATAAAAACTGGTGTGATAATACTGGCTGTTCTGATATCAGCATCGGCTAAA
TCAAAAAGGTACACTAGACTGCAGAGCGATCCCTTGACAACAACGAGATGTGACCTTATA
TGTTTTGATGCGAGCAAAGAAAATAAATCTCAGTGTCGATCAGCTTGTCGGTCAGAGACG
CAAAAGCCAGGAACCTGTCCTGATGGAGACGATCCTCGCTGGATGGCCGCGTGCCTCGAA
GCCTGTAATCATGACTCTCAATGCGACGGCACTCAAAGATGTTGCCAGCATGGATGCAGT
TCCACGTGCAGTGAGCCCACCGATTTGTTGACTATACCGGGCCTTCCAGCGATGCCGTCA
ATAGAGGAACCAAAAGAAAGACGGCGCGCAGTTCAGATTAAATGGTCAGATGGTGTAGGT
GATGAAGCAAGATCTGTTCCAGGTAGAGTTCTTTACCTATTGGAAGAACAACATTATCTT
TGCCCCAACTACGATGAATCACGACTTGGAGAGTGGAATCTCCTGATGAGAACCAATAAA
ACCAAAGTGTCTCTACGTAACCAGTTGAAACCAGGTCGTTGGTATCGTTTCCGAGTGGCT
GCTGTAAGTGCGTCTGGTACAAGAGGGTTCTCCGAACCTAGCGCTCCCTTCACTCCTCGT
AAAGGACCACGCCCTCCACCCCCGCCAAAGAAGCTAAAGGTGGAACATGTGAGATCAGAC
AATGATAGTGTTACAATACGACTGGAATGGAAAGAGCCAAAATCTGATTTACCAGTGATG
AGATATAAAGTTTTTTGGAGCAGACGACTTCGAGGTCTCTCAGGGGAGTTGGATTCTGTT
GTCGTTAATCATCAAACTGTGCCAAAGGATCAGACTTTCGTTGAAATAAGTAAACTTCAT
CCGAATTCAATGTATTTCCTCCAAGTACAAACAATAAGTGCATTTGGTGGTGGAAAACTA
CGAAGTGAAAAGGCTGAAATTTTTTATAACACAACGAGTTCTGAACAGCCACCACAGGCA
TTAAAAAGGCGTATAGACAACTCCGTAACAGGACTCAGATTAAACAAACTTATATGGTTG
AACCATAAGATTAAGGCTAAAATATCATGGGAATTGCCTCCAGGCTCAAAGGGACAATCT
AAAAGATATTTTGTGCACTGGAAAACTCTGTCCTGCCAACATCCAGCAACAGAATTAAAG
GAATTTTCAGCAATAACCGAGCAAAACAGCTTCGAAATATATGAGTTAGATTACAAATGC
AAATACAAAGTAAACGTGAACAGATCTCCGAACAGCGTTACTCCAGACTCCGAATACATT
TTATCAGTTCCTGGATGCGATTATTTTAAACGGAAATTTAATAGCTCCTACGTTAAATGT
AAAACATAG

Protein sequence:

MWMIKTGVIILAVLISASAKSKRYTRLQSDPLTTTRCDLICFDASKENKSQCRSACRSET
QKPGTCPDGDDPRWMAACLEACNHDSQCDGTQRCCQHGCSSTCSEPTDLLTIPGLPAMPS
IEEPKERRRAVQIKWSDGVGDEARSVPGRVLYLLEEQHYLCPNYDESRLGEWNLLMRTNK
TKVSLRNQLKPGRWYRFRVAAVSASGTRGFSEPSAPFTPRKGPRPPPPPKKLKVEHVRSD
NDSVTIRLEWKEPKSDLPVMRYKVFWSRRLRGLSGELDSVVVNHQTVPKDQTFVEISKLH
PNSMYFLQVQTISAFGGGKLRSEKAEIFYNTTSSEQPPQALKRRIDNSVTGLRLNKLIWL
NHKIKAKISWELPPGSKGQSKRYFVHWKTLSCQHPATELKEFSAITEQNSFEIYELDYKC
KYKVNVNRSPNSVTPDSEYILSVPGCDYFKRKFNSSYVKCKT