DPGLEAN07996 in OGS1.0

New model in OGS2.0DPOGS200722 
Genomic Positionscaffold402:- 50068-66397
See gene structure
CDS Length2469
Paired RNAseq reads  339
Single RNAseq reads  882
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA001121 (0.0)
Best Drosophila hit  dumpy (3e-20)
Best Human hitprotein kinase C-binding protein NELL1 isoform 1 precursor (1e-125)
Best NR hit (blastp)  protein kinase C-binding protein NELL1 precursor, putative [Pediculus humanus corporis] (0.0)
Best NR hit (blastx)  PREDICTED: similar to protein kinase c-binding protein nell1 [Tribolium castaneum] (0.0)
GeneOntology terms








  
GO:0045667 regulation of osteoblast differentiation
GO:0005576 extracellular region
GO:0005198 structural molecule activity
GO:0006917 induction of apoptosis
GO:0045778 positive regulation of ossification
GO:0005509 calcium ion binding
GO:0005080 protein kinase C binding
GO:0007399 nervous system development
GO:0005737 cytoplasm
GO:0007155 cell adhesion
InterPro families













  
IPR001791 Laminin G domain
IPR000742 Epidermal growth factor-like, type 3
IPR001007 von Willebrand factor, type C
IPR000152 EGF-type aspartate/asparagine hydroxylation site
IPR013032 EGF-like region, conserved site
IPR018097 EGF-like calcium-binding, conserved site
IPR008985 Concanavalin A-like lectin/glucanase
IPR013091 EGF calcium-binding
IPR006209 EGF
IPR012680 Laminin G, subdomain 2
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
IPR003129 Laminin G, thrombospondin-type, N-terminal
IPR006552 VWC out
IPR001881 EGF-like calcium-binding
IPR006210 Epidermal growth factor-like
Orthology groupMCL15163

Nucleotide sequence:

ATGTATGTTTATCGAATGTATTTTATTGTTCCAGCCACCGAGCTAGACTTGTTGGCTGCG
CTGTCGCTGCACAACACTACGCGAACCGGAGTCAGCGCTGCTCCTGGCATGCAGCCTCAA
CGGACCGCGTACGCCCTTGACGGGGACTCGCGGTCGCTGCAGGTGGAAGGTACGGCCTTC
GACCACGCCATGGAGCTTCTGCGACGATCGCCAGAGTTCACAGTCCTAGCCGCTCTTCGA
CAGGAGCCGGCTAACTCGGGAACAATTCTATCCTTCTCACACGGATACAACAGGTATCTG
GAGTTGCAGTCAAGTGGTCGTCGTGATGAGGTGCGTCTTCATTACGTGGAGGCAGGTGGC
GTAACGGCCCGAGTGGAGACCTTCCCGTTTCGACTCGCGGACGGCGCATGGCATCGGGTG
GCACTTGCTGTCTCAGGGGCGCAAGCAACTCTGCTCGTTGACTGTCACCCGCTATATCGA
CGATTAATACCACCACCAGACCGGAATTTTACACAACCACAACTCTCATTGTGGGTAGGA
CAGAGAAATAGCAAGCATTCTTTATTTAAGGGAACCCTTCAAGATGTTAGATTGGTGAGT
GGGCCTCACGGCTATTTGGTACAGTGTCCGGGACTGGACTCTGAATGCCCTACTTGCGGG
CAGTTCTCACTGCTACAAGCCACTGTACAGGAACTAACTTCACATATCCATGACCTTTCA
CTAAAGCTTGTTGGCGCCGAAGCAAGACTGGCGCGTTTAGAACAATGTGATTGCCAAAAA
TCGTGTTACTCTAATGGGACAGTGCACGCAGATGGTGCAACTTGGCAAAAAGACTGCAAT
CGCTGCTCTTGCGTGCATGGTGAAATAACGTGCAGGCCAGTAGAATGCGACAGAGCGGAA
TGTAAAAATCCAGTGTTACATCCAGGAGAATGCTGTCCCACGTGTCTGAGACAGTGCCTC
CTAAAGGGCACGCTTTATGAACACGGCGAGCGATTCGCTCCCAAAGAGTGCGCGGAGTGT
GTTTGTCACGACGGTAATATGCAATGCGCACGCGTCGATCCCGACACAGCCTGCCCACCG
CTACCCTGCGACGCCCCGGACCAGTTTACTGTACCCGGGGAGTGTTGCAAGTTCTGCCCT
GGTGTGGACTACTGTAGTATGGGACATTCGTGCGATGAAAATGCTACTTGTATGAATCTT
AATACAAAGTACACTTGTAAATGCAATCAAGGATTCCAAGGGGATGGAATCACATGTGAA
GATGTAGATGAATGTCAAGCGGCTGGTGGTCTTTACGGTCACCACTGTCATTCCAACACT
CGTTGTGTGAACGTCGTAGGAGGGTACGTGTGTCAATGTCTTCCGGGATACACCAGGAGG
GATAAATTCAACTGTGTTGAGGTGGACGAGTGTTTGAGTGACACACATGGATGCGATCCT
CACGCCGAGTGCAGTAACACGCCTGGTTCATACACCTGTCTGTGTAGGGAGGGATACTCC
GGAGACGGTTATACATGTACACCTATATGTAGCGGAGGTTGTCTAAACGGCGGTGTATGT
GCCAGTCCGGAGCACTGCGCGTGCGCACGCGGTTTCGCTGGCGCTCGTTGCGAGCGGGAC
GTTGACGAATGCTTGCGTGCGGCTCACCCCGCAGCGCCGAGAGCTTGCGTGCCGCGAGCC
GCGTGCGTCAACACCCCTGGCTCATACTACTGCGTCTGCAAGAACGGCTATAGAAGAGAC
CCCCATAGAGATCACTGCGAAGATGTTGATGAATGTGCTGAAGGCTTTCATACCTGTCAT
CCAAGCGCACGGTGTGTTAACACGGACGGAGGATTCAGATGTGAATGTGATACAGAAAAT
TGTGAATTGAGTTGTTCATGGCAAGGCCGCATCGTGTCTGACGGCGGGCGGTGGTCGGAA
GGCGGTGGATGTCGGGCATGTTCGTGTGCCAGTGGGGTGGCTACCTGCGAGGATGCTGTG
TGCGCCTGTGACACGGACAACACGTCGCTAACTTTCCAGAGCTCGGAGTCTATTCCCCTG
GCTCCGTCGTCCTGCTGTCCTCACTGCGATTCTCGGTATCACTGCCGTCACCAGGAGATG
CACCACGTAACCTTCCGTAGCGGCGAACGCTGGCTCTACCAGTGCCAGATTTGTGAATGC
CTCCTGGGTGAGGTGGACTGCTGGGAGCCCGAGTGCGAGGATGGTGGAGGGTGCTGCGCC
TTTGACACGGGGGAAGCCCCGGGGAGCAAGACCCGGGGCGAAGGGGAGACCTGGCGCACG
CCCCACCGCCTCGAGCTCGCGGGCTGCGCGCCACCACACTGCCCCACGTGCCAGGGCGGG
CAGTGTGCTACTTTGAGCTCGGCGCGACGCGGCGGCGGCGGCAGCCCTGGCCCTGGCGGC
GCTGTGGTGGGGCCGCGCCCCACGACCTCACCGCAAGCACCGCGGCGCGCGGCGCTAGAG
CCGCCCTGA

Protein sequence:

MYVYRMYFIVPATELDLLAALSLHNTTRTGVSAAPGMQPQRTAYALDGDSRSLQVEGTAF
DHAMELLRRSPEFTVLAALRQEPANSGTILSFSHGYNRYLELQSSGRRDEVRLHYVEAGG
VTARVETFPFRLADGAWHRVALAVSGAQATLLVDCHPLYRRLIPPPDRNFTQPQLSLWVG
QRNSKHSLFKGTLQDVRLVSGPHGYLVQCPGLDSECPTCGQFSLLQATVQELTSHIHDLS
LKLVGAEARLARLEQCDCQKSCYSNGTVHADGATWQKDCNRCSCVHGEITCRPVECDRAE
CKNPVLHPGECCPTCLRQCLLKGTLYEHGERFAPKECAECVCHDGNMQCARVDPDTACPP
LPCDAPDQFTVPGECCKFCPGVDYCSMGHSCDENATCMNLNTKYTCKCNQGFQGDGITCE
DVDECQAAGGLYGHHCHSNTRCVNVVGGYVCQCLPGYTRRDKFNCVEVDECLSDTHGCDP
HAECSNTPGSYTCLCREGYSGDGYTCTPICSGGCLNGGVCASPEHCACARGFAGARCERD
VDECLRAAHPAAPRACVPRAACVNTPGSYYCVCKNGYRRDPHRDHCEDVDECAEGFHTCH
PSARCVNTDGGFRCECDTENCELSCSWQGRIVSDGGRWSEGGGCRACSCASGVATCEDAV
CACDTDNTSLTFQSSESIPLAPSSCCPHCDSRYHCRHQEMHHVTFRSGERWLYQCQICEC
LLGEVDCWEPECEDGGGCCAFDTGEAPGSKTRGEGETWRTPHRLELAGCAPPHCPTCQGG
QCATLSSARRGGGGSPGPGGAVVGPRPTTSPQAPRRAALEPP