New model in OGS2.0 | DPOGS200722  |
---|---|
Genomic Position | scaffold402:50068-66397 (- strand) |
CDS Length | 2469 |
Paired RNAseq reads   | 339 |
Single RNAseq reads   | 882 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA001121 (0.0) |
Best Drosophila hit   | dumpy (3e-20) |
Best Human hit | protein kinase C-binding protein NELL1 isoform 1 precursor (1e-125) |
Best NR hit (blastp)   | protein kinase C-binding protein NELL1 precursor, putative [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to protein kinase c-binding protein nell1 [Tribolium castaneum] (0.0) |
Gene Ontology terms | GO:0045667 regulation of osteoblast differentiation; GO:0005576 extracellular region; GO:0005198 structural molecule activity; GO:0006917 induction of apoptosis; GO:0045778 positive regulation of ossification; GO:0005509 calcium ion binding; GO:0005080 protein kinase C binding; GO:0007399 nervous system development; GO:0005737 cytoplasm; GO:0007155 cell adhesion |
InterPro families | IPR001791 Laminin G domain; IPR000742 Epidermal growth factor-like, type 3; IPR001007 von Willebrand factor, type C; IPR000152 EGF-type aspartate/asparagine hydroxylation site; IPR013032 EGF-like region, conserved site; IPR018097 EGF-like calcium-binding, conserved site; IPR008985 Concanavalin A-like lectin/glucanase; IPR013091 EGF calcium-binding; IPR006209 EGF; IPR012680 Laminin G, subdomain 2; IPR013320 Concanavalin A-like lectin/glucanase, subgroup; IPR003129 Laminin G, thrombospondin-type, N-terminal; IPR006552 VWC out; IPR001881 EGF-like calcium-binding; IPR006210 Epidermal growth factor-like |
Orthology group | MCL15163 |
Nucleotide sequence:
ATGTATGTTTATCGAATGTATTTTATTGTTCCAGCCACCGAGCTAGACTTGTTGGCTGCG
CTGTCGCTGCACAACACTACGCGAACCGGAGTCAGCGCTGCTCCTGGCATGCAGCCTCAA
CGGACCGCGTACGCCCTTGACGGGGACTCGCGGTCGCTGCAGGTGGAAGGTACGGCCTTC
GACCACGCCATGGAGCTTCTGCGACGATCGCCAGAGTTCACAGTCCTAGCCGCTCTTCGA
CAGGAGCCGGCTAACTCGGGAACAATTCTATCCTTCTCACACGGATACAACAGGTATCTG
GAGTTGCAGTCAAGTGGTCGTCGTGATGAGGTGCGTCTTCATTACGTGGAGGCAGGTGGC
GTAACGGCCCGAGTGGAGACCTTCCCGTTTCGACTCGCGGACGGCGCATGGCATCGGGTG
GCACTTGCTGTCTCAGGGGCGCAAGCAACTCTGCTCGTTGACTGTCACCCGCTATATCGA
CGATTAATACCACCACCAGACCGGAATTTTACACAACCACAACTCTCATTGTGGGTAGGA
CAGAGAAATAGCAAGCATTCTTTATTTAAGGGAACCCTTCAAGATGTTAGATTGGTGAGT
GGGCCTCACGGCTATTTGGTACAGTGTCCGGGACTGGACTCTGAATGCCCTACTTGCGGG
CAGTTCTCACTGCTACAAGCCACTGTACAGGAACTAACTTCACATATCCATGACCTTTCA
CTAAAGCTTGTTGGCGCCGAAGCAAGACTGGCGCGTTTAGAACAATGTGATTGCCAAAAA
TCGTGTTACTCTAATGGGACAGTGCACGCAGATGGTGCAACTTGGCAAAAAGACTGCAAT
CGCTGCTCTTGCGTGCATGGTGAAATAACGTGCAGGCCAGTAGAATGCGACAGAGCGGAA
TGTAAAAATCCAGTGTTACATCCAGGAGAATGCTGTCCCACGTGTCTGAGACAGTGCCTC
CTAAAGGGCACGCTTTATGAACACGGCGAGCGATTCGCTCCCAAAGAGTGCGCGGAGTGT
GTTTGTCACGACGGTAATATGCAATGCGCACGCGTCGATCCCGACACAGCCTGCCCACCG
CTACCCTGCGACGCCCCGGACCAGTTTACTGTACCCGGGGAGTGTTGCAAGTTCTGCCCT
GGTGTGGACTACTGTAGTATGGGACATTCGTGCGATGAAAATGCTACTTGTATGAATCTT
AATACAAAGTACACTTGTAAATGCAATCAAGGATTCCAAGGGGATGGAATCACATGTGAA
GATGTAGATGAATGTCAAGCGGCTGGTGGTCTTTACGGTCACCACTGTCATTCCAACACT
CGTTGTGTGAACGTCGTAGGAGGGTACGTGTGTCAATGTCTTCCGGGATACACCAGGAGG
GATAAATTCAACTGTGTTGAGGTGGACGAGTGTTTGAGTGACACACATGGATGCGATCCT
CACGCCGAGTGCAGTAACACGCCTGGTTCATACACCTGTCTGTGTAGGGAGGGATACTCC
GGAGACGGTTATACATGTACACCTATATGTAGCGGAGGTTGTCTAAACGGCGGTGTATGT
GCCAGTCCGGAGCACTGCGCGTGCGCACGCGGTTTCGCTGGCGCTCGTTGCGAGCGGGAC
GTTGACGAATGCTTGCGTGCGGCTCACCCCGCAGCGCCGAGAGCTTGCGTGCCGCGAGCC
GCGTGCGTCAACACCCCTGGCTCATACTACTGCGTCTGCAAGAACGGCTATAGAAGAGAC
CCCCATAGAGATCACTGCGAAGATGTTGATGAATGTGCTGAAGGCTTTCATACCTGTCAT
CCAAGCGCACGGTGTGTTAACACGGACGGAGGATTCAGATGTGAATGTGATACAGAAAAT
TGTGAATTGAGTTGTTCATGGCAAGGCCGCATCGTGTCTGACGGCGGGCGGTGGTCGGAA
GGCGGTGGATGTCGGGCATGTTCGTGTGCCAGTGGGGTGGCTACCTGCGAGGATGCTGTG
TGCGCCTGTGACACGGACAACACGTCGCTAACTTTCCAGAGCTCGGAGTCTATTCCCCTG
GCTCCGTCGTCCTGCTGTCCTCACTGCGATTCTCGGTATCACTGCCGTCACCAGGAGATG
CACCACGTAACCTTCCGTAGCGGCGAACGCTGGCTCTACCAGTGCCAGATTTGTGAATGC
CTCCTGGGTGAGGTGGACTGCTGGGAGCCCGAGTGCGAGGATGGTGGAGGGTGCTGCGCC
TTTGACACGGGGGAAGCCCCGGGGAGCAAGACCCGGGGCGAAGGGGAGACCTGGCGCACG
CCCCACCGCCTCGAGCTCGCGGGCTGCGCGCCACCACACTGCCCCACGTGCCAGGGCGGG
CAGTGTGCTACTTTGAGCTCGGCGCGACGCGGCGGCGGCGGCAGCCCTGGCCCTGGCGGC
GCTGTGGTGGGGCCGCGCCCCACGACCTCACCGCAAGCACCGCGGCGCGCGGCGCTAGAG
CCGCCCTGA
Protein sequence:
MYVYRMYFIVPATELDLLAALSLHNTTRTGVSAAPGMQPQRTAYALDGDSRSLQVEGTAF
DHAMELLRRSPEFTVLAALRQEPANSGTILSFSHGYNRYLELQSSGRRDEVRLHYVEAGG
VTARVETFPFRLADGAWHRVALAVSGAQATLLVDCHPLYRRLIPPPDRNFTQPQLSLWVG
QRNSKHSLFKGTLQDVRLVSGPHGYLVQCPGLDSECPTCGQFSLLQATVQELTSHIHDLS
LKLVGAEARLARLEQCDCQKSCYSNGTVHADGATWQKDCNRCSCVHGEITCRPVECDRAE
CKNPVLHPGECCPTCLRQCLLKGTLYEHGERFAPKECAECVCHDGNMQCARVDPDTACPP
LPCDAPDQFTVPGECCKFCPGVDYCSMGHSCDENATCMNLNTKYTCKCNQGFQGDGITCE
DVDECQAAGGLYGHHCHSNTRCVNVVGGYVCQCLPGYTRRDKFNCVEVDECLSDTHGCDP
HAECSNTPGSYTCLCREGYSGDGYTCTPICSGGCLNGGVCASPEHCACARGFAGARCERD
VDECLRAAHPAAPRACVPRAACVNTPGSYYCVCKNGYRRDPHRDHCEDVDECAEGFHTCH
PSARCVNTDGGFRCECDTENCELSCSWQGRIVSDGGRWSEGGGCRACSCASGVATCEDAV
CACDTDNTSLTFQSSESIPLAPSSCCPHCDSRYHCRHQEMHHVTFRSGERWLYQCQICEC
LLGEVDCWEPECEDGGGCCAFDTGEAPGSKTRGEGETWRTPHRLELAGCAPPHCPTCQGG
QCATLSSARRGGGGSPGPGGAVVGPRPTTSPQAPRRAALEPP
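
The CDS length in the table (2469) and the protein sequence can be cross-checked: a complete CDS of 2469 nt holds 823 codons, so after dropping the terminal stop codon the protein should have 822 residues, which appears to match the sequence shown. The following is a minimal sketch of that consistency check. The function names `expected_protein_length` and `check_cds_against_protein` are hypothetical and not part of any database tooling, and the check assumes a complete CDS with an ATG start and a standard stop codon.

```python
# Standard stop codons (assumption: standard nuclear genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_length):
    """Protein length implied by a complete CDS that includes its stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the stop codon


def check_cds_against_protein(cds, protein):
    """Return a list of problems found; an empty list means the pair is consistent.

    Only lengths and the start/stop codons are checked; the CDS is not translated.
    """
    # Remove line breaks and other whitespace from pasted sequence blocks.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    if len(protein) != len(cds) // 3 - 1:
        problems.append("protein length does not match CDS length")
    return problems


if __name__ == "__main__":
    # CDS length taken from the table above.
    print(expected_protein_length(2469))
    # Toy example (not the real sequences): ATG TAT TGA encodes M, Y, stop.
    print(check_cds_against_protein("ATGTATTGA", "MY"))
```

To check this record, pass the full nucleotide and protein blocks above as the two string arguments; embedded line breaks are stripped before comparison.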