New model in OGS2.0 | DPOGS204559  |
---|---|
Genomic Position | scaffold1545:+ 20619-33006 |
See gene structure | |
CDS Length | 2103 |
Paired RNAseq reads   | 228 |
Single RNAseq reads   | 555 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004329 (6e-54) |
Best Drosophila hit   | ND |
Best Human hit | collagen alpha-1(IX) chain isoform 1 precursor (3e-19) |
Best NR hit (blastp)   | PREDICTED: similar to Collagen alpha-1(IX) chain precursor [Apis mellifera] (3e-60) |
Best NR hit (blastx)   | collagen alpha-1 precursor, putative [Pediculus humanus corporis] (1e-22) |
GeneOntology terms    | GO:0046872 metal ion binding GO:0005578 proteinaceous extracellular matrix GO:0005576 extracellular region GO:0007155 cell adhesion GO:0005198 structural molecule activity GO:0051216 cartilage development GO:0003417 growth plate cartilage development GO:0001894 tissue homeostasis |
InterPro families    | IPR008160 Collagen triple helix repeat IPR003129 Laminin G, thrombospondin-type, N-terminal IPR008985 Concanavalin A-like lectin/glucanase |
Orthology group | MCL39849 |
Nucleotide sequence:
ATGGATGTATTCTGGGTTATTTTCCTTCTTTTGGGAGCGTTATTTAAACACGGAAGCAAT
GGAGATGCAGAAACAGTTACAGCGTCTCCGTTAAACGCCTGCCAGTCACTTCGTCCGGGC
GACATAGACTTCCAATCAGTGGATCTGATAGCGGTGTACCGTTTGGACGGAGCTGATACA
ACAGGGGTCACCTTGGTCCAGGGCTCTCAGGACCTGCAGAGGGCGTACCGCGTTGGGGAT
GGAGCGAATCTTACTTTGCCTTTACGCGAGGCTCTACCAGCGGGTTTGCCTTCAACGTTC
ACTATAACTTCTACCTTCAGAACAAACAATCGCCGACCGTGGAGTCTTATCAGAGTTCGT
TCTACTTCTCTTCTTTTCTCTATAACTCTCTTACCAAATGTTAAGAAGATGGCCGTTTTC
GTTCAGGGATCACGTGTTGTCTTCTCTACGCCCACACTGTTCAAGCCGTTCTGGCACAAA
GTTCATATAGCTATAGACAACGATACTGTACACGCGGCCATAGACTGTAATGAGTTGGAG
CCGGAATCAATAGGTGGATGGGACTTTGATAACGCGACCAGCATCAGTATCGTCTCCAAC
GATGATGGAACCCCAGCTCCTGTAGACCTCCAATGGCTGTCATTGAGTTGCAATCGCTAC
AACATTACAGAAGACAGTTGTGAAGAAATCGAAATACCGGAGTCACTTATCGCAACCGTC
ACCCCTCCAATCGTAACACCGGGAGCAAACGACTTTCCTCTTGTATGTAATCAAACATGT
CCACCAGGTCCAGTGGGTCCGCCGGGGCCACCGGGAGAGATTGGTCCTCTTGGGTACACG
GGACTGCCAGGGAAACGAGGTGTGGACGGGCCTCCAGGCCCCCTAGGTCCGACGGGACCT
AAAGGAGAAAAAGGAGACATAGGTCCCCCGGGCTCTGCAAGCAATGTCTCCGTGATTGGC
CCGCCAGGGGTACCTGGGAGAAAAGGTTCCAAAGGTGACAAAGGAGATTCGGGAGAAAAA
GGTGATAGAGGTGATGTGGGCCTGGTGGGCCTGGCTGGAGTCCCCGGTGTCGATGGGAAG
GATGGTCCACCAGGTCCCGTCGGTCCTCCAGGGGCTCCCGGTGAACCTGGCCCTGTGGGC
CCCCCAGGACCCGCCAGTAAAGGATTTCTTCCTCTTGTGCAAGGTAGTAAGGGCGAGCAA
GGAATTCCGGGGGAGCCTGGTAGAGATGGCTACCCCGGAGTGAGAGGTTTACCAGGATTA
GATGGAACACCGGGAAGCCCGGGTATCCAAGGGATGCAAGGGTTACCTGGATTGCCCGGA
GAGAGAGGCTTGATTGGTCTTCCGGGTACTCCGGGAGAAATGGGCCCTGAAGGTCCAGCG
GGACCCCAAGGTCCGCCAGGACTTCCAGGACCTGCCGGTCCTCCAGGTGTGAGTACATCA
ACAGCTGGTGTTACCGTTCCAGGCCCTCCAGGGCCACCGGGTTTAATGGGTCTGAAAGGT
GAACAAGGATTCCCAGGATTGCCAGGGCGAGATGGCTTAGACGGTATCCCTGGGCTACCC
GGGCAGAGGGGTCCTCCAGGGCCCCCCGGTTCACTTAGCTTAGTACAAGAACAACGTCCA
TCATTATCAGAAAACGACGTGAGAAACATCTGTGAGGACATAATAAAAGTGCGTCTGGCT
GACTTTTCATCGGGTCTTGTGATGCCAACTGCGAAACCGGGACGCAGAGGCCCCCCAGGA
CCGCCGGGGGCACCAGGGAGCCCCGGTTCCGTGGGCGAAACAGGACCGATGGGTCCCAGG
GGGTATCCGGGTGAAACAGGCGAACCTGGTCGACCAGGATATCCAGGACCCAGCGGCGAC
AAAGGAGACAAAGGGGATAGAGGTCCTCAAGGAGTGGGCATCCCAGGTCCAGAGGGATCA
CCCGGGATGACTGGTCCCATGGGTCCCGCTGGGATCGAAGGAAGAACTGGACCTCGGGGT
GATCCGGGCCCGTCAGGTCCTGTAGGCCCACGGGGAGTTCCAGGTCCAAGAGGAAGCTGC
GACTGTTCATCATCAGCGTATTACGCGTACGCGCCCATTTTAGGGAACAACAAAGGACCC
TAG
Protein sequence:
MDVFWVIFLLLGALFKHGSNGDAETVTASPLNACQSLRPGDIDFQSVDLIAVYRLDGADT
TGVTLVQGSQDLQRAYRVGDGANLTLPLREALPAGLPSTFTITSTFRTNNRRPWSLIRVR
STSLLFSITLLPNVKKMAVFVQGSRVVFSTPTLFKPFWHKVHIAIDNDTVHAAIDCNELE
PESIGGWDFDNATSISIVSNDDGTPAPVDLQWLSLSCNRYNITEDSCEEIEIPESLIATV
TPPIVTPGANDFPLVCNQTCPPGPVGPPGPPGEIGPLGYTGLPGKRGVDGPPGPLGPTGP
KGEKGDIGPPGSASNVSVIGPPGVPGRKGSKGDKGDSGEKGDRGDVGLVGLAGVPGVDGK
DGPPGPVGPPGAPGEPGPVGPPGPASKGFLPLVQGSKGEQGIPGEPGRDGYPGVRGLPGL
DGTPGSPGIQGMQGLPGLPGERGLIGLPGTPGEMGPEGPAGPQGPPGLPGPAGPPGVSTS
TAGVTVPGPPGPPGLMGLKGEQGFPGLPGRDGLDGIPGLPGQRGPPGPPGSLSLVQEQRP
SLSENDVRNICEDIIKVRLADFSSGLVMPTAKPGRRGPPGPPGAPGSPGSVGETGPMGPR
GYPGETGEPGRPGYPGPSGDKGDKGDRGPQGVGIPGPEGSPGMTGPMGPAGIEGRTGPRG
DPGPSGPVGPRGVPGPRGSCDCSSSAYYAYAPILGNNKGP