DPGLEAN20342 in OGS1.0

New model in OGS2.0  DPOGS204559
Genomic Position  scaffold1545:+ 20619-33006
CDS Length  2103
Paired RNAseq reads  228
Single RNAseq reads  555
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004329 (6e-54)
Best Drosophila hit  ND
Best Human hit  collagen alpha-1(IX) chain isoform 1 precursor (3e-19)
Best NR hit (blastp)  PREDICTED: similar to Collagen alpha-1(IX) chain precursor [Apis mellifera] (3e-60)
Best NR hit (blastx)  collagen alpha-1 precursor, putative [Pediculus humanus corporis] (1e-22)
GeneOntology terms
GO:0046872 metal ion binding
GO:0005578 proteinaceous extracellular matrix
GO:0005576 extracellular region
GO:0007155 cell adhesion
GO:0005198 structural molecule activity
GO:0051216 cartilage development
GO:0003417 growth plate cartilage development
GO:0001894 tissue homeostasis
InterPro families
IPR008160 Collagen triple helix repeat
IPR003129 Laminin G, thrombospondin-type, N-terminal
IPR008985 Concanavalin A-like lectin/glucanase
Orthology group  MCL39849

Nucleotide sequence:

ATGGATGTATTCTGGGTTATTTTCCTTCTTTTGGGAGCGTTATTTAAACACGGAAGCAAT
GGAGATGCAGAAACAGTTACAGCGTCTCCGTTAAACGCCTGCCAGTCACTTCGTCCGGGC
GACATAGACTTCCAATCAGTGGATCTGATAGCGGTGTACCGTTTGGACGGAGCTGATACA
ACAGGGGTCACCTTGGTCCAGGGCTCTCAGGACCTGCAGAGGGCGTACCGCGTTGGGGAT
GGAGCGAATCTTACTTTGCCTTTACGCGAGGCTCTACCAGCGGGTTTGCCTTCAACGTTC
ACTATAACTTCTACCTTCAGAACAAACAATCGCCGACCGTGGAGTCTTATCAGAGTTCGT
TCTACTTCTCTTCTTTTCTCTATAACTCTCTTACCAAATGTTAAGAAGATGGCCGTTTTC
GTTCAGGGATCACGTGTTGTCTTCTCTACGCCCACACTGTTCAAGCCGTTCTGGCACAAA
GTTCATATAGCTATAGACAACGATACTGTACACGCGGCCATAGACTGTAATGAGTTGGAG
CCGGAATCAATAGGTGGATGGGACTTTGATAACGCGACCAGCATCAGTATCGTCTCCAAC
GATGATGGAACCCCAGCTCCTGTAGACCTCCAATGGCTGTCATTGAGTTGCAATCGCTAC
AACATTACAGAAGACAGTTGTGAAGAAATCGAAATACCGGAGTCACTTATCGCAACCGTC
ACCCCTCCAATCGTAACACCGGGAGCAAACGACTTTCCTCTTGTATGTAATCAAACATGT
CCACCAGGTCCAGTGGGTCCGCCGGGGCCACCGGGAGAGATTGGTCCTCTTGGGTACACG
GGACTGCCAGGGAAACGAGGTGTGGACGGGCCTCCAGGCCCCCTAGGTCCGACGGGACCT
AAAGGAGAAAAAGGAGACATAGGTCCCCCGGGCTCTGCAAGCAATGTCTCCGTGATTGGC
CCGCCAGGGGTACCTGGGAGAAAAGGTTCCAAAGGTGACAAAGGAGATTCGGGAGAAAAA
GGTGATAGAGGTGATGTGGGCCTGGTGGGCCTGGCTGGAGTCCCCGGTGTCGATGGGAAG
GATGGTCCACCAGGTCCCGTCGGTCCTCCAGGGGCTCCCGGTGAACCTGGCCCTGTGGGC
CCCCCAGGACCCGCCAGTAAAGGATTTCTTCCTCTTGTGCAAGGTAGTAAGGGCGAGCAA
GGAATTCCGGGGGAGCCTGGTAGAGATGGCTACCCCGGAGTGAGAGGTTTACCAGGATTA
GATGGAACACCGGGAAGCCCGGGTATCCAAGGGATGCAAGGGTTACCTGGATTGCCCGGA
GAGAGAGGCTTGATTGGTCTTCCGGGTACTCCGGGAGAAATGGGCCCTGAAGGTCCAGCG
GGACCCCAAGGTCCGCCAGGACTTCCAGGACCTGCCGGTCCTCCAGGTGTGAGTACATCA
ACAGCTGGTGTTACCGTTCCAGGCCCTCCAGGGCCACCGGGTTTAATGGGTCTGAAAGGT
GAACAAGGATTCCCAGGATTGCCAGGGCGAGATGGCTTAGACGGTATCCCTGGGCTACCC
GGGCAGAGGGGTCCTCCAGGGCCCCCCGGTTCACTTAGCTTAGTACAAGAACAACGTCCA
TCATTATCAGAAAACGACGTGAGAAACATCTGTGAGGACATAATAAAAGTGCGTCTGGCT
GACTTTTCATCGGGTCTTGTGATGCCAACTGCGAAACCGGGACGCAGAGGCCCCCCAGGA
CCGCCGGGGGCACCAGGGAGCCCCGGTTCCGTGGGCGAAACAGGACCGATGGGTCCCAGG
GGGTATCCGGGTGAAACAGGCGAACCTGGTCGACCAGGATATCCAGGACCCAGCGGCGAC
AAAGGAGACAAAGGGGATAGAGGTCCTCAAGGAGTGGGCATCCCAGGTCCAGAGGGATCA
CCCGGGATGACTGGTCCCATGGGTCCCGCTGGGATCGAAGGAAGAACTGGACCTCGGGGT
GATCCGGGCCCGTCAGGTCCTGTAGGCCCACGGGGAGTTCCAGGTCCAAGAGGAAGCTGC
GACTGTTCATCATCAGCGTATTACGCGTACGCGCCCATTTTAGGGAACAACAAAGGACCC
TAG

Protein sequence:

MDVFWVIFLLLGALFKHGSNGDAETVTASPLNACQSLRPGDIDFQSVDLIAVYRLDGADT
TGVTLVQGSQDLQRAYRVGDGANLTLPLREALPAGLPSTFTITSTFRTNNRRPWSLIRVR
STSLLFSITLLPNVKKMAVFVQGSRVVFSTPTLFKPFWHKVHIAIDNDTVHAAIDCNELE
PESIGGWDFDNATSISIVSNDDGTPAPVDLQWLSLSCNRYNITEDSCEEIEIPESLIATV
TPPIVTPGANDFPLVCNQTCPPGPVGPPGPPGEIGPLGYTGLPGKRGVDGPPGPLGPTGP
KGEKGDIGPPGSASNVSVIGPPGVPGRKGSKGDKGDSGEKGDRGDVGLVGLAGVPGVDGK
DGPPGPVGPPGAPGEPGPVGPPGPASKGFLPLVQGSKGEQGIPGEPGRDGYPGVRGLPGL
DGTPGSPGIQGMQGLPGLPGERGLIGLPGTPGEMGPEGPAGPQGPPGLPGPAGPPGVSTS
TAGVTVPGPPGPPGLMGLKGEQGFPGLPGRDGLDGIPGLPGQRGPPGPPGSLSLVQEQRP
SLSENDVRNICEDIIKVRLADFSSGLVMPTAKPGRRGPPGPPGAPGSPGSVGETGPMGPR
GYPGETGEPGRPGYPGPSGDKGDKGDRGPQGVGIPGPEGSPGMTGPMGPAGIEGRTGPRG
DPGPSGPVGPRGVPGPRGSCDCSSSAYYAYAPILGNNKGP
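
The CDS length and the two sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model and that the reported CDS length includes the terminal stop codon; the helper names `expected_protein_length` and `translate` are hypothetical and not part of any annotation pipeline.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def expected_protein_length(cds_length):
    """Residue count implied by a CDS length that includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the nucleotide sequence listed in this record.
first_line = "ATGGATGTATTCTGGGTTATTTTCCTTCTTTTGGGAGCGTTATTTAAACACGGAAGCAAT"
print(expected_protein_length(2103))
print(translate(first_line))
```

Under these assumptions, a CDS of 2103 nucleotides implies 700 residues, which agrees with the length of the protein sequence listed here, and the first 60 nucleotides translate to the first 20 residues of that sequence.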