DPGLEAN04970 in OGS1.0

New model in OGS2.0DPOGS213101 
Genomic Positionscaffold1526:- 38960-44446
See gene structure
CDS Length2757
Paired RNAseq reads  269
Single RNAseq reads  666
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007861 (1e-07)
Best Drosophila hit  ND
Best Human hitcollagen alpha-1(XI) chain isoform C preproprotein (8e-31)
Best NR hit (blastp)  PREDICTED: similar to Collagen alpha-1(XI) chain precursor isoform 1 [Apis mellifera] (9e-140)
Best NR hit (blastx)  collagen alpha chain [Culex quinquefasciatus] (1e-35)
GeneOntology terms


  
GO:0005201 extracellular matrix structural constituent
GO:0005581 collagen
GO:0005198 structural molecule activity
GO:0007155 cell adhesion
InterPro families



  
IPR008985 Concanavalin A-like lectin/glucanase
IPR003129 Laminin G, thrombospondin-type, N-terminal
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
IPR008160 Collagen triple helix repeat
IPR012680 Laminin G, subdomain 2
Orthology groupMCL10248

Nucleotide sequence:

ATGAAAGGGTTGCTCCTTATATTGTTTTGCGCTCTCGCCCGGGGAAAGGGGGAAGACAAG
CTCGAGATATTTGACGTTCTCTCCGTGGTCAACTATGATGAACTTCCTGAAGGGGTGTCA
CGGACTCCAGGGCGATGTCAAAATAATCCTTCAGAGCAAGATTACTCAGCTCTTTCCCTT
AACGAGAACGCAACATTGCATCAGCTCGCTGCAGGAATGTTTTATAATACATTTCCTGAA
GACTTTTCCATATTGTCAGTCGTTAGACTAAATGGCCCAGAACAAAATCCTCTCTTCGTA
TTATACTCGGACGCCGGAGACGAACAACTACAAGTGGTCGTTGGGGAGATGGTAGAATTG
TATTACGAAGATACACGAGGAGATCCCGAGGATCATGAGCTACTCAGCTTCCGAGTCAAC
ATTACAGATGAAAAATGGCATAGAGTGGCACTAAGTATAAAGGGTGACTCAGTAACATTG
TTAGTAGACTGTGAAATTCAAGATACACTGCCTTTACGACGTCATCCGGGAAGTACATTC
AACTTGGCAGGAGTTTTAGTAGTCGGAGCTCAAGTGACGCCAGATCAATATTACGAGGGC
GATATTGAATTGCTCCAATTTGCTAATCAACCGGACCTTGCCTACGATATGTGTATATCA
ATCGCCCCAGACTGTGGATCCTTTGGGTTGCAAACATCAGATTTAGCGTTGGATAATACT
GATTACGATGAAAAAGACTACGACGATCGTATTGAAGTAACGGAATCGTTAGTTCACGTT
GAAAGTGAGGGGAAACCTTCGGATCAATTTGATCAGAGTTTACTTATAGAAAGACAGAAT
CAATCGGAGAATCAGTGGGCAGTTTCAAGCAGAGATATGCCTACATTGAGGCCGTATTGG
CAAGTCTCGGAGTCGTATGCTGTGACCCCACCCTATCCTCTAGTAACAGATAATGATGAA
AATAGCATCTTCGGGAGATTTACGGATTCAGACGATTTTATTTCATCCGGTAACATTCAA
GAAACGCCACCCTCACTCCTAGAAGTATCTAGTACAACGGTTAAAAATGATGTTGATGAA
GACGTTGCTCTGACCACAACTGAAGTATCGTCTGCGGGGATAACGACTGAAGATACGGAT
TATCTCACCCACCCACCAGATATTAGGGGTAACTCTTCTACTGGGTATGATAATTCAGAC
AGTTACTACGACTATGGCACAATTGGCACTTATTTGGGACCACGTGGGTACCCTGGACCT
CCTGGTAGGCAAGGTCCACGAGGACCCAAAGGGGAACCTGGGAAGCCAGGTGCCGAGGGT
CAACAAGGGTTTCAGGGAGCTCCAGGGCACGTTTTCGTAGTCCCGTTGCCACAGTCGGGA
AATGATAAGGGACCTGACGCACATTCGGAAGCACTACGCCAAATGTTAACACAACATATG
GCTTCAATGCGGGGTGCGGAAGGCCCCATGGGTCTCACAGGACCGCCGGGTCCTGAAGGC
CCTACTGGAGTTGAAGGTTCGAAGGGCGAACAAGGAGATCAAGGTGAACCAGGACCACCA
GGATCAAGGGGCCTCCAAGGACAACCTGGAAGATTAGGACGTCGAGGCCACCCAGGCAGA
GATGGAGAACGAGGTCCACCCGGACCTCAAGGTCTAAAAGGAGATCAAGGATACCCTGGC
CAAGCCGGAATACCTGGAGATAAAGGCGAAAGGGGTACACCAGGACAACAGGGAGAAACA
GGTGCTCCAGGTTTAGATGGACCTCCTGGTGAAGATGGACCTCCAGGACCCCCAGGAATT
TCGGGTGAATTGGGGCCTAGAGGCTTTACCGGCCCAAGAGGATTTCCGGGTCTTATTGGA
TATCCCGGTATACCAGGAAACGAAGGCCAACAAGGCATAAAAGGTGCCGCAGGACAGCCA
GGTCCACCTGGATCCCCGGGTCAACCAGGAGTAATGGGACCACCCGGATCCCCAGGACCT
CAGGGTCCCATTGGGGCCCCAGGATTACAGGGGTCCCAAGGGAAGCAGGGAATATCTGGT
TTGCCAGGACCCGAAGGCTCACCAGGTACACCGGGTACACCTGGACAACAAGGTCCTGCA
GGAGATGTGGGATTACCTGGTCCACAGGGTATGTTAGGATTTCCGGGGCCACGAGGTCTA
AAAGGAGATGACGGACCGCGTGGTCCACCCGGTGATAAAGGAGACAAGGGAATAAGAGGA
ATTGAAGGAGAGAAAGGTGAAATGGGGCAAAAAGGGGAGCGCGGGGTGGCTGGAGAGCCT
GGCCCTGCTGGTATCGAAGGACCAGAAGGACAAAAAGGTTCAGAAGGTCCTAGAGGTGAA
ACTGGTTCAATCGGTCCTGTTGGTGAAAAAGGTGCAACAGGACCTCAAGGACCATCAGGT
TACCCTGGAGCTCAAGGCGAAAAGGGAGATAAAGGGGCTTCCGGCAGACGGGGAAGACGA
GGGAGCAAAGGAGTTGCGGGCTTAGTAGGAATTCCCGGAGACCGAGGCGAAAGCGGACCA
AGGGGCTATCGTGGCCCAAGAGGTCGTAGAGGATCAGATGGGCCGCCAGGACCTAAAGGC
GATACAGGACAACCCGGGCCTCCGGGGTCAAGTGGTGAACGTGGTCCACAGGGTTTGGAA
GGGCCTAGGGGATATCCTGGGTCCATCGGTCCACCGGGTATTGATGGAAAACCAGGAATT
CCAGGGACTCCCGGAGAAAGAGGATCGACGGTACTTGCAATTTATAATTATTTTTAG

Protein sequence:

MKGLLLILFCALARGKGEDKLEIFDVLSVVNYDELPEGVSRTPGRCQNNPSEQDYSALSL
NENATLHQLAAGMFYNTFPEDFSILSVVRLNGPEQNPLFVLYSDAGDEQLQVVVGEMVEL
YYEDTRGDPEDHELLSFRVNITDEKWHRVALSIKGDSVTLLVDCEIQDTLPLRRHPGSTF
NLAGVLVVGAQVTPDQYYEGDIELLQFANQPDLAYDMCISIAPDCGSFGLQTSDLALDNT
DYDEKDYDDRIEVTESLVHVESEGKPSDQFDQSLLIERQNQSENQWAVSSRDMPTLRPYW
QVSESYAVTPPYPLVTDNDENSIFGRFTDSDDFISSGNIQETPPSLLEVSSTTVKNDVDE
DVALTTTEVSSAGITTEDTDYLTHPPDIRGNSSTGYDNSDSYYDYGTIGTYLGPRGYPGP
PGRQGPRGPKGEPGKPGAEGQQGFQGAPGHVFVVPLPQSGNDKGPDAHSEALRQMLTQHM
ASMRGAEGPMGLTGPPGPEGPTGVEGSKGEQGDQGEPGPPGSRGLQGQPGRLGRRGHPGR
DGERGPPGPQGLKGDQGYPGQAGIPGDKGERGTPGQQGETGAPGLDGPPGEDGPPGPPGI
SGELGPRGFTGPRGFPGLIGYPGIPGNEGQQGIKGAAGQPGPPGSPGQPGVMGPPGSPGP
QGPIGAPGLQGSQGKQGISGLPGPEGSPGTPGTPGQQGPAGDVGLPGPQGMLGFPGPRGL
KGDDGPRGPPGDKGDKGIRGIEGEKGEMGQKGERGVAGEPGPAGIEGPEGQKGSEGPRGE
TGSIGPVGEKGATGPQGPSGYPGAQGEKGDKGASGRRGRRGSKGVAGLVGIPGDRGESGP
RGYRGPRGRRGSDGPPGPKGDTGQPGPPGSSGERGPQGLEGPRGYPGSIGPPGIDGKPGI
PGTPGERGSTVLAIYNYF