New model in OGS2.0 | DPOGS213101  |
---|---|
Genomic Position | scaffold1526:- 38960-44446 |
CDS Length | 2757 |
Paired RNAseq reads   | 269 |
Single RNAseq reads   | 666 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007861 (1e-07) |
Best Drosophila hit   | ND |
Best Human hit | collagen alpha-1(XI) chain isoform C preproprotein (8e-31) |
Best NR hit (blastp)   | PREDICTED: similar to Collagen alpha-1(XI) chain precursor isoform 1 [Apis mellifera] (9e-140) |
Best NR hit (blastx)   | collagen alpha chain [Culex quinquefasciatus] (1e-35) |
GeneOntology terms    | GO:0005201 extracellular matrix structural constituent; GO:0005581 collagen; GO:0005198 structural molecule activity; GO:0007155 cell adhesion |
InterPro families    | IPR008985 Concanavalin A-like lectin/glucanase; IPR003129 Laminin G, thrombospondin-type, N-terminal; IPR013320 Concanavalin A-like lectin/glucanase, subgroup; IPR008160 Collagen triple helix repeat; IPR012680 Laminin G, subdomain 2 |
Orthology group | MCL10248 |
Nucleotide sequence:
ATGAAAGGGTTGCTCCTTATATTGTTTTGCGCTCTCGCCCGGGGAAAGGGGGAAGACAAG
CTCGAGATATTTGACGTTCTCTCCGTGGTCAACTATGATGAACTTCCTGAAGGGGTGTCA
CGGACTCCAGGGCGATGTCAAAATAATCCTTCAGAGCAAGATTACTCAGCTCTTTCCCTT
AACGAGAACGCAACATTGCATCAGCTCGCTGCAGGAATGTTTTATAATACATTTCCTGAA
GACTTTTCCATATTGTCAGTCGTTAGACTAAATGGCCCAGAACAAAATCCTCTCTTCGTA
TTATACTCGGACGCCGGAGACGAACAACTACAAGTGGTCGTTGGGGAGATGGTAGAATTG
TATTACGAAGATACACGAGGAGATCCCGAGGATCATGAGCTACTCAGCTTCCGAGTCAAC
ATTACAGATGAAAAATGGCATAGAGTGGCACTAAGTATAAAGGGTGACTCAGTAACATTG
TTAGTAGACTGTGAAATTCAAGATACACTGCCTTTACGACGTCATCCGGGAAGTACATTC
AACTTGGCAGGAGTTTTAGTAGTCGGAGCTCAAGTGACGCCAGATCAATATTACGAGGGC
GATATTGAATTGCTCCAATTTGCTAATCAACCGGACCTTGCCTACGATATGTGTATATCA
ATCGCCCCAGACTGTGGATCCTTTGGGTTGCAAACATCAGATTTAGCGTTGGATAATACT
GATTACGATGAAAAAGACTACGACGATCGTATTGAAGTAACGGAATCGTTAGTTCACGTT
GAAAGTGAGGGGAAACCTTCGGATCAATTTGATCAGAGTTTACTTATAGAAAGACAGAAT
CAATCGGAGAATCAGTGGGCAGTTTCAAGCAGAGATATGCCTACATTGAGGCCGTATTGG
CAAGTCTCGGAGTCGTATGCTGTGACCCCACCCTATCCTCTAGTAACAGATAATGATGAA
AATAGCATCTTCGGGAGATTTACGGATTCAGACGATTTTATTTCATCCGGTAACATTCAA
GAAACGCCACCCTCACTCCTAGAAGTATCTAGTACAACGGTTAAAAATGATGTTGATGAA
GACGTTGCTCTGACCACAACTGAAGTATCGTCTGCGGGGATAACGACTGAAGATACGGAT
TATCTCACCCACCCACCAGATATTAGGGGTAACTCTTCTACTGGGTATGATAATTCAGAC
AGTTACTACGACTATGGCACAATTGGCACTTATTTGGGACCACGTGGGTACCCTGGACCT
CCTGGTAGGCAAGGTCCACGAGGACCCAAAGGGGAACCTGGGAAGCCAGGTGCCGAGGGT
CAACAAGGGTTTCAGGGAGCTCCAGGGCACGTTTTCGTAGTCCCGTTGCCACAGTCGGGA
AATGATAAGGGACCTGACGCACATTCGGAAGCACTACGCCAAATGTTAACACAACATATG
GCTTCAATGCGGGGTGCGGAAGGCCCCATGGGTCTCACAGGACCGCCGGGTCCTGAAGGC
CCTACTGGAGTTGAAGGTTCGAAGGGCGAACAAGGAGATCAAGGTGAACCAGGACCACCA
GGATCAAGGGGCCTCCAAGGACAACCTGGAAGATTAGGACGTCGAGGCCACCCAGGCAGA
GATGGAGAACGAGGTCCACCCGGACCTCAAGGTCTAAAAGGAGATCAAGGATACCCTGGC
CAAGCCGGAATACCTGGAGATAAAGGCGAAAGGGGTACACCAGGACAACAGGGAGAAACA
GGTGCTCCAGGTTTAGATGGACCTCCTGGTGAAGATGGACCTCCAGGACCCCCAGGAATT
TCGGGTGAATTGGGGCCTAGAGGCTTTACCGGCCCAAGAGGATTTCCGGGTCTTATTGGA
TATCCCGGTATACCAGGAAACGAAGGCCAACAAGGCATAAAAGGTGCCGCAGGACAGCCA
GGTCCACCTGGATCCCCGGGTCAACCAGGAGTAATGGGACCACCCGGATCCCCAGGACCT
CAGGGTCCCATTGGGGCCCCAGGATTACAGGGGTCCCAAGGGAAGCAGGGAATATCTGGT
TTGCCAGGACCCGAAGGCTCACCAGGTACACCGGGTACACCTGGACAACAAGGTCCTGCA
GGAGATGTGGGATTACCTGGTCCACAGGGTATGTTAGGATTTCCGGGGCCACGAGGTCTA
AAAGGAGATGACGGACCGCGTGGTCCACCCGGTGATAAAGGAGACAAGGGAATAAGAGGA
ATTGAAGGAGAGAAAGGTGAAATGGGGCAAAAAGGGGAGCGCGGGGTGGCTGGAGAGCCT
GGCCCTGCTGGTATCGAAGGACCAGAAGGACAAAAAGGTTCAGAAGGTCCTAGAGGTGAA
ACTGGTTCAATCGGTCCTGTTGGTGAAAAAGGTGCAACAGGACCTCAAGGACCATCAGGT
TACCCTGGAGCTCAAGGCGAAAAGGGAGATAAAGGGGCTTCCGGCAGACGGGGAAGACGA
GGGAGCAAAGGAGTTGCGGGCTTAGTAGGAATTCCCGGAGACCGAGGCGAAAGCGGACCA
AGGGGCTATCGTGGCCCAAGAGGTCGTAGAGGATCAGATGGGCCGCCAGGACCTAAAGGC
GATACAGGACAACCCGGGCCTCCGGGGTCAAGTGGTGAACGTGGTCCACAGGGTTTGGAA
GGGCCTAGGGGATATCCTGGGTCCATCGGTCCACCGGGTATTGATGGAAAACCAGGAATT
CCAGGGACTCCCGGAGAAAGAGGATCGACGGTACTTGCAATTTATAATTATTTTTAG
Protein sequence:
MKGLLLILFCALARGKGEDKLEIFDVLSVVNYDELPEGVSRTPGRCQNNPSEQDYSALSL
NENATLHQLAAGMFYNTFPEDFSILSVVRLNGPEQNPLFVLYSDAGDEQLQVVVGEMVEL
YYEDTRGDPEDHELLSFRVNITDEKWHRVALSIKGDSVTLLVDCEIQDTLPLRRHPGSTF
NLAGVLVVGAQVTPDQYYEGDIELLQFANQPDLAYDMCISIAPDCGSFGLQTSDLALDNT
DYDEKDYDDRIEVTESLVHVESEGKPSDQFDQSLLIERQNQSENQWAVSSRDMPTLRPYW
QVSESYAVTPPYPLVTDNDENSIFGRFTDSDDFISSGNIQETPPSLLEVSSTTVKNDVDE
DVALTTTEVSSAGITTEDTDYLTHPPDIRGNSSTGYDNSDSYYDYGTIGTYLGPRGYPGP
PGRQGPRGPKGEPGKPGAEGQQGFQGAPGHVFVVPLPQSGNDKGPDAHSEALRQMLTQHM
ASMRGAEGPMGLTGPPGPEGPTGVEGSKGEQGDQGEPGPPGSRGLQGQPGRLGRRGHPGR
DGERGPPGPQGLKGDQGYPGQAGIPGDKGERGTPGQQGETGAPGLDGPPGEDGPPGPPGI
SGELGPRGFTGPRGFPGLIGYPGIPGNEGQQGIKGAAGQPGPPGSPGQPGVMGPPGSPGP
QGPIGAPGLQGSQGKQGISGLPGPEGSPGTPGTPGQQGPAGDVGLPGPQGMLGFPGPRGL
KGDDGPRGPPGDKGDKGIRGIEGEKGEMGQKGERGVAGEPGPAGIEGPEGQKGSEGPRGE
TGSIGPVGEKGATGPQGPSGYPGAQGEKGDKGASGRRGRRGSKGVAGLVGIPGDRGESGP
RGYRGPRGRRGSDGPPGPKGDTGQPGPPGSSGERGPQGLEGPRGYPGSIGPPGIDGKPGI
PGTPGERGSTVLAIYNYF
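
The CDS length and the two sequences in this record can be cross-checked against each other. The following is a minimal sketch of such a check, not part of the original annotation pipeline: the function names `clean` and `check_cds` are hypothetical, and the demo input is a toy string built from the record's first three codons (ATG AAA GGG, encoding MKG) followed by its stop codon (TAG).

```python
# Hypothetical helper for sanity-checking a CDS against its protein sequence.
# It only compares lengths and boundary codons; it does not translate.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove whitespace and line breaks from a wrapped sequence, upper-case it."""
    return "".join(seq.split()).upper()


def check_cds(cds: str, protein: str) -> dict:
    """Return basic consistency checks between a CDS and its protein."""
    cds, protein = clean(cds), clean(protein)
    return {
        "cds_length": len(cds),
        "protein_length": len(protein),
        "multiple_of_three": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        # One codon per residue, plus one terminal stop codon.
        "lengths_agree": len(cds) == 3 * (len(protein) + 1),
    }


# Toy input: first three codons of the record plus its stop codon,
# wrapped over two lines to mimic the 60-column layout above.
report = check_cds("ATGAAAGGG\nTAG", "MKG")
print(report)
```

Applied to the full record, the same arithmetic gives 2757 = 3 × 919, that is, 918 residues plus one stop codon, which matches the lengths of the sequences listed above.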