New model in OGS2.0 | DPOGS207247  |
---|---|
Genomic Position | scaffold584:+ 22089-41709 |
See gene structure | |
CDS Length | 2718 |
Paired RNAseq reads   | 29 |
Single RNAseq reads   | 76 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012062 (3e-147) |
Best Drosophila hit   | multiplexin, isoform E (9e-71) |
Best Human hit | collagen alpha-1(XV) chain precursor (6e-45) |
Best NR hit (blastp)   | PREDICTED: similar to CG33171-PC, isoform C [Apis mellifera] (2e-116) |
Best NR hit (blastx)   | PREDICTED: similar to collagen alpha 1(xviii) chain [Tribolium castaneum] (4e-109) |
GeneOntology terms    | GO:0005581 collagen GO:0007155 cell adhesion GO:0005198 structural molecule activity GO:0031012 extracellular matrix GO:0005488 binding GO:0008045 motor axon guidance |
InterPro families    | IPR016186 C-type lectin-like IPR010515 Collagenase NC10/endostatin IPR008160 Collagen triple helix repeat IPR016187 C-type lectin fold |
Orthology group | MCL12955 |
Nucleotide sequence:
ATGGCGATGGTGATATCACAGACGACTAAAGCAGCTATTGCTGGATGTTTAGCTTTAATG
ATATTAGGAGCTGTTCTGGTCGTTGGTGTGGCGTTCGGTTGGTTTGATCCCAAACGAGAA
GATGAAGACACACCTGCAAAGATCAGTGCAAGACTTCAAGGCCGGACATCTTTCGTCACT
CACAGAGCTGTGAATCCAGACGAAAGCAGATTACCAGTTATTTTACATACATCCTCATAT
TCTCAGAAAGGAAATAAATTCGAAAATGGGTCGAGAGGAATAGGCCCGTCCGGGATGGAC
CGTAAAGACAAACGTGTAACGCTGTCGAGTCAATGCCAATGTACGGTAAATGATATATTT
AGAATGATGGAGGTTGTACCTGGTTTGGTTGGACCACCTGGACCTCCAGGGCCGACTGGT
GCCGATGGCATGACAGGTGCCCCAGGAAAAACTGGACAAATAGGAGAAACAGGAGCTCCT
GGACCTCAAGGGACAAAGGGTGATCGTGGAGAGAGGGGCGATACAGGTCCCACCGGAATA
GAGGGCCAACCAGGACCTAAAGGAGAGCCTGGTGCTGATGGTTCACCAGGTCTTCAGGGA
CCGCCGGGACCTCCAGGTCCACCTGGATTGCCTTCATCGCCAATGATGGAATCAACAGGA
TTATACGTAGCTGGGGATCGTGGAGTTATGGGACCTCCCGGCGAACGAGGCCCAATGGGT
CTACCCGGACCCCAAGGAGAAAGGGGTTACGCGGGGAATAAAGGCGAAAAAGGATTACAT
GGGGCTAAAGGCGATAAAGGTGAACGGGGGTACGTGGGATTGCGGGGGCCACATGGTGCG
AAAGGGGAACGAGGGGTGCCAGGAAAAGATGGCACACCTGGTTTGCCCGGCGCTCACGGA
CGCCCAGCGGAAAAGGGTGAAAAGGGTGCTCGCGGACTTCCTGGTTTACCCGGACCTTCA
GTAGTCGGCATTTCTGAAAATTCTGTTTTAAGTGAAATCGCACTTCCAGGATCTCGTGAC
GTCATGAAGTTGAAAGGCGAAAGAGGAGAAAGGGGCGAAAAGGGTGAAAAAGGTAGCAGA
GGTATGGAAGGCCCACAAGGTTTTCCAGGCACTGATGGAAGGCCTGGTGAAAGAGGGGAT
ATCGGCCCATCAGGTGTCCCTGGACCTCAAGGAGTACTAGGTCCACCCGGACCTGTAGCA
ATTTCAAGAGAAGAAGCTTTGATCATGACCAAGGGTGAAAAGGGTGAGACCGGTCCCAGG
GGAAAACGGGGTCACCCCGGCGCTCCAGGCCCGAGAGGACCGCCAGGGCTTCCTGGACCC
CCCGGAGTCCCTGGAATTAATGGACCTTCTGGTGATATTGGCCTGCCAGGATGGACGGGT
CCACCGGGTGTAGCGGGTCAACCAGGCCCGCCGGGACAAAAAGGTGAAAAAGGAGACTCC
GGTATATCACCAGCCGACCTCGAAAAGGTAAAAGGTGAAAAAGGTGAACGTGGCTACGAT
GGAACCTCTGGACCACCGGGTAAAGATGGTCCTAGAGGTCCGCCTGGACCTCCAGGAACT
CCAAGCACAAGTTTGCAATATATTCCGGTTCCTGGCCCTCCTGGCCCCCCAGGGCCACCC
GGGCCTCCTGCGGTTTTCACGAATAACGTTCCAATCGACGCTTTGACAGATAGCCCTGGG
ATTAATCGCCTTCAACCTGGCACAGGAAAACCACGAGATCCGCTACAAATTCTAAGAAAC
TTGAATAATTTGATGCAGTACCGCCAAGAACAATTTGAGCCTGGAATTCGAGATTCACTA
GATAGTGACGGAGAAAATACAGATTTCGATGATGAAGAAGATGGCAGGACTCTGGTCGGC
ACTATACTATTTAAATCAACTGAATCATTATTACGGTTGGGAACAAACACTCCTCGAGGA
ACATTAGCATACGTATTGCAAGAGCAAGCGCTCCTTGTAAGAGTCAACAAAGGCTGGCAA
TATGTTGCAATGGGTTCGCTTCTAAAGATACCGAGTCCGCCGGGTAGCGGCGTTACGCTC
ACTCCAGTTCAAAATATATTAGAAACTTCTAGTTTGGTACATCATAAAAACTCGGCAACA
GGCGGACCTGCGCTTCGTCTGGCAACACTTAATGAGCCTCACACAGGCGATATGCATGGA
GTCAGCAGCACAAACTACGAATGTCACAGACAAGCTGAAAGATCCGGATTAGATGGAACT
TTCAGAGCCTTTATTACTTCAAGGGTACAAAACATAGAGTCCATAGTGAATTGGGTGGAC
CGTGAAATACCAGTAGTGAATATCCGAGGGGACATTCTCTTCAATTCGTGGGGTGAAATG
TTGGATGGGTCTGGTGCTGTATTTGCACACGCTCCTAAATTATACAGCTTCAATGGAAAA
AACGTAATGATGGATCCCAGTTGGCCAACAAAAGCTGTTTGGCATGGGGCCACACCAAAT
GGGGAACCGGCAATGGATGCGTATTGTGACGCATGGCACAGCAGTAGCCCGACAAAATTC
GGATTGGCCTCTTCATTACGCTCTAACAAGCTTTTAGATCAAGAAACGTACCCGTGCAGC
ACGCGACTAATCGTGCTCTGCATTGAAACTACTCCGCTTAACACAGTGAGAAGAAAAAAA
CGTTCCAAATATCGGGTATCCGACAAAACACATTTCCTCAAAGACATCGAAAAACGAAAC
GAAACTCTAAACTTATAG
Protein sequence:
MAMVISQTTKAAIAGCLALMILGAVLVVGVAFGWFDPKREDEDTPAKISARLQGRTSFVT
HRAVNPDESRLPVILHTSSYSQKGNKFENGSRGIGPSGMDRKDKRVTLSSQCQCTVNDIF
RMMEVVPGLVGPPGPPGPTGADGMTGAPGKTGQIGETGAPGPQGTKGDRGERGDTGPTGI
EGQPGPKGEPGADGSPGLQGPPGPPGPPGLPSSPMMESTGLYVAGDRGVMGPPGERGPMG
LPGPQGERGYAGNKGEKGLHGAKGDKGERGYVGLRGPHGAKGERGVPGKDGTPGLPGAHG
RPAEKGEKGARGLPGLPGPSVVGISENSVLSEIALPGSRDVMKLKGERGERGEKGEKGSR
GMEGPQGFPGTDGRPGERGDIGPSGVPGPQGVLGPPGPVAISREEALIMTKGEKGETGPR
GKRGHPGAPGPRGPPGLPGPPGVPGINGPSGDIGLPGWTGPPGVAGQPGPPGQKGEKGDS
GISPADLEKVKGEKGERGYDGTSGPPGKDGPRGPPGPPGTPSTSLQYIPVPGPPGPPGPP
GPPAVFTNNVPIDALTDSPGINRLQPGTGKPRDPLQILRNLNNLMQYRQEQFEPGIRDSL
DSDGENTDFDDEEDGRTLVGTILFKSTESLLRLGTNTPRGTLAYVLQEQALLVRVNKGWQ
YVAMGSLLKIPSPPGSGVTLTPVQNILETSSLVHHKNSATGGPALRLATLNEPHTGDMHG
VSSTNYECHRQAERSGLDGTFRAFITSRVQNIESIVNWVDREIPVVNIRGDILFNSWGEM
LDGSGAVFAHAPKLYSFNGKNVMMDPSWPTKAVWHGATPNGEPAMDAYCDAWHSSSPTKF
GLASSLRSNKLLDQETYPCSTRLIVLCIETTPLNTVRRKKRSKYRVSDKTHFLKDIEKRN
ETLNL