New model in OGS2.0 | DPOGS210324 |
---|---|
Genomic Position | scaffold2467:+ 42311-52037 |
See gene structure | |
CDS Length | 1677 |
Paired RNAseq reads | 38 |
Single RNAseq reads | 95 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011958 (7e-119) |
Best Drosophila hit | CG7763 (2e-06) |
Best Human hit | versican core protein isoform 1 precursor (6e-11) |
Best NR hit (blastp) | PREDICTED: similar to notch homolog 5 [Tribolium castaneum] (5e-85) |
Best NR hit (blastx) | PREDICTED: similar to notch homolog 5 [Tribolium castaneum] (3e-86) |
GeneOntology terms | GO:0005540 hyaluronic acid binding GO:0005578 proteinaceous extracellular matrix GO:0005488 binding GO:0005576 extracellular region GO:0005509 calcium ion binding GO:0007155 cell adhesion GO:0005529 sugar binding |
InterPro families | IPR016187 C-type lectin fold IPR001304 C-type lectin IPR016186 C-type lectin-like |
Orthology group | MCL18986 |
Nucleotide sequence:
ATGGCAAAGTCGATGGAGGAACTCAGTACTATGCTCGAAGGCCTCAATCGAGTCTCCCAA
CGGGTAGGTCTTTTTATGAACATGGACAAGACGAAACTCATGTCTAATGTCCATGTTGCA
CCTACCCCTGTTATGGTTGAGAACTCGGTACTTACAGTTGTTGACGAGTATATATACCTG
GGACAGACAGTCCAGTTAGGAAGGTCCAACTTCGCGAAAGAGATCAACCGCCGAATCCAG
CTCGGATGGGCAGCGTTCGGGAAGCTCCATAACGTCTTTTCGTCCAAAATACCTCAGTGC
CTTAAGACGAAGTGCCAGTGGACAGGTCATATATCCCGGAGAACAGATGGCCGTTGGGGC
CGAAAAGTGCTCGAATGGAGACCACGGATCGGAAAGTGCAGCGTCGGACGTCCACCAACG
AGATGGACGGACGACTTAGTCAAGGCCGCGGGTTCACGGTGGATGCAGGCCGCTTCCGAC
CGAACCGCTTATTGTGTGATACGTGTCCGAGAGGTTCGGGCAGGCCGTGTTCACTCCGGT
TACCTCCCAAGGAGTCACGGTTGCAAACTGCTCTTCCCCATACCCTCTCCAAAAAACGCC
CTTTTAGTCGAGCTCCACAAGCTAAATGTACCATGTTCCAGCGGATACCTAAGATTTGCT
ACTGGTTTTCCACCGGTATGTGGAAAGCTGGAACAGATAGCAATACCGAACAGACGACAT
TTATACCAATCCTCGAGCAAGCCGGAAATTGAAATCCATGGTCGACCCACGTTCGCCGCG
ACTTATCGTGTTGTAGATCATTGTCATGATGTTCTTCTAACGGAAAGAAACGGCTCGTTC
GAAGTCGGCCCAACATTCAAACTATTTTGCTCCTATAAAATTCACTTGCCTTATGGAAAC
CGAGTTGCCTTACGTCTCCAAATGGGAACTGGTCCGATGGTTAAAAAGAATTCAGACAAT
TTTAATATTATTCATGAAGACGGTCATAGCTTTTGCAAAGGTATGGAGTTGAACCTAGTA
GATGGTGATTCAAGATGGAAACATTGCTCACAGCCGGGGGATCCTTTGCGAAGTGTGCAA
ATAATTTCAGAAAGAAATTCAGTCAAGCTTAATATAAGTATTTTAGCAAAGAAAAATTCA
TCCGCAATGTGGTTAAAAGTATGGTGGATGGATAAACCTATCGAGGAAGTTATAGGACAA
TGTGATTTTGGTTGGGTGGTGTCCGGAGATTTTTGTGTTACCTCTGTGAGGGAAACAAAG
AGTTCGTGGCGACAAGCCGAGCTCGAGTGTGTTCGACTTGGGGGTCACCTGGCAAGCATC
CTTAACGAACGTCAGCAACAAATTATCGACCAACTACTTATTCACACACCAGGAGCCGGC
GTCGATGACGTCTATTGGATAGGTGCCACCGACTCCGTCCACGAAGGAGAATTCCGTTGG
TCGGATGGACTACCTTTTTCATATGCACACTGGTTTCCCGGTTGGCGTAAACACGCTGGC
CAACCAAACGACGACGGAACCTCAGGGCAGGACTGTGTGGAGGTACGACGAGAACTGCCC
CCCAGACCAGCTCATCCAACCTTCATGTGGAACGATAGAAGCTGCAGGGAGAGGAACTAC
TACGTTTGCGAGAGACCAGGCGTTGAAGGTGAGGAAATATTCTTGAGAAAAAATTAG
Protein sequence:
MAKSMEELSTMLEGLNRVSQRVGLFMNMDKTKLMSNVHVAPTPVMVENSVLTVVDEYIYL
GQTVQLGRSNFAKEINRRIQLGWAAFGKLHNVFSSKIPQCLKTKCQWTGHISRRTDGRWG
RKVLEWRPRIGKCSVGRPPTRWTDDLVKAAGSRWMQAASDRTAYCVIRVREVRAGRVHSG
YLPRSHGCKLLFPIPSPKNALLVELHKLNVPCSSGYLRFATGFPPVCGKLEQIAIPNRRH
LYQSSSKPEIEIHGRPTFAATYRVVDHCHDVLLTERNGSFEVGPTFKLFCSYKIHLPYGN
RVALRLQMGTGPMVKKNSDNFNIIHEDGHSFCKGMELNLVDGDSRWKHCSQPGDPLRSVQ
IISERNSVKLNISILAKKNSSAMWLKVWWMDKPIEEVIGQCDFGWVVSGDFCVTSVRETK
SSWRQAELECVRLGGHLASILNERQQQIIDQLLIHTPGAGVDDVYWIGATDSVHEGEFRW
SDGLPFSYAHWFPGWRKHAGQPNDDGTSGQDCVEVRRELPPRPAHPTFMWNDRSCRERNY
YVCERPGVEGEEIFLRKN