DPGLEAN18885 in OGS1.0

New model in OGS2.0DPOGS210324 
Genomic Positionscaffold2467:+ 42311-52037
See gene structure
CDS Length1677
Paired RNAseq reads  38
Single RNAseq reads  95
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011958 (7e-119)
Best Drosophila hit  CG7763 (2e-06)
Best Human hitversican core protein isoform 1 precursor (6e-11)
Best NR hit (blastp)  PREDICTED: similar to notch homolog 5 [Tribolium castaneum] (5e-85)
Best NR hit (blastx)  PREDICTED: similar to notch homolog 5 [Tribolium castaneum] (3e-86)
GeneOntology terms





  
GO:0005540 hyaluronic acid binding
GO:0005578 proteinaceous extracellular matrix
GO:0005488 binding
GO:0005576 extracellular region
GO:0005509 calcium ion binding
GO:0007155 cell adhesion
GO:0005529 sugar binding
InterPro families

  
IPR016187 C-type lectin fold
IPR001304 C-type lectin
IPR016186 C-type lectin-like
Orthology groupMCL18986

Nucleotide sequence:

ATGGCAAAGTCGATGGAGGAACTCAGTACTATGCTCGAAGGCCTCAATCGAGTCTCCCAA
CGGGTAGGTCTTTTTATGAACATGGACAAGACGAAACTCATGTCTAATGTCCATGTTGCA
CCTACCCCTGTTATGGTTGAGAACTCGGTACTTACAGTTGTTGACGAGTATATATACCTG
GGACAGACAGTCCAGTTAGGAAGGTCCAACTTCGCGAAAGAGATCAACCGCCGAATCCAG
CTCGGATGGGCAGCGTTCGGGAAGCTCCATAACGTCTTTTCGTCCAAAATACCTCAGTGC
CTTAAGACGAAGTGCCAGTGGACAGGTCATATATCCCGGAGAACAGATGGCCGTTGGGGC
CGAAAAGTGCTCGAATGGAGACCACGGATCGGAAAGTGCAGCGTCGGACGTCCACCAACG
AGATGGACGGACGACTTAGTCAAGGCCGCGGGTTCACGGTGGATGCAGGCCGCTTCCGAC
CGAACCGCTTATTGTGTGATACGTGTCCGAGAGGTTCGGGCAGGCCGTGTTCACTCCGGT
TACCTCCCAAGGAGTCACGGTTGCAAACTGCTCTTCCCCATACCCTCTCCAAAAAACGCC
CTTTTAGTCGAGCTCCACAAGCTAAATGTACCATGTTCCAGCGGATACCTAAGATTTGCT
ACTGGTTTTCCACCGGTATGTGGAAAGCTGGAACAGATAGCAATACCGAACAGACGACAT
TTATACCAATCCTCGAGCAAGCCGGAAATTGAAATCCATGGTCGACCCACGTTCGCCGCG
ACTTATCGTGTTGTAGATCATTGTCATGATGTTCTTCTAACGGAAAGAAACGGCTCGTTC
GAAGTCGGCCCAACATTCAAACTATTTTGCTCCTATAAAATTCACTTGCCTTATGGAAAC
CGAGTTGCCTTACGTCTCCAAATGGGAACTGGTCCGATGGTTAAAAAGAATTCAGACAAT
TTTAATATTATTCATGAAGACGGTCATAGCTTTTGCAAAGGTATGGAGTTGAACCTAGTA
GATGGTGATTCAAGATGGAAACATTGCTCACAGCCGGGGGATCCTTTGCGAAGTGTGCAA
ATAATTTCAGAAAGAAATTCAGTCAAGCTTAATATAAGTATTTTAGCAAAGAAAAATTCA
TCCGCAATGTGGTTAAAAGTATGGTGGATGGATAAACCTATCGAGGAAGTTATAGGACAA
TGTGATTTTGGTTGGGTGGTGTCCGGAGATTTTTGTGTTACCTCTGTGAGGGAAACAAAG
AGTTCGTGGCGACAAGCCGAGCTCGAGTGTGTTCGACTTGGGGGTCACCTGGCAAGCATC
CTTAACGAACGTCAGCAACAAATTATCGACCAACTACTTATTCACACACCAGGAGCCGGC
GTCGATGACGTCTATTGGATAGGTGCCACCGACTCCGTCCACGAAGGAGAATTCCGTTGG
TCGGATGGACTACCTTTTTCATATGCACACTGGTTTCCCGGTTGGCGTAAACACGCTGGC
CAACCAAACGACGACGGAACCTCAGGGCAGGACTGTGTGGAGGTACGACGAGAACTGCCC
CCCAGACCAGCTCATCCAACCTTCATGTGGAACGATAGAAGCTGCAGGGAGAGGAACTAC
TACGTTTGCGAGAGACCAGGCGTTGAAGGTGAGGAAATATTCTTGAGAAAAAATTAG

Protein sequence:

MAKSMEELSTMLEGLNRVSQRVGLFMNMDKTKLMSNVHVAPTPVMVENSVLTVVDEYIYL
GQTVQLGRSNFAKEINRRIQLGWAAFGKLHNVFSSKIPQCLKTKCQWTGHISRRTDGRWG
RKVLEWRPRIGKCSVGRPPTRWTDDLVKAAGSRWMQAASDRTAYCVIRVREVRAGRVHSG
YLPRSHGCKLLFPIPSPKNALLVELHKLNVPCSSGYLRFATGFPPVCGKLEQIAIPNRRH
LYQSSSKPEIEIHGRPTFAATYRVVDHCHDVLLTERNGSFEVGPTFKLFCSYKIHLPYGN
RVALRLQMGTGPMVKKNSDNFNIIHEDGHSFCKGMELNLVDGDSRWKHCSQPGDPLRSVQ
IISERNSVKLNISILAKKNSSAMWLKVWWMDKPIEEVIGQCDFGWVVSGDFCVTSVRETK
SSWRQAELECVRLGGHLASILNERQQQIIDQLLIHTPGAGVDDVYWIGATDSVHEGEFRW
SDGLPFSYAHWFPGWRKHAGQPNDDGTSGQDCVEVRRELPPRPAHPTFMWNDRSCRERNY
YVCERPGVEGEEIFLRKN