DPGLEAN02221 in OGS1.0

New model in OGS2.0: DPOGS200046
Genomic Position: scaffold3103:- 2-7947
CDS Length: 1512
Paired RNAseq reads: 237
Single RNAseq reads: 663
Migratory profiles: Query via corresponding ESTs
Best Bombyx hit: BGIBMGA013907 (6e-133)
Best Drosophila hit: SP2353 (2e-36)
Best Human hit: pikachurin isoform 2 (4e-31)
Best NR hit (blastp): hypothetical protein BRAFLDRAFT_70146 [Branchiostoma floridae] (2e-52)
Best NR hit (blastx): PREDICTED: similar to ENSANGP00000003646 [Nasonia vitripennis] (3e-39)
GeneOntology terms
GO:0030054 cell junction
GO:0045202 synapse
GO:0005515 protein binding
GO:0031012 extracellular matrix
GO:0030198 extracellular matrix organization
GO:0005539 glycosaminoglycan binding
GO:0019800 peptide cross-linking via chondroitin 4-sulfate glycosaminoglycan
GO:0005604 basement membrane
GO:0010811 positive regulation of cell-substrate adhesion
GO:0005576 extracellular region
GO:0005578 proteinaceous extracellular matrix
GO:0005614 interstitial matrix
InterPro families
IPR008985 Concanavalin A-like lectin/glucanase
IPR001791 Laminin G domain
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
IPR012680 Laminin G, subdomain 2
IPR012679 Laminin G, subdomain 1
IPR013032 EGF-like region, conserved site
Orthology group: MCL10886

Nucleotide sequence:

ATGAGAGACGCCAAGGACCACAATAAACTGGACAATTCCATACATTACAACGTTAAAATG
TTCAATGACAAATACGACTTGATTGATGATAAGAAACTCAATGACATTGATAATGAACTG
AGTGTTCAGGACTGTAAATGTGAGCATGGGGGAGGTTGTGTGGAACATGGCTGTCTCTGT
CCTCTTGGATATGCGGGAGAGAGATGCGAAATCACTTTGGACCTAAAGGTGCCACGCTTC
AACGGTTCGTCGTACTTACGATTGCCCGGCCTGGGAAACACAGCGCAATCTTGGCTGGAT
ATTCGGATAACCGTAAAACCGACTAGCGGTGATGGGCTTCTGTTATATGACGCGGAACAC
CCCAGCGGCGATGGTGACTTCTTCTCCCTTCATCTTCGTGATTTTTTCGTCGAATTCGCC
TTCGATCTTGGATCTGGGATCGCTCTTGTGAGATCCGCCTACCCGCTGTCACCAAACAAG
TGGCATAGCATATCAATAAGCCGTACGGGTCGTCACGCGTCGATCCGCGTTAGATCTTAC
GACACGAGCGACGTGACGGATACGACGCGATCGGTGACGTCACGCGGCGCCGCGAGGAGA
CTCACCCTCACCCAGCCGATGTTGCTAGGAGGCGCGCCCTATCCGTTGCCACAGAGACTC
GCCTTGAAAACCTCCTTCAGTGGCTGCGTGGGCAAGTTGGTGATCAACGAGGAGGAGTTG
TCTGTAGTTTCCGCTGCTCTCGGCGGCGTTGATGTCGACAACTGTGACGCGCCTCACAAC
ACGTGCACGGACTGCAAAGAGACGTTATACCAAACACCAGAATACCCACGCGAGCTATCA
GCCATCCACCACTCTATAGTCGCGAAAAAGGGATTCAAAATCAAAAATCACAAAACCAAA
ACTAAAAAACATCACGAGAAAAAGAAATATCCTAAGAAGTACGTACAAAATGGCGTGCAT
ATGCAGAACGATATAGACAAGGGTGTTACGGAACAGCCTTACGACGGACGGACATACATG
CAAGTGAAATACCTAGACTCCAACGAGATCAACTGGGGGGACACGAACACCTACCCGAGT
TTCACGGGAACTGATAGCTTTATACATATAGATGACGAAGAGACTATGAAAAGGTTGCTG
AGCTACACCCTGGACATCAACATCCGTTTCCGTTCCGTGTCCTCCAACGGTCTGTTAGTG
TGGAGCGGTCGGGTCACACACACACACGCAGAGAACAATATGAACACGAACACACACACA
AGCGACTTCCTTTCATTGGCTGTGGAAAACTCCGTGCTTGTATTCAGATACGATCTCGGC
AGCGGCGAGGTGGTCATTATAGCGAACCACACGAAAGTGGACGACGGTTTGTGGCACAGA
GCGAGAGCAACCAGGAACAGACAAGCAGGTGTTCTGGAAGTAGACGGCTTGGGGTCTGTT
GGGAAAATATCACCTGGAAAACTGAAACAACTGAACACCGAGAACGGACTTTATATCGGT
AGGTGTTGTTGA

Protein sequence:

MRDAKDHNKLDNSIHYNVKMFNDKYDLIDDKKLNDIDNELSVQDCKCEHGGGCVEHGCLC
PLGYAGERCEITLDLKVPRFNGSSYLRLPGLGNTAQSWLDIRITVKPTSGDGLLLYDAEH
PSGDGDFFSLHLRDFFVEFAFDLGSGIALVRSAYPLSPNKWHSISISRTGRHASIRVRSY
DTSDVTDTTRSVTSRGAARRLTLTQPMLLGGAPYPLPQRLALKTSFSGCVGKLVINEEEL
SVVSAALGGVDVDNCDAPHNTCTDCKETLYQTPEYPRELSAIHHSIVAKKGFKIKNHKTK
TKKHHEKKKYPKKYVQNGVHMQNDIDKGVTEQPYDGRTYMQVKYLDSNEINWGDTNTYPS
FTGTDSFIHIDDEETMKRLLSYTLDINIRFRSVSSNGLLVWSGRVTHTHAENNMNTNTHT
SDFLSLAVENSVLVFRYDLGSGEVVIIANHTKVDDGLWHRARATRNRQAGVLEVDGLGSV
GKISPGKLKQLNTENGLYIGRCC
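
The nucleotide and protein sequences above can be cross-checked against each other and against the stated CDS length. The following is a minimal sketch, assuming the standard genetic code applies to this model; the names `CODON_TABLE` and `translate` are illustrative and not part of the record. It translates the first and last rows of the nucleotide sequence and compares them with the corresponding protein residues.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last rows of the nucleotide sequence listed in this record.
FIRST_ROW = "ATGAGAGACGCCAAGGACCACAATAAACTGGACAATTCCATACATTACAACGTTAAAATG"
LAST_ROW = "AGGTGTTGTTGA"

CDS_LENGTH = 1512  # "CDS Length" field of this record

print(translate(FIRST_ROW))  # MRDAKDHNKLDNSIHYNVKM
print(translate(LAST_ROW))   # RCC*
print(CDS_LENGTH // 3 - 1)   # 503 residues once the stop codon is excluded
```

The 1512-nucleotide CDS corresponds to 504 codons, that is 503 amino acids plus the terminal TGA stop, which matches the length of the protein sequence listed above.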