New model in OGS2.0 | DPOGS200046  |
---|---|
Genomic Position | scaffold3103:- 2-7947 |
CDS Length | 1512 |
Paired RNAseq reads   | 237 |
Single RNAseq reads   | 663 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA013907 (6e-133) |
Best Drosophila hit   | SP2353 (2e-36) |
Best Human hit | pikachurin isoform 2 (4e-31) |
Best NR hit (blastp)   | hypothetical protein BRAFLDRAFT_70146 [Branchiostoma floridae] (2e-52) |
Best NR hit (blastx)   | PREDICTED: similar to ENSANGP00000003646 [Nasonia vitripennis] (3e-39) |
Gene Ontology terms | GO:0030054 cell junction; GO:0045202 synapse; GO:0005515 protein binding; GO:0031012 extracellular matrix; GO:0030198 extracellular matrix organization; GO:0005539 glycosaminoglycan binding; GO:0019800 peptide cross-linking via chondroitin 4-sulfate glycosaminoglycan; GO:0005604 basement membrane; GO:0010811 positive regulation of cell-substrate adhesion; GO:0005576 extracellular region; GO:0005578 proteinaceous extracellular matrix; GO:0005614 interstitial matrix |
InterPro families | IPR008985 Concanavalin A-like lectin/glucanase; IPR001791 Laminin G domain; IPR013320 Concanavalin A-like lectin/glucanase, subgroup; IPR012680 Laminin G, subdomain 2; IPR012679 Laminin G, subdomain 1; IPR013032 EGF-like region, conserved site |
Orthology group | MCL10886 |
Nucleotide sequence:
ATGAGAGACGCCAAGGACCACAATAAACTGGACAATTCCATACATTACAACGTTAAAATG
TTCAATGACAAATACGACTTGATTGATGATAAGAAACTCAATGACATTGATAATGAACTG
AGTGTTCAGGACTGTAAATGTGAGCATGGGGGAGGTTGTGTGGAACATGGCTGTCTCTGT
CCTCTTGGATATGCGGGAGAGAGATGCGAAATCACTTTGGACCTAAAGGTGCCACGCTTC
AACGGTTCGTCGTACTTACGATTGCCCGGCCTGGGAAACACAGCGCAATCTTGGCTGGAT
ATTCGGATAACCGTAAAACCGACTAGCGGTGATGGGCTTCTGTTATATGACGCGGAACAC
CCCAGCGGCGATGGTGACTTCTTCTCCCTTCATCTTCGTGATTTTTTCGTCGAATTCGCC
TTCGATCTTGGATCTGGGATCGCTCTTGTGAGATCCGCCTACCCGCTGTCACCAAACAAG
TGGCATAGCATATCAATAAGCCGTACGGGTCGTCACGCGTCGATCCGCGTTAGATCTTAC
GACACGAGCGACGTGACGGATACGACGCGATCGGTGACGTCACGCGGCGCCGCGAGGAGA
CTCACCCTCACCCAGCCGATGTTGCTAGGAGGCGCGCCCTATCCGTTGCCACAGAGACTC
GCCTTGAAAACCTCCTTCAGTGGCTGCGTGGGCAAGTTGGTGATCAACGAGGAGGAGTTG
TCTGTAGTTTCCGCTGCTCTCGGCGGCGTTGATGTCGACAACTGTGACGCGCCTCACAAC
ACGTGCACGGACTGCAAAGAGACGTTATACCAAACACCAGAATACCCACGCGAGCTATCA
GCCATCCACCACTCTATAGTCGCGAAAAAGGGATTCAAAATCAAAAATCACAAAACCAAA
ACTAAAAAACATCACGAGAAAAAGAAATATCCTAAGAAGTACGTACAAAATGGCGTGCAT
ATGCAGAACGATATAGACAAGGGTGTTACGGAACAGCCTTACGACGGACGGACATACATG
CAAGTGAAATACCTAGACTCCAACGAGATCAACTGGGGGGACACGAACACCTACCCGAGT
TTCACGGGAACTGATAGCTTTATACATATAGATGACGAAGAGACTATGAAAAGGTTGCTG
AGCTACACCCTGGACATCAACATCCGTTTCCGTTCCGTGTCCTCCAACGGTCTGTTAGTG
TGGAGCGGTCGGGTCACACACACACACGCAGAGAACAATATGAACACGAACACACACACA
AGCGACTTCCTTTCATTGGCTGTGGAAAACTCCGTGCTTGTATTCAGATACGATCTCGGC
AGCGGCGAGGTGGTCATTATAGCGAACCACACGAAAGTGGACGACGGTTTGTGGCACAGA
GCGAGAGCAACCAGGAACAGACAAGCAGGTGTTCTGGAAGTAGACGGCTTGGGGTCTGTT
GGGAAAATATCACCTGGAAAACTGAAACAACTGAACACCGAGAACGGACTTTATATCGGT
AGGTGTTGTTGA
Protein sequence:
MRDAKDHNKLDNSIHYNVKMFNDKYDLIDDKKLNDIDNELSVQDCKCEHGGGCVEHGCLC
PLGYAGERCEITLDLKVPRFNGSSYLRLPGLGNTAQSWLDIRITVKPTSGDGLLLYDAEH
PSGDGDFFSLHLRDFFVEFAFDLGSGIALVRSAYPLSPNKWHSISISRTGRHASIRVRSY
DTSDVTDTTRSVTSRGAARRLTLTQPMLLGGAPYPLPQRLALKTSFSGCVGKLVINEEEL
SVVSAALGGVDVDNCDAPHNTCTDCKETLYQTPEYPRELSAIHHSIVAKKGFKIKNHKTK
TKKHHEKKKYPKKYVQNGVHMQNDIDKGVTEQPYDGRTYMQVKYLDSNEINWGDTNTYPS
FTGTDSFIHIDDEETMKRLLSYTLDINIRFRSVSSNGLLVWSGRVTHTHAENNMNTNTHT
SDFLSLAVENSVLVFRYDLGSGEVVIIANHTKVDDGLWHRARATRNRQAGVLEVDGLGSV
GKISPGKLKQLNTENGLYIGRCC