New model in OGS2.0 | DPOGS203288  |
---|---|
Genomic Position | scaffold6:- 1456778-1468965 |
See gene structure | |
CDS Length | 6237 |
Paired RNAseq reads   | 881 |
Single RNAseq reads   | 2184 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012237 (3e-06) |
Best Drosophila hit   | kon-tiki (0.0) |
Best Human hit | chondroitin sulfate proteoglycan 4 precursor (1e-140) |
Best NR hit (blastp)   | Chondroitin sulfate proteoglycan 4 precursor, putative [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | Chondroitin sulfate proteoglycan 4 precursor, putative [Pediculus humanus corporis] (0.0) |
GeneOntology terms    | GO:0016203 muscle attachment GO:0007155 cell adhesion GO:0005488 binding GO:0016021 integral to membrane GO:0051491 positive regulation of filopodium assembly GO:0007517 muscle organ development GO:0051286 cell tip GO:0005515 protein binding |
InterPro families    | IPR001791 Laminin G domain IPR013320 Concanavalin A-like lectin/glucanase, subgroup IPR008985 Concanavalin A-like lectin/glucanase |
Orthology group | MCL11234 |
Nucleotide sequence:
ATGCATACTACTTTATACGACCCACTTTATTGTCATCGTTTATTCAATTCTAACCTTTCC
GTTGACAATGTGGCGTTAAGTACTCGCATCGAAAGCGGCGGTAACAGATATCTAGACCTC
TCTGATACTTTCTATCTAGGTGGTATAGAATCTGAGAAACGACAAAGAGCCTTCGCTCGA
GGCGTCAAAGCAGCTGATTCAAGTATCATGGGTTGCATCAAACCAATCGAGGTGGACGAC
AGGTTGTATGGTCTGCCTAACGCGGTTGTCACATACGGTATTAGTCCGAAGTGTGTTTGG
TGGTACCCCTGCCAGAGCGCCCATCCATGCGTGTCACAGGCCGTCTGCGAGCAACACGGT
CTAGATCACTTCACTTGCAAGTGTGATAGCGATCTCTGCATAAATCCCGACTACGCTGAA
AAATATAAGGTTTTTTCAAAATCGAGTAGCGAATTGGAACTGGTAACGTTATATCCTCTG
ACCGTCCAAGAGGGTGGTGTGGCTGTTATAACGTCACAGAACATAGATGTCGTATTGGAT
CACCATAAGTATGGCGTGCGTCCGTCTGGGGTACTGCTGCATGTGGCGAGGTCGCCACAA
CACGGCCGCATAGCCATAGACCTCTCGCTACAGAGGAATGCGCCACAGTATACAAATTAT
GTGGACGGTGAAAAAGCCAAACAGTTTTTCACGCTCATGGACCTTACCAGAGATAAGGTT
CGCTACGTCCACGACGGCTCAGAAAATCACCAAGACGCCATAGTATTAGAGATGGAACTC
ATACCAGAGACGAAGTTCACACTGCCAAGCTACCTCCAAGGACGCAACACGTTTGTCCTA
CACGTTAACGTGACGCCGGTCAACGATCCGCCAGTACTGAATTTGTTGCCAGGGAAAATA
TTAAGATTGACCCAGGGAACACGAAAGGTTATAACCTCCGACATTTTGAAGGCAGAGGAT
CCTGATACACTGCCCGAAGATCTCTTGTATACCGTACTGCATGGAAAGAATGAAGCAAAT
AGCGGCCACATCGAGATGTCTGGTCAGCCTGTGGACTCTTTCACTCAGCAGGACATCGAC
TCCGGCATAATATCCTACGTTCACGGTACAACCAGCGACAAGCAGCTGAATAAAACATCG
CTGAGACTGACATTACAGGTCTCCGATGGCATAGAAACTAGCGGGCCGGGCGTCCTCCGC
ATATCTATAGTACCGCTACAAGTGCGTCTCGTTAACAACACCGGACTGTTCCTAGTGCAT
AACTCGTACGCCATAATAACAGCCGACAACCTGACATTCGCCACTAACGCAGACGAAACC
AACGTCCAAGTCAAATACGACGTAGTGAAACCTCCTCAGTTTGGCGTTGTCGAACGTCTC
CGAGTACTAGATGGCACTTGGCAGACTGTTGACACGTTCACCAGTGAAATGATCAGTTCC
GGTAGAGTCCGCTACATGCATATATTAGGAAACCCATCGCACGATGAATTTAAATTCAAA
GCCTCCGTCGGCACAGTACGGACGAACACTCTATACGATTTTCGATTGACTTTCATCAAA
CTCGAACTATATCAAATGACGAACGAAGAATTGGTTTTGAACAATACCAGAGAGGCGTTC
GTTTCCGATCAACACTTGCGTTTCAAAACGAAGCCGCTCGCGCTGACGGGTGACAGGATA
CTATTCACAATAATTAAACCGCCGAAGTATGGTATCCTACATCTGTCGTCTGGTAAACAT
CATTTGCAACTGCACAGCACTTTCACACAGCACGATATAGATTCGGACCAGCTGTGGTAC
AGATTACACAGACGCGCATACTCTCACATACAGGACGAGTTCACTTTCGTGGTAGGGGCT
ACGGAATGTGAGAATATCACAGGAGTAATGACAATAAGACATGTGCCGGGCACATCCAGT
AGCGATCACTTGTCAGGGAGGATACACACCACGTTGGAGAGATTGCAAGTCATAGAAGGT
TCCAGAATGGCGATACCAGCTACTCACCTTAATTTCAGAACGGATTCAATAACCAACCTA
GTGTTCAATATAACGCGACCGCCCAAACACGGCAAAATCGAAGTTATCACCGATCATTTG
AAAATACTGAGAGACAATACCACGTACTTCACTCTACAGGAATTGAATTCCGACAGAGTT
TATTATACCCACGACGATTCGGAAAGCAGGCACGATTCTTTTCATTTCATGGCATTAAGT
CCTGAGCCGGAAGACTTTCAGTACGTTGGAGTTTTCCATATCGACATCATACTGAAAAAT
GACAATAGTCCCGTGCGGGCGAATGAAAAAGTGTTTCATATAGTCCACGGAGGGGCGAGG
CTTATAATGGCTAGGGATTTGAGTTACACCGACGCTGACTTGGACACTAAGCCTTCAGAC
ATAGTGTATACCGTACAGAGATTCACGAAAGATCCTCCAAACGGCGGCATATTCCGTGCA
GATAACCCATCCGAACAAATTGCTCAGTTCACGCAGGACGACATTAATAAAAACCTTGTA
ATGTTTAAGCATCAAGGCAAAGAGTACGGCAAAATAGCGTTTTGGATATCAGACGGGCTA
TTCGACGTGAACGGTAATTTGGAGATACAAGCTTCACCTCCCTTCATAAGAATGTATCCA
ACTAACGGTTCAATTGTAGAGAATGGTAAATCCGTTGTCTTAACTACCAAATACATGCAG
GTGGACACTAACATGAATTGCCTTGAAGAAGATATCAGATACGAAATTATACAAGAACCC
AAACAGGGGTCTATAGAAGTTGGTGAAATTTTGGGAGCAATTGCATTCACTCAATTGGAC
ATAGCGGCTGGAAGAGTGGCCTATAAACACAGGGAACCGGAAACGCAAAACGATGCTTTT
AGGTTTAAAGTTACGTGCCTTGAGGCCTGGGGTGAGGGTATATACCCTATTAAGATATTT
CCGTCCAGTTACTGGGAACCTCTAAAATTAACGAATAATAAAGCATTAGTTGTCGAAGAA
TCAACTAGCCTGAATATCACGAGAGATATACTAGAAGTCATGCATCCGCAAATTGAACCT
TCAAATATTCTGTACCAAGTCACCGATGGCCCGTACCACGGTTGGCTCGAAGTTACAGCA
GTGGGTACGGTTGAATTGGAGAATTACAACGAGGAGCCAGTGCAAACTAAAGTGTTCGAT
CAATCTATCATAAACTCAAATAGATTAGTCTACGTACAGGCCGGTGTGAATCGAACCAGA
GACAAAATCAAAATGGACGTAACCAATGGGATCGTTTGGCTCAGAGGAATAGAGCTTACT
GTCATAATAATACCGGAGCATTTTTACGTAGTCTCCTCAAACCTGACGGTGGTGGAGGGG
ATGTCCGTCAGTATCAAGCAGGACTTGTTCAGTACGGTCACGGAGTACTACCGCGGGCGA
GTCGTCTCCTACAAAGTTGTTCAGAATCCGAAATACGGCAAGATCGTTATGGATGAGCAG
GAATTGACATTGCTGCCTGTGCTTAAGTTGAATTCCGGAAATATCGTGTACACTAATGAC
GGTTCTGAAGAGTCAACTGATGTGATAAAGTTGGTTGCGATAACAGAAACCGGTAAGGAA
AGTGAACCGTTCTATCTCCGCATCAATATAGAGCCAGTTAACGATGAACCACCAATCGTG
GCCGCTAACACCGGCCTCTGTGTGTGGGAAGGTGGCACATTTACGTTTACTAGAAATGAA
CTTTATGTAAACGATATTGACACGCCATTAAGAAACGTCACAATTAGGGTAGTGGATATT
GTCTCTGGCTACATCGCCACACGAGGCGATCTAGACACTCCCATAGATCACTTCACACAA
GCGGATATTGATAACAGATATGTCGTATTTGTTCATAAAAACGGATCCAAGGGCAAGATG
ATATTTAACGTAACCGATGGTCTTCACGAACTATCAAAAATAACATTCCTTATAACAACA
AAATCTGTGTCCCTTAAGCTGGTCAGAAAACATTCATTGCGAGTATTCCCTCTGATGAGA
GAGCCGCTCAACAATTACCTCCTGATGGCGAAATGCACAGATCCATCCAGACCGATAGTT
TTTAAAATTGTAAGAGCACCAGCTTTAGGTAGGCTGGTTATGCTGAGTGGGGATAACCAT
CACAGATCCGTAACACAATTCACACAGAGGGATATAAATGAAACAACAGTCTATTACGAA
CATACTCACCCATTTTCTGATCTTTATACTAACGACTCCTTCATATTTAAAGTGGAGGCG
GCATTGGCTAAGCCGGTACTAGATCAAATCTTTCATATCGATATATCAGTGGCATCCGGT
GGTTTGGCAAAGTATGTGAATATTCCATTGACCAAAGTCAAGGAAGGCGACAAAATTCCA
TTGCGTGTGAACGTGACTAATGTTATAACATATTTGGAGACACAGGCTGGTGTTAGACAA
CCACAAATTGAGGCACAGTGGTCGTTACCGATGCACGGTGTCTTAAATCCTTTGTTATCT
TCGCTTACACAAAGCCAGCTGGAAGATGGTGTCGTAACATATGAGCATGATGACTCGGAC
ACGGTTGAAGACAGTATAGATATGGCGCTGTATTTGCTACCAGATTATGTTCTCTTGTGT
AATGTCACTATACCGATTCATATTGTGCCAGTAAACGATCAGCCGTTCAGGTTGTTAACG
GACACTCCACAAATACAGGTGGTGCAAGGAGAAAATTATACTTTAACTAAGAATGATTTG
CTCACTGAAGATGGTGACACTGTGCCATCGGGTATACTCTATGATATAATAAGTGGCCCG
ACACAAGGTAGACTAGTCATGATGGATGAAAATCAGACACTAGACGAGGCGCAATCCATA
AACAAATTCACCCAACAGGATATAAATGAGGGTAGGATTGTATATGAACATTCAGGCATA
TTGCAAACAGCGACATTCTATTTCCGGGTATGGGACGGAGAGTTCAAACCGACCTATACG
GTTTTCACTATAGACGTCATACCAGTTATATTGAACGCGTCATCACTACATCCGATATTC
TTGCAGCAAGGTTCAAACGTGGCGACTGTGGCACCAGATCAAATATATGTAGAGACCAAT
GCCAAAAAAGATAAAGTCTGGTACAATATAACAAGACAACCAGTTCACGGGATGATATAC
TTGGGAAGGAATCCTGTAACTTATTTTTCACATAAGGATATAATGGATAAAGTAGTCATT
TATATGCAGAATGATATGACGGTAGCGAATGATAGTTTTGATCTGATCGCCTATGTCCAT
AACAGCAACGCCACACAGCCTTTCACCATAGACGTTGTTGTGCAACCGTTACTAGTATTG
GGGGATTTGAAAATTATTGAACAGAAAACAAAAATAACATTAAATAATTTGGATGCGAAT
GAGTTGGCAAAACTAACAGCGAGCGATCCGGTTTACACGATATTGAGGAAGCCAAATTAT
GGAAGCATAAAAAAGATAATAAGAAGCTCTGGCGAGAAAACCAGTGCGAGGGAGAGGGAG
ATAGCGTATTTCACTCACGAGGACATTAAAGCTGGTGTCATATACTATGTGGCCAGGAAG
AAATTGGCTGCTCTGAACGGTGTCCAAGACAGTCTTGGTTTCCTACTCGCTGCGACAATA
TTCCAACCAGCCACCGGTGAGCTTGATATTTATATCGGCAAAAAGGGTGACAAGAAAAGT
TTACTGGGACCAAGCGATCCTGAAGGCCACGAAGGGATTCCGGTAAAAAATGGACAAACA
TCGTCATATTATATGATGGTGATAATGACGTTGCTTGGTGTACTCCTGGCTGTTATAATA
CTAGTCAGTCTATTGAAATGTCGTCGTTATATGACAAGGGATCAGAACGCCATGGTGAAA
ATACACGGACAGAGTCAGGGCGCGGTTGCTCCTATACCGTTACCTCGACCACCAGACCAC
TTGATGCCGTCACCAACCCAAGCCAGTCCTCCAATAAAGAGATATGTGTCTTCGGAGCAA
TCGGTACACACTGGCACCAGCACTCCTCTACCGTCAGGTGGTAGTGTAGCTTGTAAGGTG
ACCCCGTTAGCGGACGCTGGCCTTCCAGACCTCAACGCAAGGTATCCTTATGGAGCCGAT
GACCATACCGATGCGGAAGATTGGAGCAGCTATGAGGCTAGCGAGTCAGCCTTCCCGGTC
CGCTCAGGCGGTGTCCCCACCAACCCGATGCTGCGCCGCAACCAGTACTGGGTCTGA
Protein sequence:
MHTTLYDPLYCHRLFNSNLSVDNVALSTRIESGGNRYLDLSDTFYLGGIESEKRQRAFAR
GVKAADSSIMGCIKPIEVDDRLYGLPNAVVTYGISPKCVWWYPCQSAHPCVSQAVCEQHG
LDHFTCKCDSDLCINPDYAEKYKVFSKSSSELELVTLYPLTVQEGGVAVITSQNIDVVLD
HHKYGVRPSGVLLHVARSPQHGRIAIDLSLQRNAPQYTNYVDGEKAKQFFTLMDLTRDKV
RYVHDGSENHQDAIVLEMELIPETKFTLPSYLQGRNTFVLHVNVTPVNDPPVLNLLPGKI
LRLTQGTRKVITSDILKAEDPDTLPEDLLYTVLHGKNEANSGHIEMSGQPVDSFTQQDID
SGIISYVHGTTSDKQLNKTSLRLTLQVSDGIETSGPGVLRISIVPLQVRLVNNTGLFLVH
NSYAIITADNLTFATNADETNVQVKYDVVKPPQFGVVERLRVLDGTWQTVDTFTSEMISS
GRVRYMHILGNPSHDEFKFKASVGTVRTNTLYDFRLTFIKLELYQMTNEELVLNNTREAF
VSDQHLRFKTKPLALTGDRILFTIIKPPKYGILHLSSGKHHLQLHSTFTQHDIDSDQLWY
RLHRRAYSHIQDEFTFVVGATECENITGVMTIRHVPGTSSSDHLSGRIHTTLERLQVIEG
SRMAIPATHLNFRTDSITNLVFNITRPPKHGKIEVITDHLKILRDNTTYFTLQELNSDRV
YYTHDDSESRHDSFHFMALSPEPEDFQYVGVFHIDIILKNDNSPVRANEKVFHIVHGGAR
LIMARDLSYTDADLDTKPSDIVYTVQRFTKDPPNGGIFRADNPSEQIAQFTQDDINKNLV
MFKHQGKEYGKIAFWISDGLFDVNGNLEIQASPPFIRMYPTNGSIVENGKSVVLTTKYMQ
VDTNMNCLEEDIRYEIIQEPKQGSIEVGEILGAIAFTQLDIAAGRVAYKHREPETQNDAF
RFKVTCLEAWGEGIYPIKIFPSSYWEPLKLTNNKALVVEESTSLNITRDILEVMHPQIEP
SNILYQVTDGPYHGWLEVTAVGTVELENYNEEPVQTKVFDQSIINSNRLVYVQAGVNRTR
DKIKMDVTNGIVWLRGIELTVIIIPEHFYVVSSNLTVVEGMSVSIKQDLFSTVTEYYRGR
VVSYKVVQNPKYGKIVMDEQELTLLPVLKLNSGNIVYTNDGSEESTDVIKLVAITETGKE
SEPFYLRINIEPVNDEPPIVAANTGLCVWEGGTFTFTRNELYVNDIDTPLRNVTIRVVDI
VSGYIATRGDLDTPIDHFTQADIDNRYVVFVHKNGSKGKMIFNVTDGLHELSKITFLITT
KSVSLKLVRKHSLRVFPLMREPLNNYLLMAKCTDPSRPIVFKIVRAPALGRLVMLSGDNH
HRSVTQFTQRDINETTVYYEHTHPFSDLYTNDSFIFKVEAALAKPVLDQIFHIDISVASG
GLAKYVNIPLTKVKEGDKIPLRVNVTNVITYLETQAGVRQPQIEAQWSLPMHGVLNPLLS
SLTQSQLEDGVVTYEHDDSDTVEDSIDMALYLLPDYVLLCNVTIPIHIVPVNDQPFRLLT
DTPQIQVVQGENYTLTKNDLLTEDGDTVPSGILYDIISGPTQGRLVMMDENQTLDEAQSI
NKFTQQDINEGRIVYEHSGILQTATFYFRVWDGEFKPTYTVFTIDVIPVILNASSLHPIF
LQQGSNVATVAPDQIYVETNAKKDKVWYNITRQPVHGMIYLGRNPVTYFSHKDIMDKVVI
YMQNDMTVANDSFDLIAYVHNSNATQPFTIDVVVQPLLVLGDLKIIEQKTKITLNNLDAN
ELAKLTASDPVYTILRKPNYGSIKKIIRSSGEKTSAREREIAYFTHEDIKAGVIYYVARK
KLAALNGVQDSLGFLLAATIFQPATGELDIYIGKKGDKKSLLGPSDPEGHEGIPVKNGQT
SSYYMMVIMTLLGVLLAVIILVSLLKCRRYMTRDQNAMVKIHGQSQGAVAPIPLPRPPDH
LMPSPTQASPPIKRYVSSEQSVHTGTSTPLPSGGSVACKVTPLADAGLPDLNARYPYGAD
DHTDAEDWSSYEASESAFPVRSGGVPTNPMLRRNQYWV