New model in OGS2.0 | DPOGS202079  |
---|---|
Genomic Position | scaffold389:- 24937-41609 |
CDS Length | 1251 bp |
Paired RNAseq reads   | 108 |
Single RNAseq reads   | 243 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011301 (4e-101) |
Best Drosophila hit   | CG12594 (1e-22) |
Best Human hit | collagen alpha-2(XI) chain isoform 3 preproprotein (7e-09) |
Best NR hit (blastp)   | PREDICTED: similar to CG12594 CG12594-PA [Tribolium castaneum] (9e-67) |
Best NR hit (blastx)   | PREDICTED: similar to CG12594 CG12594-PA [Tribolium castaneum] (3e-57) |
GeneOntology terms    | GO:0005198 structural molecule activity; GO:0007155 cell adhesion |
InterPro families    | IPR008985 Concanavalin A-like lectin/glucanase; IPR000884 Thrombospondin, type 1 repeat; IPR013320 Concanavalin A-like lectin/glucanase, subgroup; IPR012680 Laminin G, subdomain 2; IPR003129 Laminin G, thrombospondin-type, N-terminal; IPR001791 Laminin G domain |
Orthology group | MCL18340 |
Nucleotide sequence:
ATGGCATCTCGGATGTGCCGTGGCATTTCCAATGGATATATCTTGATGGTCGTTTTGTTT
TTGTCGGGGCAAGCTGTAGTTTGTGACAGCCAGTGCCCAAAGTTTGCAGAAAGGCCTCTG
GAAACGAGAGTTCAAGATGCTTCAATAGTTTTTAGGGCGGTGGTCGTTCAGGCACACTAT
CAGATAAAGACATTTGATTTGGCTTTAGTGTCAATATACAGAGGTGGAGTTGAGTTGGCA
TCGATCAGCCAATACGCAGGATCACCCTACAACACGACAGATAGGCAGGTAAATCTCAAA
ATCAATAATCAATTACGTGATTGCTTTAACTGGAGCATGGTCCAACAAAGCGAGCTTGTG
GTTTTCGCTCGTGTCAGTGAACCGGCTGTGGACCTGGAAACAACACCAGCTGATGGGCCC
TGGCTGGAAGCTACTGCAGCAGCAGTTCCTTGGAGCTTGGGAGTCGATATAGCAATATGG
AATGCTGTCGGCTGGGCTGGCTGGGGGGAGTGGGGTGTGTGTAGCAAGACGTGTGGTGGG
GGAAGACAAACCAGAAGAAGATACTGCTCAAGAAATTTTTGTGAAGGTTACGGAGAACAG
GGAAGGTCATGTAATTCCTTCAAATGTGATGGTACAATAAATCCTCTGGCACCAGACGCC
AGGCGAAATTTTCATCCAGCACAAGCCAGATGGGGTCTAGTACCAGATAGACCTCATGCC
TTTAGTCTGAAACCCAACTCTTATATCTGGATAGCGTCTTCCGAACTCTTCGCTCCAGGC
AAGACCTTCCCCAGAGAATTCACACTATTCATTTCTTTAAGATTAAGACCTGAGAGCGGG
GGTTACGGACAAGGAACGTTATTTTCAGTTCGTTCAAGACGTAAAACTGGTTCATTTTTG
TCTCTGGAACTAGCCGGGCGAGGAGCAGCTAGATTGGTTCATTCAGGTGCTGGAACTTCC
CGGTCTATATACCTCGCTGTCCCACTTTATGACTTTAGGTGGCACCACATCGCTATAAGT
GTCCATGACGACAACACTGTGAGAGTGTATGTGGATTGCCGATGGCTGAGGACTGACGTA
CTCGAAAAGGACGCTTTAGATACACCAAAGGACGCTGATCTCATTATAGGCTATCTCTTC
TCAGGGGACTTGGAACAAATGGTCGTTGTGCCGAAAGCCGGTCAAGCCCACGAGCAGTGC
TCTAGCCAAGTGACTGGCATAACACCATTCGTTACCCCGCGCGACACATAA
Protein sequence:
MASRMCRGISNGYILMVVLFLSGQAVVCDSQCPKFAERPLETRVQDASIVFRAVVVQAHY
QIKTFDLALVSIYRGGVELASISQYAGSPYNTTDRQVNLKINNQLRDCFNWSMVQQSELV
VFARVSEPAVDLETTPADGPWLEATAAAVPWSLGVDIAIWNAVGWAGWGEWGVCSKTCGG
GRQTRRRYCSRNFCEGYGEQGRSCNSFKCDGTINPLAPDARRNFHPAQARWGLVPDRPHA
FSLKPNSYIWIASSELFAPGKTFPREFTLFISLRLRPESGGYGQGTLFSVRSRRKTGSFL
SLELAGRGAARLVHSGAGTSRSIYLAVPLYDFRWHHIAISVHDDNTVRVYVDCRWLRTDV
LEKDALDTPKDADLIIGYLFSGDLEQMVVVPKAGQAHEQCSSQVTGITPFVTPRDT
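
The reported CDS length (1251) and the protein sequence above can be cross-checked with a short script. The following is a minimal sketch, assuming the standard genetic code applies and that the CDS ends in a single stop codon. The helper names `translate` and `expected_protein_length` are hypothetical and are not part of the record or of any annotation pipeline.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons are rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count implied by a CDS length, assuming one terminal stop codon."""
    return cds_length // 3 - 1


# First six and last three codons of the nucleotide sequence in this record.
print(translate("ATGGCATCTCGGATGTGC"))   # MASRMC
print(translate("GACACATAA"))            # DT*
print(expected_protein_length(1251))     # 416
```

Passing the full nucleotide sequence to `translate` and comparing the result, minus the trailing `*`, against the protein sequence gives a complete consistency check.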