New model in OGS2.0 | DPOGS207912  |
---|---|
Genomic Position | scaffold2649:+ 21031-34745 |
See gene structure | |
CDS Length | 1263 |
Paired RNAseq reads   | 1457 |
Single RNAseq reads   | 5158 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA014456 (1e-65) |
Best Drosophila hit   | galectin, isoform E (2e-10) |
Best Human hit | galectin-4 (2e-22) |
Best NR hit (blastp)   | PREDICTED: similar to Galectin-4 (Lactose-binding lectin 4) (L-36 lactose-binding protein) (L36LBP) [Apis mellifera] (3e-41) |
Best NR hit (blastx)   | PREDICTED: similar to Galectin-4 (Lactose-binding lectin 4) (L-36 lactose-binding protein) (L36LBP) [Apis mellifera] (8e-39) |
GeneOntology terms   | GO:0005529 sugar binding |
InterPro families    | IPR001079 Galectin, carbohydrate recognition domain IPR013320 Concanavalin A-like lectin/glucanase, subgroup IPR008985 Concanavalin A-like lectin/glucanase |
Orthology group | MCL18027 |
Nucleotide sequence:
ATGGCCGCTCAGCCCATATACAACCCTGTAATCCCATGTGTCCACCCGATCCCTGGTGGC
TTGTTCCCCGGTCGCATGATAAGGTTCCAAGGGAGTGTACCGCCCGGCGCCCAACGATTC
GCGATCAATTTCCAATGCGGTCCGAACACTGATCCCCGGGACGACATCGCCCTCCATCTC
AACTTCCGCTTCGTGGAGATGTGCGTCGTTAGGAACCACTTGACGGCGATGAGCTGGGGG
GTGGAGGAGACCAACGGCGGCATGCCTCTAGTGCGAGGGGAGGCTTTCGAGGCCCTGGTT
CTGTGTGAGCCGCAGTCCATCAAGGTCGCGCTGAATGGGGTGCACTTCTGTGAGTTTCCG
CATCGTATACCCTTCCAAAGGATCAGTCACCTGACCGTGGACGGTGACGTCATGCTGCAG
TTCGTCGGCTTCGAGGGAGCCCAGCCAAGCCAGATGTACATGGCGGAACCTCCATCATAT
GCCAGCTATGGCGCTCCGCCCTCGTACGGCGCCCCCGGCTATGGAGCACCCCAAGGTGGT
TTCGGTGGAGCGGTACCCCCACAATACGCGGGCGCCCAAACAGTACCCCAGTATACTCAA
GAGCGTCGTGGTATGGGAACCGGGGCGGCCGTGGGATTGGGCGTTGGGGCCCTGGCCGCT
GGTGGGCTGGCGGGTTATGCACTAGGCGGGGGCTTCAGCAGCAATAGCCCTACCGAGGAG
CCAGGCAGCCACAGACGACGAGGCCATGGTTCATACGATGGTCAAGGGCCCTACGGTGGT
CAAGGTCCTTACGGTGGTCAAGGGCCCTTCGGTGGTCAAGGTCCTTACGGCGGTCAAGGT
CCAGGGCTCTTTGGTGGTCCGAATACTGGCTATGGAAGTCCTGATCAAACTCGGAACTAT
GTGCAAGATCCTCTCAATCCTCACCAGGGGCCAGTTCAGCCTCCAGTCGCGAATGTCACA
CCACCTCCGTACGGACAAGGTCAAGACAGTCACTACCCCCCGGGATATCAGCTTCATAAT
CAAGGTTATCCTTACGGTCAAGGTTACCCACCACACGGCCAAGGTTACCCACCCCAAGGA
CAGGGATATCCACCTTACGGTCAAGACTACGGTCAAGGGTACCCACCTCAGGGCCCAGGG
TATCCACCTCAAGGACCAGGATATCCGGGATATGGCCAAGGATACCCACCACAGGGCTAC
CCTGGTCAAGGACCTCCAGGCACGTTTTCACAGAATTATCAATTTTTGCTACATAATTTG
TAA
Protein sequence:
MAAQPIYNPVIPCVHPIPGGLFPGRMIRFQGSVPPGAQRFAINFQCGPNTDPRDDIALHL
NFRFVEMCVVRNHLTAMSWGVEETNGGMPLVRGEAFEALVLCEPQSIKVALNGVHFCEFP
HRIPFQRISHLTVDGDVMLQFVGFEGAQPSQMYMAEPPSYASYGAPPSYGAPGYGAPQGG
FGGAVPPQYAGAQTVPQYTQERRGMGTGAAVGLGVGALAAGGLAGYALGGGFSSNSPTEE
PGSHRRRGHGSYDGQGPYGGQGPYGGQGPFGGQGPYGGQGPGLFGGPNTGYGSPDQTRNY
VQDPLNPHQGPVQPPVANVTPPPYGQGQDSHYPPGYQLHNQGYPYGQGYPPHGQGYPPQG
QGYPPYGQDYGQGYPPQGPGYPPQGPGYPGYGQGYPPQGYPGQGPPGTFSQNYQFLLHNL