DPGLEAN11387 in OGS1.0

New model in OGS2.0  DPOGS207912
Genomic Position  scaffold2649:+ 21031-34745
CDS Length  1263
Paired RNAseq reads  1457
Single RNAseq reads  5158
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014456 (1e-65)
Best Drosophila hit  galectin, isoform E (2e-10)
Best Human hit  galectin-4 (2e-22)
Best NR hit (blastp)  PREDICTED: similar to Galectin-4 (Lactose-binding lectin 4) (L-36 lactose-binding protein) (L36LBP) [Apis mellifera] (3e-41)
Best NR hit (blastx)  PREDICTED: similar to Galectin-4 (Lactose-binding lectin 4) (L-36 lactose-binding protein) (L36LBP) [Apis mellifera] (8e-39)
GeneOntology terms  GO:0005529 sugar binding
InterPro families
IPR001079 Galectin, carbohydrate recognition domain
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
IPR008985 Concanavalin A-like lectin/glucanase
Orthology group  MCL18027

Nucleotide sequence:

ATGGCCGCTCAGCCCATATACAACCCTGTAATCCCATGTGTCCACCCGATCCCTGGTGGC
TTGTTCCCCGGTCGCATGATAAGGTTCCAAGGGAGTGTACCGCCCGGCGCCCAACGATTC
GCGATCAATTTCCAATGCGGTCCGAACACTGATCCCCGGGACGACATCGCCCTCCATCTC
AACTTCCGCTTCGTGGAGATGTGCGTCGTTAGGAACCACTTGACGGCGATGAGCTGGGGG
GTGGAGGAGACCAACGGCGGCATGCCTCTAGTGCGAGGGGAGGCTTTCGAGGCCCTGGTT
CTGTGTGAGCCGCAGTCCATCAAGGTCGCGCTGAATGGGGTGCACTTCTGTGAGTTTCCG
CATCGTATACCCTTCCAAAGGATCAGTCACCTGACCGTGGACGGTGACGTCATGCTGCAG
TTCGTCGGCTTCGAGGGAGCCCAGCCAAGCCAGATGTACATGGCGGAACCTCCATCATAT
GCCAGCTATGGCGCTCCGCCCTCGTACGGCGCCCCCGGCTATGGAGCACCCCAAGGTGGT
TTCGGTGGAGCGGTACCCCCACAATACGCGGGCGCCCAAACAGTACCCCAGTATACTCAA
GAGCGTCGTGGTATGGGAACCGGGGCGGCCGTGGGATTGGGCGTTGGGGCCCTGGCCGCT
GGTGGGCTGGCGGGTTATGCACTAGGCGGGGGCTTCAGCAGCAATAGCCCTACCGAGGAG
CCAGGCAGCCACAGACGACGAGGCCATGGTTCATACGATGGTCAAGGGCCCTACGGTGGT
CAAGGTCCTTACGGTGGTCAAGGGCCCTTCGGTGGTCAAGGTCCTTACGGCGGTCAAGGT
CCAGGGCTCTTTGGTGGTCCGAATACTGGCTATGGAAGTCCTGATCAAACTCGGAACTAT
GTGCAAGATCCTCTCAATCCTCACCAGGGGCCAGTTCAGCCTCCAGTCGCGAATGTCACA
CCACCTCCGTACGGACAAGGTCAAGACAGTCACTACCCCCCGGGATATCAGCTTCATAAT
CAAGGTTATCCTTACGGTCAAGGTTACCCACCACACGGCCAAGGTTACCCACCCCAAGGA
CAGGGATATCCACCTTACGGTCAAGACTACGGTCAAGGGTACCCACCTCAGGGCCCAGGG
TATCCACCTCAAGGACCAGGATATCCGGGATATGGCCAAGGATACCCACCACAGGGCTAC
CCTGGTCAAGGACCTCCAGGCACGTTTTCACAGAATTATCAATTTTTGCTACATAATTTG
TAA

Protein sequence:

MAAQPIYNPVIPCVHPIPGGLFPGRMIRFQGSVPPGAQRFAINFQCGPNTDPRDDIALHL
NFRFVEMCVVRNHLTAMSWGVEETNGGMPLVRGEAFEALVLCEPQSIKVALNGVHFCEFP
HRIPFQRISHLTVDGDVMLQFVGFEGAQPSQMYMAEPPSYASYGAPPSYGAPGYGAPQGG
FGGAVPPQYAGAQTVPQYTQERRGMGTGAAVGLGVGALAAGGLAGYALGGGFSSNSPTEE
PGSHRRRGHGSYDGQGPYGGQGPYGGQGPFGGQGPYGGQGPGLFGGPNTGYGSPDQTRNY
VQDPLNPHQGPVQPPVANVTPPPYGQGQDSHYPPGYQLHNQGYPYGQGYPPHGQGYPPQG
QGYPPYGQDYGQGYPPQGPGYPPQGPGYPGYGQGYPPQGYPGQGPPGTFSQNYQFLLHNL