DPGLEAN13227 in OGS1.0

New model in OGS2.0DPOGS205026 
Genomic Positionscaffold1171:+ 7451-18408
See gene structure
CDS Length1161
Paired RNAseq reads  332
Single RNAseq reads  1097
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010360 (2e-76)
Best Drosophila hit  galectin, isoform E (5e-20)
Best Human hitgalectin-4 (2e-12)
Best NR hit (blastp)  PREDICTED: similar to galectin (AGAP008844-PA) [Tribolium castaneum] (1e-28)
Best NR hit (blastx)  PREDICTED: similar to galectin (AGAP008844-PA) [Tribolium castaneum] (8e-27)
GeneOntology terms

  
GO:0015143 urate transmembrane transporter activity
GO:0016936 galactoside binding
GO:0005829 cytosol
InterPro families

  
IPR001079 Galectin, carbohydrate recognition domain
IPR013320 Concanavalin A-like lectin/glucanase, subgroup
IPR008985 Concanavalin A-like lectin/glucanase
Orthology groupMCL15719

Nucleotide sequence:

ATGTCTGGAATAGAGGATGGCGAACTAGCTAGGGACTTTGAATATGACTTTGCTGAAGAT
GCTTTGGACTTTGAGCAGCAGTCACCGTCGCCCACGGACAATCGCAACATAGATGTTGTG
CATGCGTTGCCGACCTTAGTTGTTGAGGAAAAGAGAGGTGTGGGAATAAGGAATGACAGG
AAAGGGTGGGAACAGGAAAAGGGCAATCGGCTTTCTCACTCATCGGACGAAACGCAGTCA
TTAAATACTACTTCACGCCGATCTTGTGAGAGGGTGAATCAGTTATCAGAAGCTCGGGAT
GTCAAGTTCACGCAGAATCTAACGGAACCGCTCTCTATTGGATCTCATATTATTTGTACT
GGAACTCCTAGCGACGATCTGCCATGGTTCGCAGTAAACATAGGTTGTGGGGACCCGTCT
CGAAGCGATATAGCGATTCATTTCAATGTACGACTGCCACAGTGTTACGTGGTGCGAAAT
ACAAAGAGGCATGACAAATGGGGTATGGAAGAAACCACGGCATATAGGTCGTTCCCCTTT
AAAATAGACCGTCCGTTTACGATTGAGGTTCTAATAGACGAAAAGGAGGCTTTATGGGCC
ATTGATGGCGAGCATTACTGCAGTTACGCCCACAGAAATCCGAGCCCACTGAACGCCACG
TGGGTTCAAGTGACAGGAATACGAGACGCTGCCTTGAAAATACAGAAAACTGACATATAC
CCAACTCTATCGCCGCCTCCACTAGCAGTACCAATAAAATCTGACTCAAATGACACCATA
GAAGACGAACCGAAATGGCATCCAAATGTCATAGCGACTATTGCTGACGGAATCCCGGAG
GGACACCAAATCGTTATACGCGGACGACTACGTCCCATGCTACACTCGTTCACTATAGAC
TTGTTGGATGTAGCTCGTGAATGGCCACGTGGGAACATACTTCTTCATGTGAACGTCCGA
GCTCATGTGCAATCACAGATGTCCAGGCAGCTAGTCGTATTGAACGCGTGGCTTGGTGCG
TGGGGACAGGAGAGGAGACAGAGAACAGCTAAACTGATTCCTGGTACTGAGAAACCGCAA
TTACATCAGACAACTTCTAGCAAAAACCCACTACAACAAGATACAGATAGGGACCATCCA
TTGTTCACTCTTGGTGTTTAA

Protein sequence:

MSGIEDGELARDFEYDFAEDALDFEQQSPSPTDNRNIDVVHALPTLVVEEKRGVGIRNDR
KGWEQEKGNRLSHSSDETQSLNTTSRRSCERVNQLSEARDVKFTQNLTEPLSIGSHIICT
GTPSDDLPWFAVNIGCGDPSRSDIAIHFNVRLPQCYVVRNTKRHDKWGMEETTAYRSFPF
KIDRPFTIEVLIDEKEALWAIDGEHYCSYAHRNPSPLNATWVQVTGIRDAALKIQKTDIY
PTLSPPPLAVPIKSDSNDTIEDEPKWHPNVIATIADGIPEGHQIVIRGRLRPMLHSFTID
LLDVAREWPRGNILLHVNVRAHVQSQMSRQLVVLNAWLGAWGQERRQRTAKLIPGTEKPQ
LHQTTSSKNPLQQDTDRDHPLFTLGV