New model in OGS2.0 | DPOGS205026  |
---|---|
Genomic Position | scaffold1171:+ 7451-18408 |
See gene structure | |
CDS Length | 1161 |
Paired RNAseq reads   | 332 |
Single RNAseq reads   | 1097 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA010360 (2e-76) |
Best Drosophila hit   | galectin, isoform E (5e-20) |
Best Human hit | galectin-4 (2e-12) |
Best NR hit (blastp)   | PREDICTED: similar to galectin (AGAP008844-PA) [Tribolium castaneum] (1e-28) |
Best NR hit (blastx)   | PREDICTED: similar to galectin (AGAP008844-PA) [Tribolium castaneum] (8e-27) |
GeneOntology terms    | GO:0015143 urate transmembrane transporter activity GO:0016936 galactoside binding GO:0005829 cytosol |
InterPro families    | IPR001079 Galectin, carbohydrate recognition domain IPR013320 Concanavalin A-like lectin/glucanase, subgroup IPR008985 Concanavalin A-like lectin/glucanase |
Orthology group | MCL15719 |
Nucleotide sequence:
ATGTCTGGAATAGAGGATGGCGAACTAGCTAGGGACTTTGAATATGACTTTGCTGAAGAT
GCTTTGGACTTTGAGCAGCAGTCACCGTCGCCCACGGACAATCGCAACATAGATGTTGTG
CATGCGTTGCCGACCTTAGTTGTTGAGGAAAAGAGAGGTGTGGGAATAAGGAATGACAGG
AAAGGGTGGGAACAGGAAAAGGGCAATCGGCTTTCTCACTCATCGGACGAAACGCAGTCA
TTAAATACTACTTCACGCCGATCTTGTGAGAGGGTGAATCAGTTATCAGAAGCTCGGGAT
GTCAAGTTCACGCAGAATCTAACGGAACCGCTCTCTATTGGATCTCATATTATTTGTACT
GGAACTCCTAGCGACGATCTGCCATGGTTCGCAGTAAACATAGGTTGTGGGGACCCGTCT
CGAAGCGATATAGCGATTCATTTCAATGTACGACTGCCACAGTGTTACGTGGTGCGAAAT
ACAAAGAGGCATGACAAATGGGGTATGGAAGAAACCACGGCATATAGGTCGTTCCCCTTT
AAAATAGACCGTCCGTTTACGATTGAGGTTCTAATAGACGAAAAGGAGGCTTTATGGGCC
ATTGATGGCGAGCATTACTGCAGTTACGCCCACAGAAATCCGAGCCCACTGAACGCCACG
TGGGTTCAAGTGACAGGAATACGAGACGCTGCCTTGAAAATACAGAAAACTGACATATAC
CCAACTCTATCGCCGCCTCCACTAGCAGTACCAATAAAATCTGACTCAAATGACACCATA
GAAGACGAACCGAAATGGCATCCAAATGTCATAGCGACTATTGCTGACGGAATCCCGGAG
GGACACCAAATCGTTATACGCGGACGACTACGTCCCATGCTACACTCGTTCACTATAGAC
TTGTTGGATGTAGCTCGTGAATGGCCACGTGGGAACATACTTCTTCATGTGAACGTCCGA
GCTCATGTGCAATCACAGATGTCCAGGCAGCTAGTCGTATTGAACGCGTGGCTTGGTGCG
TGGGGACAGGAGAGGAGACAGAGAACAGCTAAACTGATTCCTGGTACTGAGAAACCGCAA
TTACATCAGACAACTTCTAGCAAAAACCCACTACAACAAGATACAGATAGGGACCATCCA
TTGTTCACTCTTGGTGTTTAA
Protein sequence:
MSGIEDGELARDFEYDFAEDALDFEQQSPSPTDNRNIDVVHALPTLVVEEKRGVGIRNDR
KGWEQEKGNRLSHSSDETQSLNTTSRRSCERVNQLSEARDVKFTQNLTEPLSIGSHIICT
GTPSDDLPWFAVNIGCGDPSRSDIAIHFNVRLPQCYVVRNTKRHDKWGMEETTAYRSFPF
KIDRPFTIEVLIDEKEALWAIDGEHYCSYAHRNPSPLNATWVQVTGIRDAALKIQKTDIY
PTLSPPPLAVPIKSDSNDTIEDEPKWHPNVIATIADGIPEGHQIVIRGRLRPMLHSFTID
LLDVAREWPRGNILLHVNVRAHVQSQMSRQLVVLNAWLGAWGQERRQRTAKLIPGTEKPQ
LHQTTSSKNPLQQDTDRDHPLFTLGV