Journal of Biological Chemistry
Volume 281, Issue 47, 24 November 2006, Pages 35884-35893
Journal home page for Journal of Biological Chemistry

Glycobiology and Extracellular Matrices
Crystal Structure of the Galectin-9 N-terminal Carbohydrate Recognition Domain from Mus musculus Reveals the Basic Mechanism of Carbohydrate Recognition*

https://doi.org/10.1074/jbc.M606648200Get rights and content
Under a Creative Commons license
open access

The galectins are a family of β-galactoside-binding animal lectins with a conserved carbohydrate recognition domain (CRD). They have a high affinity for small β-galactosides, but binding specificity for complex glycoconjugates varies considerably within the family. The ligand recognition is essential for their proper function, and the structures of several galectins have suggested their mechanism of carbohydrate binding. Galectin-9 has two tandem CRDs with a short linker, and we report the crystal structures of mouse galectin-9 N-terminal CRD (NCRD) in the absence and the presence of four ligand complexes. All structures form the same dimer, which is quite different from the canonical 2-fold symmetric dimer seen for galectin-1 and -2. The β-galactoside recognition mechanism in the galectin-9 NCRD is highly conserved among other galectins. In the apo form structure, water molecules mimic the ligand hydrogen-bond network. The galectin-9 NCRD can bind both N-acetyllactosamine (Galβ1–4GlcNAc) and T-antigen (Galβ1–3GalNAc) with the proper location of Arg-64. Moreover, the structure of the N-acetyllactosamine dimer (Galβ1–4GlcNAcβ1–3Galβ1–4GlcNAc) complex shows a unique binding mode of galectin-9. Finally, surface plasmon resonance assay showed that the galectin-9 NCRD forms a homophilic dimer not only in the crystal but also in solution.

Cited by (0)

*

This work was supported by the Protein 3000 project from the Ministry of Education, Culture, Sports, Science and Technology of Japan. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The atomic coordinates and structure factors (code 2D6K, 2D6L, 2D6M, 2D6N, 2D6P, and 2D6O) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).

The on-line version of this article (available at http://www.jbc.org) contains supplemental Figs. S1 and S2.