Chapter Nine - High-Resolution Modeling of Protein Structures Based on Flexible Fitting of Low-Resolution Structural Data

https://doi.org/10.1016/bs.apcsb.2014.06.004Get rights and content

Abstract

To circumvent the difficulty of directly solving high-resolution biomolecular structures, low-resolution structural data from Cryo-electron microscopy (EM) and small angle solution X-ray scattering (SAXS) are increasingly used to explore multiple conformational states of biomolecular assemblies. One promising venue to obtain high-resolution structural models from low-resolution data is via data-constrained flexible fitting. To this end, we have developed a new method based on a coarse-grained Cα-only protein representation, and a modified form of the elastic network model (ENM) that allows large-scale conformational changes while maintaining the integrity of local structures including pseudo-bonds and secondary structures. Our method minimizes a pseudo-energy which linearly combines various terms of the modified ENM energy with an EM/SAXS-fitting score and a collision energy that penalizes steric collisions. Unlike some previous flexible fitting efforts using the lowest few normal modes, our method effectively utilizes all normal modes so that both global and local structural changes can be fully modeled with accuracy. This method is also highly efficient in computing time. We have demonstrated our method using adenylate kinase as a test case which undergoes a large open-to-close conformational change. The EM-fitting method is available at a web server (http://enm.lobos.nih.gov), and the SAXS-fitting method is available as a pre-compiled executable upon request.

Introduction

The biological functions of many biomolecules involve multiple conformational states and dynamic transitions in between. Despite rapid progress in structural biology, it remains highly difficult to directly solve all conformational states of biomolecules by high-resolution structural determination protocols such as X-ray crystallography and nuclear magnetic resonance spectroscopy. As attractive alternatives, low-resolution structure-determining techniques, including cryo-electron microscopy (cryo-EM) and small angle solution X-ray scattering (SAXS) are widely used. Cryo-EM constructs three-dimensional electron density maps for large biomolecular complexes at near/subnanometer resolutions based on a large number of two-dimensional images (Chiu et al., 2005, Saibil, 2000). SAXS measures orientationally averaged X-ray scattering intensity for biomolecules in solution which contains information about the size and shape of the biomolecules (Koch et al., 2003, Mertens and Svergun, 2010, Putnam et al., 2007). These low-resolution techniques alone cannot generate unique high-resolution structural models with atomistic details. However, they offer highly informative constraints for generating and selecting high-resolution structural models using computational methods (Esquivel-Rodriguez and Kihara, 2013, Fabiola and Chapman, 2005, Lindert et al., 2009). One viable venue of utilizing such constraints is “flexible fitting” (Flores, 2014, Lopez-Blanco and Chacon, 2013, Pandurangan et al., 2014, Wriggers and Birmanns, 2001), which flexibly deforms an initial protein structure to fit the given low-resolution data from cryo-EM or SAXS. An alternative venue is “rigid-body fitting” (Arai et al., 2004, Bernado et al., 2007, Bernado et al., 2009, Bernado et al., 2008, Rawat et al., 2003, Shiozawa et al., 2009, Volkmann et al., 2000, Wendt et al., 2001) which divides a protein complex into multiple domains and then fits them separately as rigid bodies. The rigid-body fitting methods depend on a subjective and error-prone partition of a biomolecule into rigid domains and ignore coupled motions between domains which may be functionally important. The success of the flexible fitting methods requires careful validation of the fitted models to avoid overfitting of the low-resolution data (Falkner and Schroder, 2013, Vashisth et al., 2013). This is particularly important for SAXS data which contains much less structural information than cryo-EM maps and there is additional contribution from hydration shell near the surface of biomolecules.

The flexibility of biomolecules can be simulated by various computational methods at different levels of details. Molecular dynamics (MD) simulation is, in principle, able to describe the dynamics of biomolecules under physiological conditions (i.e., in the presence of water and ions) with atomic details, which makes it a method of choice for flexible fitting (Li & Frank, 2007). Recently, several MD-based methods have been introduced for cryo-EM fitting with full flexibility. The common strategy of these methods is to bias the MD simulation toward a conformation that optimally fits the cryo-EM data by using a biasing potential function (Caulfield and Harvey, 2007, Chan et al., 2012, Noda et al., 2006, Orzechowski and Tama, 2008, Trabuco et al., 2008, Trabuco et al., 2009, Wu et al., 2013). The application of these methods, however, has been limited by the high-computational cost of running MD simulations for large biomolecular systems.

As an efficient alternative, the flexibility of biomolecules can be analyzed by coarse-grained models where a group of atoms are represented by a coarse-grained bead (Tozzini, 2005). For example, the elastic network model (ENM) (Atilgan et al., 2001, Hinsen, 1998, Tama and Sanejouand, 2001) represents a protein structure as a network of Cα atoms with neighboring ones connected by springs with a uniform force constant (Tirion, 1996). The ENM has been successfully used to assist the fitting of cryo-EM and X-ray data (Schroder et al., 2007, Schroder et al., 2010, Tan et al., 2008). In particular, ENM-based normal mode analysis (NMA) has been widely utilized to flexibly fit high-resolution structures to low-resolution structural data (Delarue and Dumas, 2004, Falke et al., 2005, Gorba et al., 2008, Hinsen et al., 2005, Mitra et al., 2005, Suhre et al., 2006, Tama et al., 2004a, Tama et al., 2004b, Tama et al., 2006), or satisfy a few pair wise distance constraints (Zheng and Brooks, 2005, Zheng and Brooks, 2006). Despite great success, the ENM/NMA-based flexible fitting methods are limited in accuracy because they usually only use a few low-frequency normal modes solved from ENM, which are less accurate for describing small local conformational changes (like rearrangement of helices inside a densely packed region) than large global ones (like domain motions) (Tama & Sanejouand, 2001). Indeed, it was found that many (> 20) normal modes are needed to accurately describe several observed conformational changes in proteins (Petrone & Pande, 2006).

Recently, to achieve both accuracy and efficiency in the flexible fitting of cryo-EM (Zheng, 2011) and SAXS (Zheng & Tekpinar, 2011) data, we have developed a coarse-grained method based on a modified form of the ENM (named mENM) that combines the harmonic interactions for maintaining pseudo-bonds and secondary structures, and the anharmonic interactions between nonbonded residues to allow them to move apart readily. As a result, mENM allows large global structural changes without distorting local structures. Our method is based on minimization of a pseudo-energy which linearly combines various terms of the mENM energy with an EM/SAXS-fitting score and a collision energy that penalizes steric collisions (see Section 2). Our minimization-based implementation has two advantages: (a) unlike some previous flexible fitting efforts using the lowest few normal modes (Tama et al., 2004a, Tama et al., 2004b), our method effectively utilizes all normal modes so that both global and local structural changes can be fully modeled with accuracy. (b) It is efficiently implemented using the Newton–Raphson algorithm based on a sparse linear-equation solver (Chen, Davis, Hager, & Rajamanickam, 2008) which is significantly faster than NMA.

In this chapter, we will describe the methodological details of our flexible fitting method (see Section 2), and demonstrate its usage by applying it to the test case of adenylate kinase using simulated EM/SAXS data (see Section 3). Please refer to our papers (Zheng, 2011, Zheng and Tekpinar, 2011) for more test cases with both simulated and experimental data.

Section snippets

Modified elastic network model (mENM)

A Cα-only ENM is constructed from the atomic coordinates of a protein structure (available from the Protein Data Bank). Each residue is represented by a bead located at the Cα atom. The original form of the ENM potential energy (Tirion, 1996) isEENM=12i<jCijθRcdij,0dijdij,02,where dij is the distance between bead i and j, and dij,0 is the value of dij given by the native structure, θ(x) is the Heaviside function, Rc is the cutoff distance chosen to be 10 Å following our previous study (Zheng,

Results

We demonstrate our flexible fitting method using a test case of adenylate kinase, which consists of two high-resolution X-ray structures corresponding to a closed and open form (PDB code: 1ake and 4ake) at the two ends of a large conformational transition (7.1 Å in RMSD). One of these structures (target structure) is used to simulate the target cryo-EM map or SAXS profile for fitting, the other one is used as the starting model for flexible fitting. The target SAXS profile is simulated using

Discussion

We have presented a new computational method to flexibly fit a given protein structure to a cryo-EM map or a SAXS profile using a coarse-grained Cα-only model. Our goal is to build a high-resolution structural model compatible with the given low-resolution data while maintaining its local structural integrity. Our method uses a modified form of ENM that allows large-scale conformational changes while maintaining pseudo-bonds and secondary structures and avoiding residue collisions. Our method

Acknowledgment

We thank the funding support from NSF (grant #0952736).

References (65)

  • C.C. Jolley et al.

    Fitting low-resolution cryo-EM maps of proteins using constrained geometric simulations

    Biophysical Journal

    (2008)
  • S. Lindert et al.

    Hybrid approaches: Applying computational methods in cryo-electron microscopy

    Current Opinion in Structural Biology

    (2009)
  • J.R. Lopez-Blanco et al.

    iMODFIT: Efficient and robust flexible fitting based on vibrational analysis in internal coordinates

    Journal of Structural Biology

    (2013)
  • H.D.T. Mertens et al.

    Structural characterization of proteins and complexes using small-angle X-ray solution scattering

    Journal of Structural Biology

    (2010)
  • M. Orzechowski et al.

    Flexible fitting of high-resolution X-ray structures into cryoelectron microscopy maps using biased molecular dynamics simulations

    Biophysical Journal

    (2008)
  • A.P. Pandurangan et al.

    Combined approaches to flexible fitting and assessment in virus capsids undergoing conformational change

    Journal of Structural Biology

    (2014)
  • M.V. Petoukhov et al.

    Global rigid body modeling of macromolecular complexes against small-angle scattering data

    Biophysical Journal

    (2005)
  • P. Petrone et al.

    Can conformational change be described by only a few normal modes?

    Biophysical Journal

    (2006)
  • G.F. Schroder et al.

    Combining efficient conformational sampling with a deformable elastic network model facilitates structure refinement at low resolution

    Structure

    (2007)
  • K. Shiozawa et al.

    Solution structure of human Pex5.Pex14.PTS1 protein complexes obtained by small angle X-ray scattering

    Journal of Biological Chemistry

    (2009)
  • F. Tama et al.

    Flexible multi-scale fitting of atomic structures into low-resolution electron density maps with elastic network normal mode analysis

    Journal of Molecular Biology

    (2004)
  • F. Tama et al.

    NMFF: Flexible high-resolution annotation of low-resolution experimental data from cryo-EM maps using normal mode analysis

    Journal of Structural Biology

    (2004)
  • R.K. Tan et al.

    YUP.SCX: Coaxing atomic models into medium resolution electron density maps

    Journal of Structural Biology

    (2008)
  • V. Tozzini

    Coarse-grained models for proteins

    Current Opinion in Structural Biology

    (2005)
  • L.G. Trabuco et al.

    Flexible fitting of atomic structures into electron microscopy maps using molecular dynamics

    Structure

    (2008)
  • L.G. Trabuco et al.

    Molecular dynamics flexible fitting: A practical guide to combine cryo-electron microscopy and X-ray crystallography

    Methods

    (2009)
  • W. Wriggers et al.

    Using situs for flexible and rigid-body fitting of multiresolution single-molecule data

    Journal of Structural Biology

    (2001)
  • W. Wriggers et al.

    Situs: A package for docking crystal structures into low-resolution maps from electron microscopy

    Journal of Structural Biology

    (1999)
  • X. Wu et al.

    Targeted conformational search with map-restrained self-guided Langevin dynamics: Application to flexible fitting into electron microscopic density maps

    Journal of Structural Biology

    (2013)
  • S. Yang et al.

    A rapid coarse residue-based computational method for X-ray solution scattering characterization of protein folds and multiple conformational states of large protein complexes

    Biophysical Journal

    (2009)
  • W. Zheng

    Accurate flexible fitting of high-resolution protein structures into cryo-electron microscopy maps using coarse-grained pseudo-energy minimization

    Biophysical Journal

    (2011)
  • W. Zheng et al.

    Normal-modes-based prediction of protein conformational changes guided by distance constraints

    Biophysical Journal

    (2005)
  • Cited by (4)

    • State-dependent sequential allostery exhibited by chaperonin TRiC/CCT revealed by network analysis of Cryo-EM maps

      2021, Progress in Biophysics and Molecular Biology
      Citation Excerpt :

      In particular, the global modes or softest modes at the lowest frequency end of the spectrum robustly enable the predisposition of the structure to cooperative changes relevant to function (Bahar et al., 2010b; Zheng et al., 2006). These properties have led to the development of many flexible fitting algorithms using NMA (Costa et al., 2020; Hinsen et al., 2005; Lopez-Blanco and Chacon, 2013; Matsumoto and Ishida, 2009; Miyashita and Tama, 2018; Suhre et al., 2006; Tama et al., 2004a; Velazquez-Muriel and Carazo, 2007; Zheng and Tekpinar, 2014) as well as hybrid methods that utilize ENMs for modelling mechanisms of action, allosteric signalling and evolution (Krieger et al., 2020). While ENM NMA emerged as an efficient tool for investigating the dynamics of large systems where high resolution atomic models may be lacking, such as those characterised by cryo-EM, a barrier has been that electron density maps could not be readily converted into a network model.

    • Flexible Fitting of Atomic Models into Cryo-EM Density Maps Guided by Helix Correspondences

      2017, Biophysical Journal
      Citation Excerpt :

      It is found by minimizing a quadratic error function similar to that in Eq. 2, except wij = 1 for any i,j. We selected six pairs of proteins from the PDB, which have been used to evaluate other protein fitting methods (43,44). The selected protein pairs have identical or nearly identical amino acid sequences (98.86–100% similarity) but exhibit a wide variety of collective conformational changes.

    View full text