The Unfolded State of the Villin Headpiece Helical Subdomain: Computational Studies of the Role of Locally Stabilized Structure

https://doi.org/10.1016/j.jmb.2006.04.070Get rights and content

Abstract

The 36 residue villin headpiece helical subdomain (HP36) is one of the fastest cooperatively folding proteins, folding on the microsecond timescale. HP36‘s simple three helix topology, fast folding and small size have made it an attractive model system for computational and experimental studies of protein folding. Recent experimental studies have explored the denatured state of HP36 using fragment analysis coupled with relatively low-resolution spectroscopic techniques. These studies have shown that there is apparently only a small tendency to form locally stabilized secondary structure. Here, we complement the experimental studies by using replica exchange molecular dynamics with explicit solvent to investigate the structural features of these peptide models of unfolded HP36. To ensure convergence, two sets of simulations for each fragment were performed with different initial structures, and simulations were continued until these generated very similar final ensembles. These simulations reveal low populations of native-like structure and early folding events that cannot be resolved by experiment. For each fragment, calculated J-coupling constants and helical propensities are in good agreement with experimental trends. HP-1, corresponding to residues 41 to 53 and including the first α-helix, contains the highest helical population. HP-3, corresponding to residues 62 through 75 and including the third α-helix, contains a small population of helical turn residing at the N terminus while HP-2, corresponding to residues 52 through 61 and including the second α-helix, formed little to no structure in isolation. Overall, HP-1 was the only fragment to adopt a native-like conformation, but the low population suggests that formation of significant structure only occurs after formation of specific tertiary interactions.

Introduction

Structure in the unfolded state may play a significant role in the rapid folding of proteins by limiting the conformational search. Recent experimental work from the Fersht and Oas laboratories has highlighted the role of unfolded state structure in the rapid folding of helical proteins.1., 2. Other work has suggested the importance of polyproline II conformation (PPII) structure in the unfolded ensemble.3., 4., 5., 6. Unfortunately, direct experimental studies of the unfolded state are difficult because the most relevant unfolded state is that which is in equilibrium with the folded state under native conditions. The normal high cooperativity of folding together with the free energy balance of folding means that this state is only sparsely populated at equilibrium. Experimental difficulties also arise because of the short lifetime of the denatured state in refolding experiments. Consequently, indirect methods have to be employed but many approaches fail to examine the unfolded state under equilibrium conditions.7., 8., 9., 10., 11.

One indirect approach to studying the denatured state under native conditions is to analyze peptide fragments corresponding to elements of secondary structure derived from the whole protein. Peptide fragment analysis provides the local propensity for secondary structure formation and a potential glimpse at structures that may form in the early stages of folding. Such locally stabilized structure can play a role in rapid folding by limiting the early stages of the conformational search. For example, one popular model for folding, the diffusion collision model, postulates a critical role for locally stabilized microdomains. The determination of these structural details is potentially of great importance for the folding of helical proteins.2., 12., 13.

The villin headpiece helical subdomain (HP36), the C-terminal portion of the villin headpiece, is the shortest naturally occurring sequence that has been shown to fold cooperatively (Figure 1). Its rapid folding, small size and simple topology of three helices have made this domain an extremely popular system for computational and theoretical studies.12., 14., 15., 16., 17., 18., 19., 20., 21., 22. These studies have largely focused on generation of the correct native topology and have not investigated the details of the folding mechanism or the role of residual structure in the unfolded state.

Recent experimental work has explored the possibility of residual structure in the unfolded state of HP36.23 In that work, a set of fragments corresponding to the three α-helices were studied as well as a larger fragment containing the first two helices. None of the individual peptide fragments showed significant helical content as judged by circular dichroism (CD) spectroscopy. However, two of the helices in HP36 are quite small in fragments 1 (HP-1) and 2 (HP-2) and the CD spectra of short helices are not well understood.24., 25., 26. Thus it is not clear how best to interpret CD studies of the small helices that may be formed by these fragments, particularly when NMR studies hint at some tendency to form non-random structure. The experimentally measured 1H-alpha chemical shift deviations from random coil (approximately 0.25 ppm upfield) observed for the HP36 fragments suggest either sparsely populated helical conformations or ring current effects in HP-1 and fragment 3 (HP-3). These potential ambiguities are due to the limitations of the experimental methods.

Simulations can help overcome these limitations and allow for the observation of structure at the level of individual molecules instead of the ensemble averages typically provided by experiments. Computational studies can also provide atomic level detail concerning specific interactions that may not be readily available from experimental studies of rapidly interconverting ensembles. This enhances the understanding of mechanistic details of protein folding and structure. However, conformational sampling remains a significant obstacle in molecular dynamics (MD) simulations. Generation of precise populations at equilibrium is difficult due to the protein folding time-scale being much longer than is typically accessible to simulation. Hence, the study of partially populated states through simulation is hampered by poor convergence.

Replica exchange molecular dynamics (REMD) is an enhanced sampling technique27., 28., 29. that can help overcome the limited time-scale issues, yet it remains a challenging task to obtain converged results, particularly for large systems. Many different studies have used REMD to study folding in smaller model peptide systems,30., 31., 32., 33., 34., 35. however studies of unfolded state structure have been more limited.30., 36.

Here, we analyze the same set of short fragments of HP-36 that were studied experimentally in an attempt to clarify the extent of locally stabilized secondary structure. We conducted REMD simulations using both an implicit and explicit solvent model for each fragment. The results demonstrate that explicit solvent is the more accurate approach for studying these small peptides. We find that HP-1 possesses the most native-like structure of the three fragments, and the potential role that locally stabilized structure may play in the fast folding of HP36 is discussed.

Section snippets

Convergence of simulations allows for reliable populations of low levels of structure

When the goal of a simulation study is simply to identify a low-energy conformation, it is typically unnecessary to generate a Boltzmann-weighted ensemble with conformations populated according to relative energies. However, when one wishes to use these results to gain insight into the relationship of the unfolded state to the folding process, it is necessary to obtain a reliable and quantitative estimation of the extent to which any residual structure is present in the unfolded state, with

What may stabilize the high population of helical structure in HP-1?

We examined the entire HP-1 ensemble to identify contacts that may be playing a part in stabilizing the helical structure. Approximately 50% of the ensemble had ion-pair contacts between D44–K48 (27( ± 4) %), E45–K48(8( ± 2) %), and both D44 and E45 with K48 (14.0( ± 0.03) %). Another contact was present (85.0( ± 0.5) %) involving the D44 backbone carbonyl and the charged side-chain of K48; this is present alone (56( ± 4) %) and with the charged side-chain of D44 (30( ± 6) %). These contacts are not

Methods

Three fragments were built from the sequence of HP36: M41–F76 (MLSDEDF KAVFGMTRSAFANLPWKQQNLKKEKGLF). HP-1 (M41–M53) corresponds to the N-terminal helix of HP36. HP-2 (G52–L61) contains the second helix and HP-3 (P62–L75) contains the C-terminal helix. HP-1 has a free N terminus while HP-2 and HP-3 have acetylated N termini. All C termini were amidated. These sequences and termini correspond to those used in the experimental studies.23 All side-chains for Asp, Glu, Lys, and Arg were charged

Acknowledgements

The authors thank Ms Yuefeng Tang for valuable discussions concerning HP36. We gratefully acknowledge computer time at NCSA through MCA02N028 and financial support from the National Institutes of Health (NIH GM6167803 (CS) and Department of Energy (Contract DE-AC02N0280. Additional computer time and assistance was provided by Roberto Gomperts and the SGI Engineering group. C.S. is a Cottrell Scholar of the Research Corporation.

References (70)

  • M. Feig et al.

    Recent advances in the development and application of implicit solvent models in biomolecule simulations

    Curr. Opin. Struct. Biol.

    (2004)
  • T. Ghosh et al.

    Role of backbone hydration and salt-bridge formation in stability of alpha helix in solution

    Biophys. J.

    (2003)
  • V. Daggett et al.

    Is there a unifying mechanism for protein folding?

    Trends Biochem. Sci.

    (2003)
  • J.P. Ryckaert et al.

    Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes

    J. Comput. Phys.

    (1977)
  • G.D. Hawkins et al.

    Pairwise solute descreening of solute charges from a dielectric medium

    Chem. Phys. Letters

    (1995)
  • T.L. Religa et al.

    Solution structure of a protein denatured state and folding intermediate

    Nature

    (2005)
  • J.K. Myers et al.

    Preorganized secondary structure as an important determinant of fast protein folding

    Nature Struct. Biol.

    (2001)
  • Z. Shi et al.

    Polyproline II structure in a sequence of seven alanine residues

    Proc. Natl Acad. Sci. USA

    (2002)
  • S.A. Asher et al.

    UV Raman demonstrates that alpha-helical polyalanine peptides melt to polyproline II conformations

    J. Am. Chem. Soc.

    (2004)
  • M. Mezei et al.

    Polyproline II helix is the preferred conformation for unfolded polyalanine in water

    Proteins: Struct. Funct. Genet.

    (2004)
  • A. Kentsis et al.

    Unfolded state of polyalanine is a segmented polyproline II helix

    Proteins: Struct. Funct. Genet.

    (2004)
  • D. Shortle

    The denatured state (the other half of the folding equation) and its role in protein stability

    Faseb J.

    (1996)
  • D. Shortle et al.

    Persistence of native-like topology in a denatured protein in 8 M urea

    Science

    (2001)
  • O. Zhang et al.

    NMR studies of unfolded states of an SH3 domain in aqueous solution and denaturing conditions

    Biochemistry

    (1997)
  • M. Karplus et al.

    Protein folding dynamics: the diffusion-collision model and experimental data

    Protein Sci.

    (1994)
  • Y. Duan et al.

    Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution

    Science

    (1998)
  • B. Zangrovic et al.

    Native-like mean structure in the unfolded ensemble of small proteins

    J. Mol. Biol.

    (2002)
  • S. Jang et al.

    Ab initio folding of helix bundle proteins using molecular dynamics simulations

    J. Am. Chem. Soc.

    (2003)
  • A. Fernandez et al.

    Large-scale context in protein folding: villin headpiece

    Biochemistry

    (2003)
  • D.C. Sullivan et al.

    Conformation spaces of proteins

    Proteins: Struct. Funct. Genet.

    (2001)
  • G.M. De Mori et al.

    Study of the Villin headpiece folding dynamics by combining coarse-grained Monte Carlo evolution and all-atom molecular dynamics

    Proteins: Struct. Funct. Genet.

    (2005)
  • D.C. Sullivan et al.

    Protein folding as biased conformational diffusion

    J. Phys. Chem. B

    (2002)
  • Y. Tang et al.

    Peptide models provide evidence for significant structure in the denatured state of a rapidly folding protein: the villin headpiece subdomain

    Biochemistry

    (2004)
  • M.C. Manning et al.

    Theoretical CD studies of polypeptide helices: examination of important electronic and geometric factors

    Biopolymers

    (1991)
  • D.H. Chin et al.

    Circular dichroism spectra of short, fixed-nucleus alanine helices

    Proc. Natl Acad. Sci. USA

    (2002)
  • Cited by (46)

    • Molecular simulation of peptides coming of age: Accurate prediction of folding, dynamics and structures

      2019, Archives of Biochemistry and Biophysics
      Citation Excerpt :

      Studies such as these highlight how small force field inaccuracies can have an additive effect and can result in stabilising misfolded non-native structures. AMBER's ff99SB was the first successful version to convincingly compensate for that and displayed superior performance over the rest of the force fields of its time: literature consistently provided overwhelming evidence of significantly improved balance of secondary structural elements and reasonable agreement with experimental data through multiple comparative validation studies comprising from short glycine and alanine peptides to longer 10–20 residue peptides (disordered, helical, beta) up to small proteins like lysozyme and ubiquitin [8,58,79,92,94,99,102,123,144,145,168,168,169,197,198]. Follow-up studies suggested two more force fields, f99SB-ILDN-ϕ and ff99SB-ILDN-NMR, with significantly improved agreement with NMR observables, highlighting though that large uncertainties accompany this agreement: statistical calculated errors in the validation were in the margin of systematic experimental errors of these parameters [9].

    • A compact native 24-residue supersecondary structure derived from the villin headpiece subdomain

      2015, Biophysical Journal
      Citation Excerpt :

      Even though the CD spectrum of HP24wt shows a very characteristic random-coil-like spectrum, the NMR data unambiguously point to the presence of two very short helices connected by a rigid loop within the central part of HP24, which could be described as a residual structure in the unfolded state of HP35wt. The presence of residual structure in other truncated variants (helices 1 and 2) was also observed in previous studies (6,20,50,54,68), and thus the central part of HP35 plays a crucial role in local structuring of the unfolded state. Our NMR relaxation data show that the helices have very flexible ends but are not fully unstructured, which can be interpreted by means of a diffusing boundary model (72) of two transiently stable helices.

    • Uncovering specific electrostatic interactions in the denatured states of proteins

      2010, Biophysical Journal
      Citation Excerpt :

      Because force fields have been traditionally developed by targeting native-state properties, the accuracy for describing the energetics in denatured states remains to be established. Nevertheless, limited published data points toward a major problem involving the backbone torsional bias in commonly used force fields (28–30,45). Overstabilization of helical segments and local electrostatic interactions as revealed by our data may be a major reason for overcompaction of denatured states as observed in previous implicit-solvent simulation studies (45,46).

    View all citing articles on Scopus
    View full text