Decoupling a tandem-repeat protein: Impact of multiple loop insertions on a modular scaffold

Perez-Riba, Albert; Komives, Elizabeth; Main, Ewan R. G.; Itzhaki, Laura S.

doi:10.1038/s41598-019-49905-4

Download PDF

Article
Open access
Published: 28 October 2019

Decoupling a tandem-repeat protein: Impact of multiple loop insertions on a modular scaffold

Albert Perez-Riba¹^nAff2,
Elizabeth Komives³,
Ewan R. G. Main⁴ &
…
Laura S. Itzhaki¹

Scientific Reports volume 9, Article number: 15439 (2019) Cite this article

1711 Accesses
2 Citations
Metrics details

Subjects

Abstract

The simple topology and modular architecture of tandem-repeat proteins such as tetratricopeptide repeats (TPRs) and ankyrin repeats makes them straightforward to dissect and redesign. Repeat-protein stability can be manipulated in a predictable way using site-specific mutations. Here we explore a different type of modification - loop insertion - that will enable a simple route to functionalisation of this versatile scaffold. We previously showed that a single loop insertion has a dramatically different effect on stability depending on its location in the repeat array. Here we dissect this effect by a combination of multiple and alternated loop insertions to understand the origins of the context-dependent loss in stability. We find that the scaffold is remarkably robust in that its overall structure is maintained. However, adjacent repeats are now only weakly coupled, and consequently the increase in solvent protection, and thus stability, with increasing repeat number that defines the tandem-repeat protein class is lost. Our results also provide us with a rulebook with which we can apply these principles to the design of artificial repeat proteins with precisely tuned folding landscapes and functional capabilities, thereby paving the way for their exploitation as a versatile and truly modular platform in synthetic biology.

De novo design of knotted tandem repeat proteins

Article Open access 24 October 2023

Lindsey A. Doyle, Brittany Takushi, … Philip Bradley

Rigid fusions of designed helical repeat binding proteins efficiently protect a binding surface from crystal contacts

Article Open access 07 November 2019

Patrick Ernst, Annemarie Honegger, … Andreas Plückthun

Design of functionalised circular tandem repeat proteins with longer repeat topologies and enhanced subunit contact surfaces

Article Open access 29 October 2021

Jazmine P. Hallinan, Lindsey A. Doyle, … Barry L. Stoddard

Introduction

Repeat proteins such as ankyrin repeats and tetratricopeptide repeats (TPRs) can be viewed as quasi one-dimensional arrays of small structural elements (typically 20–40 residues). They fold into elongated, non-globular structures that are stabilised only by local interactions whether within repeats or between adjacent repeats^{1,2,3,4,5,6,7,8,9,10,11}. This architecture contrasts with the three-dimensional connectivity of typical globular protein, which contains many sequence-distant interactions that usually play critical roles in their folding, and it has been widely exploited in the design of repeat proteins^{12,13,14,15,16,17}. The translational structural symmetry of repeat proteins is reflected in their energy landscapes^8,11 and makes them both amenable to gross manipulation without destroying the overall structure (e.g. addition or deletion of repeats^{10,14,18,19,20}) as well as sensitive to even small perturbations (e.g. the folding route can be redirected by single amino-acid substitutions^{2,4,21,22,23,24}). The repetitive architecture also means that we can describe their thermodynamic stability using a simple one-dimensional Ising formalism. According to the Ising model, the protein is considered to be a collection of interacting units, each of which is in one of two states (folded or unfolded). Each unit is defined by an intrinsic energy term (energy difference between folded and unfolded states) and an interaction energy term (“coupling” or interfacial interaction between adjacent folded units). Thus, for artificial proteins made up of identical repeats, their thermodynamic stabilities can be adequately defined using these two parameters of the intrinsic repeat and the inter-repeat (interfacial) energies. This is referred to as a homopolymer Ising model^10,25. The stabilities of non-identical repeat proteins can also be described using an Ising model, but a more complex description is required, which is referred to as the heteropolymer model^{25,26,27,28,29}.

We previously used Ising models to explore the energetic consequences of extending the short (4-residue) loop between adjacent repeats of TPR proteins²⁹. Strikingly, we found that the effect of loop extension is profoundly context-dependent: Extension of the central loop (between the third and fourth repeats) in a protein comprising six consensus-designed repeats (CTPR6) is much more destabilising than the same loop extension in the two-repeat protein (CTPR2). This behaviour might seem counterintuitive; however, it can be rationalised within the framework of stabilising nearest-neighbour effects of the repeat-protein architecture. Here we construct series of CTPR proteins configured with different combinations of short and long inter-repeat loops. We find that for the most extreme configuration, in which every loop of the repeat array is extended, adjacently repeats are now only very weakly coupled. Consequently, the increase in stability with repeat number, which is the defining feature of tandem-repeat proteins, breaks down. We obtain estimates of the energetics of loop extension as a function of the location and number of loops within the repeat array, and we show how dramatic this context dependence is: The cost of inserting the same 10-residue sequence ranges from as little as 0.5 kcal mol⁻¹ to as much as 4 kcal mol⁻¹. Our findings provide us with a “rulebook” for building artificial repeat proteins with customised energy landscapes that we can implement to functionalise this versatile scaffold, for example through the insertion of binding or catalytic loops. This truly modular toolbox thereby offers a platform for the development of designer proteins in synthetic biology.

Results

Design of CTPR constructs

20 different CTPR proteins were constructed (Fig. 1). The proteins comprised two, three, four and six consensus repeats and had one of the following features: (i) wild-type inter-repeat loop sequence (DPRS) (referred to as CTPRa proteins^14,30) (Fig. 1a-c); (ii) multi-loop variants with 10-residue or 25-residue extensions inserted into every inter-repeat loop (CTPRm proteins) (Fig. 1a,d), (iii) “alt-loop” variants, which have 10-residue extensions in alternate inter-repeat loops (CTPRalt proteins) (Fig. 1e); and (iv) a single 10-residue loop extension inserted between either the third and fourth repeats or the first and second repeats of a six repeat protein (Fig. 1e). The 10-residue and 25-residue loop extensions contained a poly(GS) sequence and a thrombin cleavage site (used previously to confirm that the loop extensions were solvent-accessible²⁹). It is important to note that the simple schematics show the TPR proteins as linear but in fact they form a superhelix and therefore adjacent loops are offset relative to each other (Fig. 1f)³¹. In order to simplify our data analysis, the proteins did not contain a C-terminal capping helix (used in some previous studies^{10,11,14,28,29,30}).

Structure, stability and folding of 10- and 25-residue multi-loop CTPR proteins

We first investigated the native structure, folding cooperativity and thermodynamic stability of the 25- and 10-residue multiloop insertion series and compared these values with those of the wild-type series (CTPRm25, CTPRm10 versus CTPRa series, respectively). Native secondary structure was characterised using far-UV circular dichroism spectroscopy (CD) (Fig. S1), with folding cooperativity and thermodynamic stability probed using chemical-induced equilibrium denaturation curves monitored by intrinsic fluorescence (Fig. 2) and CD (Fig. S2). The CD spectra indicate that all proteins had a high α-helical content, as shown by the negative ellipticity at 222 nm. Importantly, within each series of proteins the α-helical signal at 222 nm increased in rough proportion to the number of CTPR motifs in the protein (Table S2). This observation suggests that all of the proteins adopt the native TPR structure with a full complement of α-helices.

Next, the chemical-induced equilibrium denaturation curves of each series were compared (Figs 2 and S2). The results show that the CTPRa and CTPRm proteins unfold via a single transition with every multi-loop protein being destabilised relative to its wild-type CTPRa parent. To obtain a quantitative comparison, each denaturation curve was first fitted with a two-state equation to give midpoint of unfolding (D_50%), m-value and free energy of unfolding (Tables 1 and S3). The data highlight a significant difference between the CTPRa proteins and the multi-loop variants. In contrast to the “normal” consensus-designed repeat CTPRa series, wherein the midpoint, m-value and thus stability increases with increasing repeat number, there is no such additive increase in stability for the CTPRm25 series. There is a very small increase in stability between two and three repeats (0.3 kcal mol⁻¹) and no further change in stability between three, four and six repeats. The behaviour of the shorter 10-residue multi-loop series (CTPRm10) was similar to that of the CTPRm25 series (Fig. 2 and Table 1). The midpoints of unfolding increase modestly with increasing number of repeats but there is not a corresponding increase in the m-values, and therefore there is only a small increase in stability with increasing number of repeats. The absence of the additive increase in stability with increasing repeat number meant that neither of the multi-loop protein series could be fitted to a 1-D homopolymer or heteropolymer Ising models. In comparison, the CTPRa protein series fit well to the 1-D homopolymer model (Fig. S3), giving intrinsic and interface energies that were in agreement with previous studies (Table S4: −1.0 kcal mol⁻¹ and −3.7 kcal mol⁻¹, respectively)³⁰. These results show that, although the introduction of multiple long inter-repeat loops into the CTPR scaffold does not cause the individual repeats to be natively unfolded, it does significantly reduce the stability and uncouples the co-operative unfolding of the CTPR proteins to an extent that is dependent on the length of the loop extensions.

Table 1 Parameters obtained by fitting the equilibrium denaturations monitored by fluorescence of the CTPRa, CTPRm25, CTPRm10, CTPRm10D series to a two-state folding model.

Full size table

Hydrogen-deuterium exchange highlights the repeat decoupling induced by loop extension

To explore the origin of the length-dependent uncoupling of the CTPRm proteins, we investigated the dynamics of the CTPR4m25 variant compared with its CTPR4a wild-type parent using hydrogen/deuterium exchange mass spectroscopy (HDX MS). In this approach, the protein is incubated with deuterated buffer for different times, and the samples are subsequently protease-digested into small peptides. Mass spectrometry is used subsequently to map out the solvent-protection of the amide protons and thereby gives us information on local stabilities. Our consensus-designed repeat proteins pose a challenge due to their identical repeat sequences, i.e. internal and terminal repeats will yield the same peptides upon protease digestion. Therefore, to distinguish between the repeats a small number of different point mutations were introduced into CTPR4m25 and CTPR4a (Table S5); these mutant variants are referred to as CTPR4m25X and CTPR4aX, respectively. After H/D exchange and pepsin cleavage, four of the resulting peptides were selected as representative reporters, as they covered both helices of each repeat (Fig. 3b, Table S6). The deuterium uptake for each of the peptides in the two proteins is shown in Fig. 3a,c as butterfly plots. For all peptides, the plots of deuterium uptake versus time were fitted to a double-exponential to yield two rate constants (Fig. S4). Biphasic kinetics are expected for HDX MS data, as the kinetics reflect the average behaviour of the peptide fragment rather than the behaviour of individual amino acids. The faster phase corresponds to the rapid exchange of residues that are solvent-exposed in the native proteins. The data show that the repeats of CTPR4m25X all undergo faster exchange than the repeats of CTPR4a, reflecting the lower thermodynamic stability of the former protein. Second, the internal (second and third) repeats of CTPR4aX are more protected than the terminal repeats. This result is in agreement with previous HDX NMR studies on CTPR proteins and consensus-designed ankyrin-repeat proteins, which show that internal repeats have a greater degree of solvent protection than the outer repeats^32,33. Such behaviour reflects the hierarchical nature of repeat stability, whereby internal repeats are more stable than outer ones. In contrast, there is no such increase in protection for the multi-loop CTPR4m25X protein. The exchange rates of all repeats of CTPR4m25X are the same within error, and they can be fitted globally using shared rate constants for each of the two phases (Fig S5). The absence of enhanced protection of internal relative to terminal repeats shows that the insertion of loops causes a more dynamic native structure that undergoes more substantial local “breathing”. This conclusion is further supported by an increase in the relative amplitude of the fast phase for CTPR4m25X compared with CTPR4aX. Importantly, however, when chemically denatured CTPR4m25X undergoes H/D exchange there is very low protection in comparison to native CTPR4m25X (Figs 3d and S5). Thus, even though the inter-repeat interfaces of CTPR4m25 are more dynamic than wild-type, the global tertiary structure is retained. This conclusion is consistent with the fluorescence- and CD-monitored denaturation results.

Sequence-dependent effects of loop extension

There are two possible, mutually non-exclusive explanations for the increased dynamics/local unfolding of the internal repeats in the multi-loop CTPR proteins relative to the CTPRa proteins. One is the entropic cost of loop insertion on the inter-repeat interface. The other is that there are steric clashes between the loops and/or between the loops and the helical TPR core. Both would prevent the characteristic build-up in stability with increasing repeat number and would lead to a loss of cooperative folding. To investigate these possibilities, we next constructed two further sets of multi-loop CTPR proteins, one with a modified loop sequence and the other with alternating short (consensus) loops and extended loops.

We reasoned that, if the effects observed are due to steric clashes, they might be sequence dependent. We noted also that the CTPRm proteins showed a decrease in solubility with increasing number of repeats. Therefore, two hydrophobic residues within the 10-residue loop were substituted for aspartate residues (GSLVPRGS to GSDDPRGS) both to test the hypothesis and to aid solubility.

This new 10-residue multi-loop series (CTPRm10D comprising 2-, 3-, 4- and 6-repeat proteins) showed greatly improved solubility compared with the original CTPRm10 series. For example, CTPR6m10D expressed in the soluble fraction with yields of over 130 mg/L of culture (compared with negligible amounts of CTPR6m10 in the soluble fraction). All proteins were found to be folded (as shown by their far-UV CD spectra; Fig. S1c), and the equilibrium chemical denaturation curves monitored by fluorescence and CD were very similar to those of the respective parent (CTPRm10) proteins (Fig. 2, Table 1, Fig. S2 and Table S3). Thus, energetic cost of multi-loop extension does not appear to be sequence-dependent and is more likely to be an entropic effect.

Context dependence of loop extensions

As a further test of this model, we made a further multi-loop series in which alternate loops were extended to see the effect on the build-up of global stability with repeat number. This series should tell us whether the breakdown in the Ising-like behaviour of the CTPRm proteins is due solely to the entropic penalty of closing the long loop in order to form the inter-repeat interface or whether there is an additional effect of steric clashes between the loops. CTPR proteins with extensions in alternate inter-repeat loops were constructed. We used the CTPRm10D sequence, as this sequence had improved solubility relative to the original loop CTPRm10 sequence. Taking the CTPR2m10D as a “module” to be repeated twice and thrice, we created a CTPR4 protein with two extended loops (CTPR4alt10D) and a CTPR6 protein with three extended loops (CTPR6alt10D), respectively (Fig. 1e).

Fluorescence-monitored equilibrium denaturation experiments show that there is a significant increase in stability of the four-repeat array comprising two CTPR2m10D modules relative to the two-repeat array comprising only one module (Fig. 4a and Table S7). This behaviour is as expected given that CTPR4alt10D contains a native non-loop extended CTPR interface between the second and third repeats. Addition of a third module (i.e. CTPR6 with three extended loops) shows only a marginal further increase in stability. The CD-monitored denaturation curves reveal that there is a significant loss in ellipticity of CTPR4alt10D and CTPR6alt10D proteins before the major unfolding transition (Fig. 4c). This behaviour is consistent with the loop-extended inter-repeat interfaces unfolding at lower denaturant concentrations than the consensus interfaces. The midpoint of the major unfolding transition is same for fluorescence- and CD-monitored curves (Table S7).

Last, we compared a six-repeat protein having a single loop extension located between the first and second repeats (CTPR6L1-2) and compared this protein with a six-repeat protein having a single loop extension located between the third and fourth repeats (CTPR6L3-4, characterised previously in²⁹). The Ising model of build-up of stability with increasing number of repeats would predict that the internal loop extension would have the larger energetic cost. In Fig. 4 we compare the equilibrium denaturation curves of these two single-loop 6-repeat proteins with those of CTPR6a and CTPR2a (Fig. 4b,d) and the two alt-loop proteins (CTPR4alt10D and CTPR6alt10D) (Fig. 4a,c), monitored both by fluorescence (Fig. 4 top) and CD (Fig. 4 bottom). The fluorescence data indicate that CTPR6L3-4 is significantly destabilised relative to CTPR6a (~3.5 kcal mol⁻¹ from a two-state fit), whereas CTPR6L1-2 appears to have the same stability as CTPR6a within error (Table S7). However, the CD data reveal a more complex scenario for CTPR6L1-2: There is a loss of helical structure at low denaturant concentration, which is consistent with the unfolding of the first repeat resulting in an intermediate comprising five folded repeats. This partial unfolding is similar to what is seen for the alt-loop proteins and is consistent with the loop-extended inter-repeat interfaces unfolding at lower denaturant concentrations than the consensus interfaces.

Thus, although every loop in every protein in Fig. 4 has the same sequence and is located between repeats of identical sequence, the energetic cost of loop extension varies dramatically and is dependent on the “local stability” associated with its location in the array. We have used the two-state fits of the fluorescence-monitored denaturation curves to compile estimates of these context-specific energy penalties (Table 2). We find that the cost per loop extension is as little as ~0.5 kcal mol⁻¹ in a two-repeat array, versus 2–2.4 kcal mol⁻¹ for alternate loops (whether in four-repeat or six-repeat array), and 4 kcal mol⁻¹ for the innermost loop in a six-repeat array.

Table 2 Energetic cost of loop insertion is dependent on location and context within the repeat array.

Full size table

Discussion

Here we investigated the relationship between the changes in global stability, folding cooperativity and local dynamics that result from the introduction of long loops into repeat-protein arrays. The cooperativity of repeat protein folding is determined by the difference between the intrinsic folding free energies of the repeats (ΔG_i) and the interfacial free energies (ΔG_ij). Thus, proteins with intrinsically unstable repeats and very stable interfaces tend to unfold with high cooperativity²⁵. This is typically the situation for small naturally occurring repeat proteins and their consensus-derived counterparts. Nature may have selected for proteins with high cooperativity to avoid the population of partly folded intermediates that can lead to misfolding or aggregation. In contrast, Geiger-Schuller et. al. recently determined the Ising parameters of a collection of de novo Rosetta-designed repeat proteins³⁴. They found that all of these proteins had ΔG_i of −1.4 to −3.5 kcal mol⁻¹ and ΔG_ij of −4.8 to −10 kcal mol⁻¹. Thus, in contrast to Nature, Rosetta has optimised both the intrinsic and the interfacial free energies. The question of the balance between these two parameters is key to our study: A repeat array with low cooperativity will respond to interface disruption differently from a repeat array with high cooperativity. The consensus-designed TPR sequence used here has ΔG_i of −1 kcal mol⁻¹ and ΔG_ij of −3.7 kcal mol⁻¹, and these proteins are consequently less cooperatively folded than consensus-designed ankyrin repeats (ΔG_i of +4.4 kcal mol⁻¹ and ΔG_ij of −11.2 kcal mol⁻¹) and CTPR sequences used elsewhere (ΔG_i of +1.4 kcal mol⁻¹ and ΔG_ij of −4.3 kcal mol⁻¹). Loop insertion makes our rather low-cooperativity system even less cooperative but does not compromise the global native structure because the intrinsic repeat free energy, at −1 kcal mol⁻¹, is mildly stabilising and only a single interface (i.e. two repeats) is required for an independent folding unit. However, for proteins with intrinsically very unstable repeats that require more than two inter-repeat interfaces to form a stable unit, loop extension might prevent folding altogether.

Loop insertion reduces the interfacial free energy because of the entropic penalty of closing that loop^35,36,37. In the multi-loop CTPR series, the result is natively structured arrays that show little to no increase in global stability with increasing number of repeats (and these series are, consequently, not amenable to Ising analysis). Consistent with this behaviour, the HDX MS results show that the internal repeats are not more protected than the terminal repeats, in striking contrast to what is expected for repeat proteins^{7,32,33,38,39}.

In conclusion: (1) Proteins containing long sequence insertions at every inter-CTPR inter-repeat loop are folded and stable, but the multiple insertions reduce the cooperativity and thereby the build-up in thermodynamic stabilities with increasing number of repeats that characterise the repeat-protein class; (2) The energetic cost of loop insertion is highly context dependent because the local stability of each repeat within an array is also context dependent; consequently, the cost of an inserted loop determined for one CTPR array cannot be generalised to any repeat in any size of CTPR array but rather is dependent on the length of the array and the location of the insertion along the array. Importantly, the stability costs are unlikely to be prohibitive for future applications of loop-extended CTPR proteins because the CTPR scaffold start from such high global stability to begin with. For example, the most extreme case in our study, the multi-loop extended six-repeat protein CTPR6m10D is still a highly soluble and stable protein with a melting temperature of over 80 °C (Fig. S6) even though almost half of its polypeptide chain consists of inter-repeat loops. CTPR6ml10D is, therefore, more than adequate as a scaffold for biotechnology applications. We have pushed the cooperativity of a repeat protein to its absolute limit, and our findings provide us with the framework required to exploit this simple and modular architecture to build functional protein-based nanomaterials and to create designer molecular-recognition proteins in synthetic biology and medicine. It will be interesting to explore to what extent other helical repeat proteins are amenable to loop insertion. Given that the current work provides us with a set of guidelines with which we can anticipate the loop energetics, this should be relatively straightforward. We also look forward to exploring the limits of the approach in terms determining what is the maximum length of loop that can be inserted; the sequence composition will be a determining factor for very long loops, as we know that this has a profound effect on both the structure (degree of compactness) and the solubility of intrinsically disordered polypeptides⁴⁰.

Materials and Methods

Assembly of tandem-repeat protein genes from single repeat sequences

CTPRm25, CTPRm10 and CTPRm10D constructs

All genes were synthesised by GeneArt Invitrogen. Each construct contained BamHI and HindIII restriction sites for subcloning into pRSet for His-tag purification as previously described²⁹. The sequences are given in Table S1.

CTPRa and CTPRalt constructs

The tandem repeat arrays of CTPRa or CTPR10D (a single CTPR with the ‘10D’ loop sequence) were built by concatemerization of individual CTPRa or CTPR10D using BamHI and BglII sites as previously described^10,29. Briefly, a single consensus tetratricopeptide repeat (CTPRa1) was purchased as a short double-stranded DNA fragment and inserted into the T7-regulated expression vector pRSET B between the BamHI and HindIII restriction sites (ThermoFisher Scientific). The CTPRa1 fragment was then PCR-amplified using T7 promoter primers. The CTPRa1 PCR product and CTPRa1 pRSET B vector were then digested with BamHI/HindIII and BglII/HindIII restriction enzymes, respectively. The result is two concatamerized CTPRa1 genes froming a CTPRa2 (i.e. two-repeat array). The concatamerization of BamHI and BglII results in an Arg and a Ser after the highly conserved Pro31 of the CTPR sequence. As a result, the CTPRa2 contains the well-studied DPRS loop³⁰. This process can be repeated to generate CTPRa proteins of different lengths. Addition of the CTPR10D module generates the CTPRalt constructs. The sequences are given in Table S1.

Non-identical CTPR4aX and CTPR4m25X constructs for HDX MS

All constructs were commercially synthesised by IDT as “gBlocks” in the form of a two-repeats array with and without the loop extension. The two-repeat proteins were built up to four-repeat proteins following the concatemerization method used for CTPRa constructs. The sequences of CTPR4aX and CTPR4X are given in Table S5.

Protein purification

Protein purification of the CTPR proteins was carried as previously described²⁹. The 6xHis-tagged constructs were chemically transformed into competent E. coli C41 cells by heat shock. Colonies from a selective LB-Amp plate were grown in 2xYT media containing ampicillin (50 μg/mL) at 37 °C, 220 rpm until the optical density (O.D.) at 600 nm reached 0.6. Bacterial cultures were then induced overnight with IPTG (0.5 mM) for 16 h at 20 °C. For large-scale preparations (1 L cultures), cells were centrifugated at 3000 g (4 °C, 10 min) and resuspended in lysis buffer (10 mM sodium phosphate pH 7.4, 150 mM NaCl, 1 tablet of SIGMAFAST protease inhibitor cocktail (EDTA-free), and lysed on an Emulsiflex C5 homogenizer at 15000 psi. The insoluble fraction was separated by centrifugation at 15,000 g at 4 °C for 45 min. Ni-NTA beads were pre-washed once with lysis buffer before incubation with the supernatant from the cell lysate for 1 hr at 4 °C in batch. The loaded beads were washed thrice with phosphate buffer (10 mM sodium phosphate pH 7.4, 150 mM NaCl) containing 30 mM of imidazole to prevent nonspecific interactions. Protein were eluted with phosphate buffer containing 300 mM imidazole and further purified and buffer-exchanged by size-exclusion gel filtration using a HiLoad 16/60 Superdex G75 column (GE Life-Science) in phosphate buffer. Purity was checked by NuPage protein gel (Invitrogen) and proteins were flash-frozen and stored at −80 °C.

For small-scale preparations (15 ml culture), cells were pelleted by centrifugation at 3000 g (4 °C, 10 min) and resuspended in 1 ml of BugBuster Master Mix (Millipore). Ni-NTA were added to the supernatant from the cell lysate for 20 min at 4 °C in batch. The Ni-NTA beads were pre-washed thrice with phosphate buffer (1 mL) containing 30 mM of imidazole. Protein was eluted using phosphate buffer with 300 mM imidazole and dialysed against 50 mM sodium phosphate buffer pH 6.8, 150 mM NaCl. Inclusions bodies were purified from small scale preparations by resuspension of the insoluble pellet in 10 mM sodium phosphate buffer pH 7.4, 150 mM NaCl, 6 M GdmHCl.

Protein concentrations were determined using the calculated extinction coefficients (ExPASy ProtParam)⁴¹. Molecular weight and purity was confirmed by mass spectrometry (MALDI).

CD spectroscopy

Circular dichroism experiments were performed as previously described²⁹. Briefly, CD measurements were carried on a Chirascan CD spectrometer (Applied Photophysics) in 1 mm pathlength Precision Cells (110-QS, Hellma Analytics) at 25 °C. The CD spectra of all protein samples was measured between 200 nm to 280 nm wavelengths using a 1 nm of bandwidth. Proteins were studied in 50 mM sodium phosphate buffer pH 6.8, 150 mM NaCl at concentrations ranging 5–20 µM. Spectra were acquired at 1 nm intervals every 0.5 s; each reading was taken between three to five times, and the data were averaged.

Chemical denaturation experiments monitored by fluorescence

Equilibrium denaturation experiments monitored by fluorescence were carried as previously described⁴². In brief, plate measurements were taken on a CLARIOstar Plate Reader (BMG labtech) with a tryptophan detection filter set at 25 °C. Stock solutions of GdmHCl and were dispensed into Corning® 96-well, half area, black polystyrene plates (CLS3993) with a Microlab ML510B dispenser. The protein concentration used was between 0.3 µM and 1 µM. For each protein tested, three replicate sets of serial dilutions were plated consecutively. Final protein concentrations were 0.3–1 µM. Plates were covered with a Microplate Aluminium Sealing Tape and incubated at 25 °C for 1 h shaking.

Chemical denaturation monitored by CD

Equilibrium denaturation experiments monitored by CD was carried as previously described²⁹. The different GdmHCl concentrations were prepared by mixing the apporiate volumes of 50 mM sodium phosphate buffer pH 6.8, 150 mM NaCl, 7 M Gdm HCl and sodium phosphate buffer using a Hamilton Microlab ML510B. The protein concentration used was between 5 µM and 20 µM. Samples were equilibrated at 25 °C for 2 hours. The helical content was followed by changes in ellipticity at 222 nm.

Analysis of equilibrium denaturation curves

Data were analysed in three different ways: with a two-state model⁴³, with a homozipper Ising model or with a heteropolymer Ising model²⁵. For two-state analysis, the denaturation curves were fitted directly using Eq. 1.

$${{\rm{\lambda }}}_{obs}=\frac{{\alpha }_{N}+{\beta }_{N}[D]+({\alpha }_{D}+{\beta }_{D}[D]).\exp [{m}_{D-N}([D]-{[D]}_{50 \% })]/RT}{1+\exp [{m}_{D-N}([D]-{[D]}_{50 \% })]}$$

(1)

where λ_obs is the observed signal (fluorescence or CD), α_N and α_D are the intercepts, and β_N and β_D are the slopes of the baselines at low (N) and high (D) denaturant concentrations, respectively, [D_50%] is the midpoint of unfolding, [D] is the concentration of denaturant and m_D-N is a constant that is proportional to the increase in degree of exposure of the protein on denaturation. The free energy of unfolding in water, ${\Delta G}_{D-N}^{{H}_{2}O}$, can then be calculated using Eq. 2:

$${\Delta G}_{D-N}^{{H}_{2}O}={m}_{D-N}\cdot {[D]}_{50 \% }$$

(2)

To aid comparison of the different datasets, the fluorescence-monitored denaturation curves were normalised by converting each dataset to fraction unfolded (${{\rm{\lambda }}}_{{\rm{U}}}$) using Eq. 3:

$${{\rm{\lambda }}}_{{\rm{U}}}=\frac{{{\rm{\lambda }}}_{{\rm{obs}}}-({{\rm{\alpha }}}_{{\rm{N}}}+{{\rm{\beta }}}_{{\rm{N}}}[{\rm{D}}])}{({{\rm{\alpha }}}_{{\rm{D}}}-{{\rm{\alpha }}}_{{\rm{N}}})+({{\rm{\beta }}}_{{\rm{D}}}-{{\rm{\beta }}}_{{\rm{N}}})[{\rm{D}}]}$$

(3)

where α_D/α_N are the y-intercept values of the denatured/native baselines and β_D/β_N are the slopes of the denatured /native baselines. Similarly, CD-monitored denaturation curves were individually normalised using Eq. 4:

$${{\rm{\lambda }}}_{{\rm{U}}}=\frac{{{\rm{\lambda }}}_{{\rm{obs}}}-{{\rm{\alpha }}}_{{\rm{N}}}}{{{\rm{\alpha }}}_{{\rm{D}}}+({{\rm{\beta }}}_{{\rm{D}}}[{\rm{D}}])-{{\rm{\alpha }}}_{{\rm{N}}}}$$

(4)

where α_D and α_N are the y-intercept values of the denatured/native baselines and β_D is the slope of the denatured baseline. This equation allows the slope of the native baselines of the raw data to be preserved in the normalised data.

Ising model analysis of equilibrium denaturation curves

For the Ising analysis, each fluorescence monitored equilibrium denaturation curve was individually converted to fraction unfolded (${{\rm{\lambda }}}_{{\rm{U}}}$) using Eq. 3. CD-monitored equilibrium denaturation curves were individually normalised using Eq. 4. This equation allows the data to retain the slope of the native baseline and be globally fitted with either homozipper or heteropolymer Ising model. For all constructs, the slopes of their denatured baselines were not significant.

After normalization, the series of curves were globally fitted to either a homozipper or heteropolymer Ising model using the PyFolding package²⁷. Both the homozipper and heteropolymer Ising models were constructed as previously described^28,30. Briefly, each model comprises a one-dimensional linear series of equilibrium constants. These account for the intrinsic folding stability (ΔG_i) and the interfacial energy (ΔG_i−1,i) for each repeated unit in a nearest-neighbour TPR array. The intrinsic stability of the repeating unit has an associated coefficient (m) to represent its sensitivity to the external stimulus – in this case chemical denaturant. In the homozipper model all repeated units are identical, whereas in the heteropolymer model, different types of repeat unit can be incorporated.

Hydrogen/deuterium exchange mass spectrometry

Hydrogen/deuterium exchange mass spectrometry (HDX MS) was performed using a Waters Synapt G2Si equipped with nanoACQUITY UPLC system with H/DX technology and a LEAP autosampler. The final concentrations of proteins in each sample were 5 µM. For each deuteration time, 4 µL complex was equilibrated to 25 °C for 5 min and then mixed with 56 µL D₂O buffer 50 mM sodium phosphate buffer pH 6.8, 150 mM NaCl, for 0, 0.5, 1, 2, or 5 min. The exchange was quenched with an equal volume of quench solution (3 M guanidinium hydrochloride, 0.1% formic acid, pH 2.66). The quenched sample (50 μL) was injected into the sample loop, followed by digestion on an in-line pepsin column (immobilized pepsin, Pierce, Inc.) at 15 °C. The resulting peptides were captured at 0 °C on a BEH C18 Vanguard pre-column, separated by analytical chromatography (Acquity UPLC BEH C18, 1.0 × 50 mm, Waters Corporation) using a 7–85% gradient acetonitrile in 0.1% formic acid over 7.5 min, and electrosprayed into the Waters SYNAPT G2Si quadrupole time-of-flight mass spectrometer. The mass spectrometer settings and peptide identification methods have been reported previously⁴⁴. The experiments were performed in triplicate, and independent replicates of the triplicate experiment were performed to verify the results. The data were plotted using GraphPad Prism, and fitted to the sum of two exponential to obtain the exchange rates.

Data availability

All data are available from the corresponding author on reasonable request.

References

Tang, K. S., Guralnick, B. J., Wang, W. K., Fersht, A. R. & Itzhaki, L. S. Stability and folding of the tumour suppressor protein p16. J. Mol. Biol. 285, 1869–1886 (1999).
Article CAS PubMed Google Scholar
Lowe, A. R. & Itzhaki, L. S. Rational redesign of the folding pathway of a modular protein. Proc. Natl. Acad. Sci. USA 104, 2679–84 (2006).
Google Scholar
Serquera, D. et al. Mechanical unfolding of an ankyrin repeat protein. Biophys. J. 98, 1294–301 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Hutton, R. D. et al. Mapping the Topography of a Protein Energy Landscape. J. Am. Chem. Soc. 137, 14610–25 (2015).
Article CAS PubMed Google Scholar
Zweifel, M. E., Leahy, D. J., Hughson, F. M. & Barrick, D. Structure and stability of the ankyrin domain of the Drosophila Notch receptor. Protein Sci. 12, 2622–32 (2003).
Article CAS PubMed PubMed Central Google Scholar
Barrick, D., Ferreiro, D. U. & Komives, E. A. Folding landscapes of ankyrin repeat proteins: experiments meet theory. Curr. Opin. Struct. Biol. 18, 27–34 (2008).
Article CAS PubMed PubMed Central Google Scholar
Croy, C. H. et al. Biophysical characterization of the free I k B a ankyrin repeat domain in solution. Protein Sci. 13, 1767–1777 (2004).
Article CAS PubMed PubMed Central Google Scholar
Ferreiro, D. U., Cho, S. S., Komives, E. A. & Wolynes, P. G. The Energy Landscape of Modular Repeat Proteins: Topology Determines Folding Mechanism in the Ankyrin Family. J. Mol. Biol. 354, 679–692 (2005).
Article CAS PubMed Google Scholar
Parra, R. G., Espada, R., Verstraete, N. & Ferreiro, D. U. Structural and Energetic Characterization of the Ankyrin Repeat Protein Family. PLOS Comput. Biol. 11, e1004659 (2015).
Article ADS PubMed PubMed Central CAS Google Scholar
Kajander, T., Cortajarena, A. L., Main, E. R. G., Mochrie, S. G. J. & Regan, L. A new folding paradigm for repeat proteins. J. Am. Chem. Soc. 127, 10188–90 (2005).
Article CAS PubMed Google Scholar
Ferreiro, D. U., Walczak, A. M., Komives, E. A. & Wolynes, P. G. The Energy Landscapes of Repeat-Containing Proteins: Topology, Cooperativity, and the Folding Funnels of One-Dimensional Architectures. PLoS Comput. Biol. 4, e1000070 (2008).
Article ADS MathSciNet PubMed PubMed Central CAS Google Scholar
Main, E. R. G., Xiong, Y., Cocco, M. J., D’Andrea, L. & Regan, L. Design of stable alpha-helical arrays from an idealized TPR motif. Structure 11, 497–508 (2003).
Article CAS PubMed Google Scholar
Binz, H. K., Stumpp, M. T., Forrer, P., Amstutz, P. & Plückthun, A. Designing Repeat Proteins: Well-expressed, Soluble and Stable Proteins from Combinatorial Libraries of Consensus Ankyrin Repeat Proteins. J. Mol. Biol. 332, 489–503 (2003).
Article CAS PubMed Google Scholar
Javadi, Y. & Main, E. R. G. Exploring the folding energy landscape of a series of designed consensus tetratricopeptide repeat proteins. Proc. Natl. Acad. Sci. 106, 17383–17388 (2009).
Article ADS CAS PubMed Google Scholar
Grutter, M. G. et al. Designed to be stable: Crystal structure of a consensus ankyrin repeat protein. Proc. Natl. Acad. Sci. 100(4) 1700–05 (2003).
Article PubMed Google Scholar
Parmeggiani, F. et al. Designed armadillo repeat proteins as general peptide-binding scaffolds: consensus design and computational optimization of the hydrophobic core. J. Mol. Biol. 376, 1282–304 (2008).
Article CAS PubMed Google Scholar
MacDonald, J. T. et al. Synthetic beta-solenoid proteins with the fragment-free computational design of a beta-hairpin extension. Proc. Natl. Acad. Sci. USA 113, 10346–51 (2016).
Article CAS PubMed Google Scholar
Tripp, K. W. & Barrick, D. The Tolerance of a Modular Protein to Duplication and Deletion of Internal Repeats. J. Mol. Biol. 344, 169–178 (2004).
Article CAS PubMed Google Scholar
Kloss, E. & Barrick, D. C-terminal deletion of leucine-rich repeats from YopM reveals a heterogeneous distribution of stability in a cooperatively folded protein. Protein Sci. 18, 1948–1960 (2009).
Article CAS PubMed PubMed Central Google Scholar
Vieux, E. F. & Barrick, D. Deletion of internal structured repeats increases the stability of a leucine-rich repeat protein, YopM. Biophys. Chem. 159, 152–161 (2011).
Article CAS PubMed PubMed Central Google Scholar
Street, T. O., Bradley, C. M. & Barrick, D. Predicting coupling limits from an experimentally determined energy landscape. Proc. Natl. Acad. Sci. USA 104, 4907–12 (2007).
Article ADS PubMed CAS Google Scholar
Werbeck, N. D., Rowling, P. J. E., Chellamuthu, V. R. & Itzhaki, L. S. Shifting transition states in the unfolding of a large ankyrin repeat protein. Proc. Natl. Acad. Sci. USA 105, 9982–7 (2008).
Article ADS CAS PubMed Google Scholar
Tsytlonok, M., Sormanni, P., Rowling, P. J. E., Vendruscolo, M. & Itzhaki, L. S. Subdomain architecture and stability of a giant repeat protein. J. Phys. Chem. B 117, 13029–37 (2013).
Article CAS PubMed Google Scholar
Tripp, K. W. & Barrick, D. Rerouting the folding pathway of the Notch ankyrin domain by reshaping the energy landscape. J. Am. Chem. Soc. 130, 5681–8 (2008).
Article CAS PubMed PubMed Central Google Scholar
Aksel, T. & Barrick, D. Chapter 4 Analysis of Repeat-Protein Folding Using Nearest-Neighbor Statistical Mechanical Models. Methods in Enzymology 455, (Elsevier Inc., 2009).
Aksel, T., Majumdar, A. & Barrick, D. The contribution of entropy, enthalpy, and hydrophobic desolvation to cooperativity in repeat-protein folding. Structure 19, 349–60 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lowe, A. R., Perez-Riba, A., Itzhaki, L. S. & Main, E. R. G. PyFolding: Open-Source Graphing, Simulation, and Analysis of the Biophysical Properties of Proteins. Biophys. J. 114, 516–521 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Millership, C., Phillips, J. J. & Main, E. R. G. Ising Model Reprogramming of a Repeat Protein’s Equilibrium Unfolding Pathway. J. Mol. Biol. 428, 1804–1817 (2016).
Article CAS PubMed PubMed Central Google Scholar
Perez-Riba, A., Lowe, A. R., Main, E. R. G. & Itzhaki, L. S. Context-Dependent Energetics of Loop Extensions in a Family of Tandem-Repeat Proteins. Biophys. J. 114, 2552–2562 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Phillips, J. J., Javadi, Y., Millership, C. & Main, E. R. G. Modulation of the multistate folding of designed TPR proteins through intrinsic and extrinsic factors. Protein Sci. 21, 327–338 (2012).
Article CAS PubMed Google Scholar
Kajander, T., Cortajarena, A. L., Mochrie, S. & Regan, L. Structure and stability of designed TPR protein superhelices: unusual crystal packing and implications for natural TPR proteins. Acta Crystallogr. Sect. D Biol. Crystallogr. 63, 800–811 (2007).
Article CAS Google Scholar
Main, E. R. G., Stott, K., Jackson, S. E. & Regan, L. Local and long-range stability in tandemly arrayed tetratricopeptide repeats. Proc. Natl. Acad. Sci. USA 102, 5721–6 (2005).
Article ADS CAS PubMed Google Scholar
Wetzel, S. K. et al. Residue-Resolved Stability of Full-Consensus Ankyrin Repeat Proteins Probed by NMR. J. Mol. Biol. 402, 241–258 (2010).
Article CAS PubMed Google Scholar
Geiger-Schuller, K. et al. Extreme stability in de novo-designed repeat arrays is determined by unusually stable short-range interactions. Proc. Natl. Acad. Sci. 115, 7539–7544 (2018).
Article CAS PubMed Google Scholar
Ladurner, A. G. & Fersht, A. R. Glutamine, alanine or glycine repeats inserted into the loop of a protein have minimal effects on stability and folding rates. J. Mol. Biol. 273, 330–7 (1997).
Article CAS PubMed Google Scholar
Nagi, A. D. & Regan, L. An inverse correlation between loop length and stability in a four-helix-bundle protein. Fold. Des. 2, 67–75 (1997).
Article CAS PubMed Google Scholar
Nagi, A. D., Anderson, K. S. & Regan, L. Using loop length variants to dissect the folding pathway of a four-helix-bundle protein. J. Mol. Biol. 286, 257–65 (1999).
Article CAS PubMed Google Scholar
Cortajarena, A. L., Mochrie, S. G. J. & Regan, L. Mapping the energy landscape of repeat proteins using NMR-detected hydrogen exchange. J. Mol. Biol. 379, 617–26 (2008).
Article CAS PubMed PubMed Central Google Scholar
Ferreiro, D. U. et al. Stabilizing IΚΒα by “consensus” design. J. Mol. Biol. 365, 1201–16 (2007).
Article CAS PubMed Google Scholar
Mittal, A., Holehouse, A. S., Cohan, M. C. & Pappu, R. V. Sequence-to-Conformation Relationships of Disordered Regions Tethered to Folded Domains of Proteins. J. Mol. Biol. 430, 2403–2421 (2018).
Article CAS PubMed Google Scholar
Gasteiger, E. et al. Protein Identification and Analysis Tools on the ExPASy Server. In The Proteomics Protocols Handbook 112, 571–607 (Humana Press, 2005).
Perez-Riba, A. & Itzhaki, L. S. A method for rapid high-throughput biophysical analysis of proteins. Sci. Rep. 7, 9071 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Jackson, S. E. & Fersht, A. R. Folding of chymotrypsin inhibitor 2. 1. Evidence for a two-state transition. Biochemistry 30, 10428–35 (1991).
CAS PubMed Google Scholar
Narang, D., Chen, W., Ricci, C. G. & Komives, E. A. RelA-Containing NFκB Dimers Have Strikingly Different DNA-Binding Cavities in the Absence of DNA. J. Mol. Biol. 430, 1510–1520 (2018).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

L.S.I. acknowledges the support of a Senior Fellowship from the UK Medical Research Foundation. A.P. was supported by a BBSRC Doctoral Training Programme scholarship and an Oliver Gatty Studentship. L.S.I. and E.R.M. acknowledge support from a Leverhulme Trust Project Grant.

Author information

Albert Perez-Riba
Present address: Donnelly Centre for Cellular & Biomolecular Research, University of Toronto, Toronto, Canada

Authors and Affiliations

Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1PD, UK
Albert Perez-Riba & Laura S. Itzhaki
Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0378, USA
Elizabeth Komives
School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London, E1 4NS, UK
Ewan R. G. Main

Authors

Albert Perez-Riba
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Komives
View author publications
You can also search for this author in PubMed Google Scholar
Ewan R. G. Main
View author publications
You can also search for this author in PubMed Google Scholar
Laura S. Itzhaki
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.P. and L.S.I. conceived the project. A.P., E.R.G.M. and L.S.I. designed experiments. A.P. performed the experiments. A.P. and E.R.G.M. analysed the data. A.P. and E.K. designed, performed and analysed the HDX mass spectrometry experiments. A.P., E.R.G.M. and L.S.I. wrote the main manuscript text. All authors reviewed the manuscript.

Corresponding author

Correspondence to Laura S. Itzhaki.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Perez-Riba, A., Komives, E., Main, E.R.G. et al. Decoupling a tandem-repeat protein: Impact of multiple loop insertions on a modular scaffold. Sci Rep 9, 15439 (2019). https://doi.org/10.1038/s41598-019-49905-4

Download citation

Received: 04 July 2019
Accepted: 29 August 2019
Published: 28 October 2019
DOI: https://doi.org/10.1038/s41598-019-49905-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.