Use of the UNRES force field in template-assisted prediction of protein structures and the refinement of server models: Test with CASP12 targets

https://doi.org/10.1016/j.jmgm.2018.05.008Get rights and content

Highlights

  • Hybrid (physics- and template-based) approach to protein structure prediction.

  • Molecular dynamics with coarse-grained UNRES model, restraints from templates.

  • Restraints derived from similar fragments of multiple templates.

  • Method able filter out information from poor templates (70% confidence).

  • Outstanding prediction obtained for some targets.

Abstract

Knowledge-based methods are, at present, the most effective ones for the prediction of protein structures; however, their results heavily depend on the similarity of a target sequence to those of proteins with known structures. On the other hand, the physics-based methods, although still less accurate and more expensive to execute, are independent of databases and give reasonable results where the knowledge-based methods fail because of weak sequence similarity. Therefore, a plausible approach seems to be the use of knowledge-based methods to determine the sections of the structures that correspond to sufficient sequence similarity and physics-based methods to determine the remaining structure. By participating in the 12th Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP12) as the KIAS-Gdansk group, we tested our recently developed hybrid approach, in which protein-structure prediction is carried out by using the physics-based UNRES coarse-grained energy function, with restraints derived from the server models. Best predictions among all groups were obtained for 2 targets and 80% of our models were in the upper 50% of the models submitted to CASP. Our method was also able to exclude, with about 70% confidence, the information from the servers that performed poorly on a given target. Moreover, the method resulted in the best models of 2 refinement targets and performed remarkably well on oligomeric targets.

Introduction

Prediction of protein structure from amino-acid sequence still remains a major challenge of structural biology, because of still insufficient supply of experimental structures and because the methods developed do not provide sufficiently accurate structures in all situations [1]. While knowledge-based methods, which are largely based on sequence-structure similarity, perform well when good templates with sufficiently similar sequences can be found in structural databases, their reliability decreases dramatically with decreasing sequence similarity. The biannual Critical Assessment of Techniques for Protein Structure Prediction (CASP) experiments [2] demonstrate that there are about 10% of targets for which none of the methods available provides a structure close enough to the experimental structure. The physics-based approaches, which are based on finding the structures with the lowest free energy given an appropriate force field and conformational-search method and are, therefore, not database-dependent, could be expected to handle the cases that are not covered by knowledge-based approaches but the existing force fields are not yet sufficiently accurate.

In our recent work [3,4] we proposed a hybrid approach, which combines the best features of the knowledge-based and physics-based methodology. The approach starts from selecting top models from selected servers, which are then converted into distance- and angle-restraints and, subsequently, large-scale multiplexed replica-exchange molecular dynamics (MREMD) [5] simulations with the physics-based coarse-grained UNRES force field (ref 6 and references therein) are carried out. UNRES applied in ab initio mode has already proved to be good in predicting protein structures, scoring considerable success in the past CASP exercises [7,8]. In particular, it was found that UNRES can predict correct domain packing, resulting in models better in overall topology than those found by template-based methods even for targets of the template-based category [7]. Therefore, in our recent work [4], we proposed an approach in which restraints are imposed only on those fragments of the structure that are similar in all server models; in particular, domain packing is unrestrained in this procedure. This fragment-based approach is new compared to those applied in, e.g., MODELLER [9] or MULTICOM [10], in which restraints derived from whole models are imposed. When the models are diverse in general but share common fragments, it can be expected that the structures of these fragments are predicted with much higher confidence in general than the arrangement of these fragments and the rest of the structure. Therefore, it seems reasonable to impose restraints on the common fragments only. Use of a coarse-grained force field is of advantage in this regard because of lower cost of energy and force evaluation and more extensive search of the conformational space, owing to the reduction of the number of degrees of freedom, even though details of the structure are lost. The test results with CASP11 targets suggested that such an approach gives much better results compared to imposing restraints on whole structures. In this work, we tested the approach in the CASP12 experiment, obtaining a number of very good predictions, two of which (for targets T0892 and T0942) were first-ranked and 80% of models being in the upper 50% of models. Apart from the regular prediction (“T0” type targets) category, our KIAS-Gdansk group also participated in the refinement, oligomer, and data-assisted predictions. Our performance in the data-assisted prediction category has been published as a separate paper [11], while this paper reports our results obtained in the regular, refinement, and oligomer prediction category.

Section snippets

Overview of the prediction protocol

The procedure developed in our earlier work [3,4] is illustrated in Fig. 1. The protocol starts from selecting server models to derive the restraints. As mentioned in the Introduction, only the fragments of the structure that are similar to all server models are restrained. As can be seen from Fig. 1, we used models from only the four servers that were top-performing servers in CASP11. This choice was motivated by our previous work [4], according to which the quality of the models to derive

Results

In this section we describe the performance of the KIAS-Gdansk group in the regular 3D prediction (single-chain “T0” type targets), refinement (“TR” type targets), and oligomer prediction categories. The rankings, the measures of model quality, as well as the GDT_TS plots were taken from the official CASP12 page (http://predictioncenter.org/casp12/index.cgi).

Discussion and conclusions

The purpose of our participation in the CASP12 exercise was to assess the performance and the added value of our structure-prediction methods, in which we use geometric restraints from the BAKER-ROSETTASERVR, GOAL, Zhang-server and QUARK server models. As demonstrated in section Comparison of the KIAS-Gdansk models with the parent server models, our method eliminates the information from the poorest ‘Model 1’ server predictions with about 70% confidence. Given the fact that the four servers

Acknowledgments

This work was supported by grants DEC-2013/10/M/ST4/00640 and DEC-2015/17/N/ST4/03937 from the National Science Center of Poland (Narodowe Centrum Nauki). KJ and JL were supported by the National Research Foundation of Korea grant funded by the Korea government (MEST) (No. 2008-0061987). Calculations were carried out using the computational resources provided by (a) the supercomputer resources at the Informatics Center of the Metropolitan Academic Network (CI TASK) in Gdańsk, (b) the

References (36)

  • P. Krupa et al.

    Performance of protein-structure predictions with the physics-based UNRES force field in CASP11

    Bioinformatics

    (2016)
  • A. Fiser et al.

    MODELLER: generation and refinement of homology-based protein structure models

  • R. Cao et al.

    Large-scale model quality assessment for improving protein tertiary structure prediction

    Bioinformatics

    (2015)
  • A. Karczyńska et al.

    Prediction of protein structure with the coarse-grained UNRES force field assisted by small X-ray scattering data and knowledge-based information

    Proteins: Struct. Func. Bioinfo.

    (2017)
  • C. Czaplewski et al.

    Application of multiplexing replica exchange molecular dynamics method to the UNRES force field: tests with α and α+β proteins

    J. Chem. Theor. Comput.

    (2009)
  • Y. He et al.

    Exploring the parameter space of the coarse-grained UNRES force field by random search: selecting a transferable medium-resolution force field

    J. Comput. Chem.

    (2009)
  • A.K. Sieradzan et al.

    Physics-based potentials for the coupling between backbone- and side-chain-local conformational states in the united residue (UNRES) force field for protein simulations

    J. Chem. Theor. Comput.

    (2015)
  • S. Kumar et al.

    The weighted histogram analysis method for free-energy calculations on biomolecules. I. The method

    J. Comput. Chem.

    (1992)
  • Cited by (17)

    • Modeling protein structures with the coarse-grained UNRES force field in the CASP14 experiment

      2021, Journal of Molecular Graphics and Modelling
      Citation Excerpt :

      The upgraded UNRES force field was calibrated with 9 training proteins of various structural classes [26]; this recent version has been termed the NEWCT-9P force field. This force field has already been tested in CASP13, demonstrating significant improvement over the previous versions of UNRES in the ab initio, as well as bioinformatics- and data-assisted prediction [27–31]. We used our prediction protocol [27], which is based on Multiplexed Replica Exchange Molecular Dynamics (MREMD) [32] simulations with UNRES [33].

    • Scale-consistent approach to the derivation of coarse-grained force fields for simulating structure, dynamics, and thermodynamics of biopolymers

      2020, Progress in Molecular Biology and Translational Science
      Citation Excerpt :

      We also implemented88 the Conformational Space Annealing (CSA) genetic algorithm,89 which is aimed at finding the lowest-energy conformations, in UNRES90 and NARES-2P.91 From the early stages of its development, UNRES was applied in protein structure prediction within the framework of the Community Wide Experiments on the Critical Assessment of Techniques for protein structure prediction (CASP), achieving success several times in the ab initio51,78,88,92 and, recently, data-assisted prediction category93; it also was one of the component of the WeFold coopetition within CASP.94,95 UNRES was also applied in studying protein-folding and assembly pathways and kinetics,96–103 including those involving the formation of disulfide bonds,76 free-energy landscapes,104–107 and in a variety of biology-related studies, including amyloid formation,108–111 signaling,112 Hsp70 chaperone cycle,113 iron-sulfur cluster biogenesis,114 stability of proteins from H. pylori,115 and structure and dynamics of natively unfolded proteins.116

    • Evaluation of the scale-consistent UNRES force field in template-free prediction of protein structures in the CASP13 experiment

      2019, Journal of Molecular Graphics and Modelling
      Citation Excerpt :

      The purpose of assisted prediction attempts was to assess how much UNRES performance can be improved when sparse structural data are added. Another UNRES-based prediction method developed in our lab [18], which makes explicit use of multiple server models, was also tested in CASP13 and its results will be published separately. In what follows we describe the methodology used in prediction (section 2), including the prediction protocol (subsection 2.1), the new scale-consistent version of UNRES (subsection 2.3), and the restraints used in simulations (subsection 2.4).

    • Pragmatic Coarse-Graining of Proteins: Models and Applications

      2023, Journal of Chemical Theory and Computation
    View all citing articles on Scopus
    View full text