ABSTRACT
We illustrate a new approach to the Contact Map Overlap problem for the comparison of protein structures. The approach is based on formulating the problem as an integer linear program and then relaxing in a Lagrangian way a suitable set of constraints. This relaxation is solved by computing a sequence of simple alignment problems, each in quadratic time, and near--optimal Lagrangian multipliers are found by subgradient optimization. By our approach we achieved a substantial speedup over the best existing methods. We were able to solve optimally for the first time instances for PDB proteins with about 1000 residues and 2000 contacts. Moreover, within a few hours we compared 780 pairs in a testbed of 40 large proteins, finding the optimal solution in 150 cases. Finally, we compared 10,000 pairs of proteins from a test set of 269 proteins in the literature, which took a couple of days on a PC.
- W.P. Adams and H.D. Sherali, A Tight Linearization and an Algorithm for Zero-One Quadratic Programming Problems, Management Science 32 (1986) 1274--1290. Google ScholarDigital Library
- E. Balas, S. Ceria, and G. Cornuéjols, A Lift-and-Project Cutting Plane Algorithm for Mixed 0-1 Programs, Mathematical Programming 58 (1993) 295--324. Google ScholarDigital Library
- T. L. Blundell, Structure-Based Drug Design, Nature 384 (1996) 23--26.Google ScholarCross Ref
- C. Branden and J. Tooze, Introduction to Protein Structure, Garland, 1999.Google Scholar
- A. Caprara, M. Fischetti and P. Toth, A Heuristic Method for the Set Covering Problem, Operations Research 47 (1999) 730--743. Google ScholarDigital Library
- A. Caprara, D. Pisinger and P. Toth, Exact Solution of the Quadratic Knapsack Problem, INFORMS J. on Comput., 11 (1999) 125-137. Google ScholarDigital Library
- P. Carraresi and F. Malucelli, A Reformulation Scheme and New Lower Bounds for the QAP, in P.M. Pardalos and H. Wolkowicz (eds.), Quadratic Assignment and Related Problems, DIMACS series in Discrete Mathematics and Theoretical Computer Science, AMS Press, 147--160, 1994.Google Scholar
- I. Eidhammer and I. Jonassen and W. R. Taylor, Structure Comparison and Structure Prediction, to appear J. Comp. Biol.Google Scholar
- D. Fischer et al. CAFASP-1: Critical Assessment of Fully Automated Structure Prediction Methods. Proteins Suppl. 3 (1999) 209--217.Google Scholar
- M.L. Fisher, The Lagrangian Relaxation Method for Solving Integer Programming Problems, Management Science 27 (1981) 1--18.Google ScholarDigital Library
- A. Godzik and J. Skolnick, Flexible Algorithm for Direct Multiple Alignment of Protein Structures and Sequences, CABIOS 10 (1994) 587--596.Google Scholar
- A. Godzik, J. Sklonick and A. Kolinski, A Topology Fingerprint Approach to Inverse Protein Folding Problem, J. Mol. Biol. 227 (1992) 227--238.Google ScholarCross Ref
- D. Goldman, S. Istrail and C. Papadimitriou, Algorithmic Aspects of Protein Structure Similarity, Proceedings of the 40th IEEE Symposium on Foundations of Computer Science, 512--522, 1999. Google ScholarDigital Library
- D. Gusfield, Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology, Cambridge University Press, 1997. Google ScholarDigital Library
- T. F. Havel, I. D. Kuntz and G. M. Crippen, Effect of Distance Constraints on Macromolecular Conformation, Biopolymers 18 (1979).Google Scholar
- M. Held and R. M. Karp, The Traveling Salesman Problem and Minimum Spanning Trees: Part II, Mathematical Programming 1 (1971) 6--25.Google ScholarDigital Library
- L. Holm and C. Sander, Mapping the Protein Universe, Science 273 (1996) 595--602.Google Scholar
- L. Holm and C. Sander, Protein Structure Comparison by Alignment of Distance Matrices, J. Mol. Biol. 233 (1993) 123--138.Google ScholarCross Ref
- W. Kabash, A Solution for the Best Rotation to Relate Two Sets of Vectors, Acta Cryst. A32 (1978) 922--923.Google Scholar
- G. Lancia, R. Carr, B. Walenz and S. Istrail, 101 Optimal PDB Structure Alignments: a Branch-and-Cut Algorithm For The Maximum Contact Map Overlap Problem, Proc. 5th RECOMB (2001) 193--201. Google ScholarDigital Library
- C. Lemmen and T. Lengauer, Computational Methods for the Structural Alignment of Molecules, Journal of Computer--Aided Molecular Design 14 (2000) 215--232.Google ScholarCross Ref
- H. P. Lenhof, K. Reinert, M. Vingron, A Polyhedral Approach to RNA Sequence Structure Alignment, J. Comp. Biol., 5 (1998) 517--530.Google ScholarCross Ref
- L. Lovász and A. Schrijver, Cones of Matrices and Set-Functions and 0-1 Optimization, SIAM Journal on Optimization 1 (1991) 166--190.Google Scholar
- L. Mirny and E. Domany, Protein Fold Recognition and Dynamics in the Space of Contact Maps, Proteins 26 (1996) 391--410.Google ScholarCross Ref
- K. Mizuguchi, C. M. Deane, T. L. Blundell, and J. P. Overington, HOMSTRAD: a Database of Protein Structure Alignments for Homologous Families, Protein Sci. 7(11) (1998) 2469--2471.Google ScholarCross Ref
- J. Moult, T. Hubbard, K. Fidelis, and J. Pedersen, Critical Assessment of Methods of Protein Structure Prediction (CASP): Round III, Proteins Suppl. 3 (1999) 2--6.Google Scholar
- A. G. Murzin, S. E. Brenner, T. Hubbard and C. Chothia, SCOP: a Structural Classification of Proteins Database for the Investigation of Sequences and Structures, J. Mol. Biol. 247 (1995) 536--540.Google ScholarCross Ref
- G. L. Nemhauser and L. Wolsey, Integer and Combinatorial Optimization, John Wiley and Sons, 1988. Google ScholarDigital Library
- N. Siew, A. Elofsson, L. Rychlewski and D. Fischer, MaxSub: An Automated Measure for the Assessment of Protein Structure Prediction Quality, Bioinformatics 16 (2000) 776--785.Google Scholar
- R. Sowdhamini, D. F. Burke, J. F. Huang, K. Mizuguchi, H. A. Nagarajaram, N. Srinivasan, R. E. Steward and T. L. Blundell, CAMPASS: a Database of Structurally Aligned Protein Superfamilies, Structure 6(9) (1998) 1087--1094.Google Scholar
- M. Vendruscolo, E. Kussell and E. Domany, Recovery of Protein Structure from Contact Maps, Fold. Des. 2 (1997) 295--306.Google Scholar
Index Terms
- Structural alignment of large—size proteins via lagrangian relaxation
Recommendations
Successive Lagrangian relaxation algorithm for nonconvex quadratic optimization
Optimization problems whose objective function and constraints are quadratic polynomials are called quadratically constrained quadratic programs (QCQPs). QCQPs are NP-hard in general and are important in optimization theory and practice. There have been ...
A Lagrangian---DNN relaxation: a fast method for computing tight lower bounds for a class of quadratic optimization problems
We propose an efficient computational method for linearly constrained quadratic optimization problems (QOPs) with complementarity constraints based on their Lagrangian and doubly nonnegative (DNN) relaxation and first-order algorithms. The simplified ...
On Lagrangian Relaxation of Quadratic Matrix Constraints
Quadratically constrained quadratic programs (QQPs) play an important modeling role for many diverse problems. These problems are in general NP hard and numerically intractable. Lagrangian relaxations often provide good approximate solutions to these ...
Comments