skip to main content
10.1145/565196.565209acmconferencesArticle/Chapter ViewAbstractPublication PagesrecombConference Proceedingsconference-collections
Article

Structural alignment of large—size proteins via lagrangian relaxation

Published:18 April 2002Publication History

ABSTRACT

We illustrate a new approach to the Contact Map Overlap problem for the comparison of protein structures. The approach is based on formulating the problem as an integer linear program and then relaxing in a Lagrangian way a suitable set of constraints. This relaxation is solved by computing a sequence of simple alignment problems, each in quadratic time, and near--optimal Lagrangian multipliers are found by subgradient optimization. By our approach we achieved a substantial speedup over the best existing methods. We were able to solve optimally for the first time instances for PDB proteins with about 1000 residues and 2000 contacts. Moreover, within a few hours we compared 780 pairs in a testbed of 40 large proteins, finding the optimal solution in 150 cases. Finally, we compared 10,000 pairs of proteins from a test set of 269 proteins in the literature, which took a couple of days on a PC.

References

  1. W.P. Adams and H.D. Sherali, A Tight Linearization and an Algorithm for Zero-One Quadratic Programming Problems, Management Science 32 (1986) 1274--1290. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. E. Balas, S. Ceria, and G. Cornuéjols, A Lift-and-Project Cutting Plane Algorithm for Mixed 0-1 Programs, Mathematical Programming 58 (1993) 295--324. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. L. Blundell, Structure-Based Drug Design, Nature 384 (1996) 23--26.Google ScholarGoogle ScholarCross RefCross Ref
  4. C. Branden and J. Tooze, Introduction to Protein Structure, Garland, 1999.Google ScholarGoogle Scholar
  5. A. Caprara, M. Fischetti and P. Toth, A Heuristic Method for the Set Covering Problem, Operations Research 47 (1999) 730--743. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Caprara, D. Pisinger and P. Toth, Exact Solution of the Quadratic Knapsack Problem, INFORMS J. on Comput., 11 (1999) 125-137. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Carraresi and F. Malucelli, A Reformulation Scheme and New Lower Bounds for the QAP, in P.M. Pardalos and H. Wolkowicz (eds.), Quadratic Assignment and Related Problems, DIMACS series in Discrete Mathematics and Theoretical Computer Science, AMS Press, 147--160, 1994.Google ScholarGoogle Scholar
  8. I. Eidhammer and I. Jonassen and W. R. Taylor, Structure Comparison and Structure Prediction, to appear J. Comp. Biol.Google ScholarGoogle Scholar
  9. D. Fischer et al. CAFASP-1: Critical Assessment of Fully Automated Structure Prediction Methods. Proteins Suppl. 3 (1999) 209--217.Google ScholarGoogle Scholar
  10. M.L. Fisher, The Lagrangian Relaxation Method for Solving Integer Programming Problems, Management Science 27 (1981) 1--18.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Godzik and J. Skolnick, Flexible Algorithm for Direct Multiple Alignment of Protein Structures and Sequences, CABIOS 10 (1994) 587--596.Google ScholarGoogle Scholar
  12. A. Godzik, J. Sklonick and A. Kolinski, A Topology Fingerprint Approach to Inverse Protein Folding Problem, J. Mol. Biol. 227 (1992) 227--238.Google ScholarGoogle ScholarCross RefCross Ref
  13. D. Goldman, S. Istrail and C. Papadimitriou, Algorithmic Aspects of Protein Structure Similarity, Proceedings of the 40th IEEE Symposium on Foundations of Computer Science, 512--522, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Gusfield, Algorithms on Strings, Trees and Sequences: Computer Science and Computational Biology, Cambridge University Press, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. F. Havel, I. D. Kuntz and G. M. Crippen, Effect of Distance Constraints on Macromolecular Conformation, Biopolymers 18 (1979).Google ScholarGoogle Scholar
  16. M. Held and R. M. Karp, The Traveling Salesman Problem and Minimum Spanning Trees: Part II, Mathematical Programming 1 (1971) 6--25.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. L. Holm and C. Sander, Mapping the Protein Universe, Science 273 (1996) 595--602.Google ScholarGoogle Scholar
  18. L. Holm and C. Sander, Protein Structure Comparison by Alignment of Distance Matrices, J. Mol. Biol. 233 (1993) 123--138.Google ScholarGoogle ScholarCross RefCross Ref
  19. W. Kabash, A Solution for the Best Rotation to Relate Two Sets of Vectors, Acta Cryst. A32 (1978) 922--923.Google ScholarGoogle Scholar
  20. G. Lancia, R. Carr, B. Walenz and S. Istrail, 101 Optimal PDB Structure Alignments: a Branch-and-Cut Algorithm For The Maximum Contact Map Overlap Problem, Proc. 5th RECOMB (2001) 193--201. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. C. Lemmen and T. Lengauer, Computational Methods for the Structural Alignment of Molecules, Journal of Computer--Aided Molecular Design 14 (2000) 215--232.Google ScholarGoogle ScholarCross RefCross Ref
  22. H. P. Lenhof, K. Reinert, M. Vingron, A Polyhedral Approach to RNA Sequence Structure Alignment, J. Comp. Biol., 5 (1998) 517--530.Google ScholarGoogle ScholarCross RefCross Ref
  23. L. Lovász and A. Schrijver, Cones of Matrices and Set-Functions and 0-1 Optimization, SIAM Journal on Optimization 1 (1991) 166--190.Google ScholarGoogle Scholar
  24. L. Mirny and E. Domany, Protein Fold Recognition and Dynamics in the Space of Contact Maps, Proteins 26 (1996) 391--410.Google ScholarGoogle ScholarCross RefCross Ref
  25. K. Mizuguchi, C. M. Deane, T. L. Blundell, and J. P. Overington, HOMSTRAD: a Database of Protein Structure Alignments for Homologous Families, Protein Sci. 7(11) (1998) 2469--2471.Google ScholarGoogle ScholarCross RefCross Ref
  26. J. Moult, T. Hubbard, K. Fidelis, and J. Pedersen, Critical Assessment of Methods of Protein Structure Prediction (CASP): Round III, Proteins Suppl. 3 (1999) 2--6.Google ScholarGoogle Scholar
  27. A. G. Murzin, S. E. Brenner, T. Hubbard and C. Chothia, SCOP: a Structural Classification of Proteins Database for the Investigation of Sequences and Structures, J. Mol. Biol. 247 (1995) 536--540.Google ScholarGoogle ScholarCross RefCross Ref
  28. G. L. Nemhauser and L. Wolsey, Integer and Combinatorial Optimization, John Wiley and Sons, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. N. Siew, A. Elofsson, L. Rychlewski and D. Fischer, MaxSub: An Automated Measure for the Assessment of Protein Structure Prediction Quality, Bioinformatics 16 (2000) 776--785.Google ScholarGoogle Scholar
  30. R. Sowdhamini, D. F. Burke, J. F. Huang, K. Mizuguchi, H. A. Nagarajaram, N. Srinivasan, R. E. Steward and T. L. Blundell, CAMPASS: a Database of Structurally Aligned Protein Superfamilies, Structure 6(9) (1998) 1087--1094.Google ScholarGoogle Scholar
  31. M. Vendruscolo, E. Kussell and E. Domany, Recovery of Protein Structure from Contact Maps, Fold. Des. 2 (1997) 295--306.Google ScholarGoogle Scholar

Index Terms

  1. Structural alignment of large—size proteins via lagrangian relaxation

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              RECOMB '02: Proceedings of the sixth annual international conference on Computational biology
              April 2002
              341 pages
              ISBN:1581134983
              DOI:10.1145/565196

              Copyright © 2002 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 18 April 2002

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              RECOMB '02 Paper Acceptance Rate35of118submissions,30%Overall Acceptance Rate148of538submissions,28%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader