ABSTRACT
Precise representations of protein surfaces are extremely useful when studying protein interactions and properties. Given their simplicity and ability to represent geometrical and physicochemical properties of proteins, voxelised surface representations have received a lot of interest in bioinformatics and computational biology applications such as interaction interface prediction, ligand-binding pocket prediction and both protein--ligand and protein--protein docking. Computing voxelised surfaces for large proteins can be challenging, as space-demanding data structures with associated high computational costs are required. This paper presents a fast, OpenMP-based parallel algorithm for the computation of the voxelised representation of the three main protein surfaces (van der Waals, solvent-accessible and solvent-excluded) at high-resolutions. The solvent-excluded surface computation is based on a region-growing implementation of the approximate Euclidean Distance Transform algorithm with Hierarchical Queues. The geometrical relationship between the solvent-accessible and solvent-excluded surfaces allows us to obtain the latter very efficiently by computing distance map values only for a small subset of the overall voxels representing the protein. The algorithm computes the contribution to the overall outer surface for each atom in parallel. The proposed methodology was experimentally compared to two previous MPI-based parallel implementations showing overall better speedup and efficiency metrics as well as lower computation times.
- Laurent-Philippe Albou, Benjamin Schwarz, Olivier Poch, Jean Marie Wurtz, and Dino Moras. 2009. Defining and characterizing protein surface using alpha shapes. Proteins: Structure, Function, and Bioinformatics 76, 1 (2009), 1--12.Google ScholarCross Ref
- Paul A Bash, Nagarajan Pattabiraman, Conrad Huang, Thomas E Ferrin, and Robert Langridge. 1983. Van der Waals surfaces in molecular modeling: implementation with real-time computer graphics. Science 222, 4630 (1983), 1325--1327.Google Scholar
- Helen M. Berman, John Westbrook, Zukang Feng, Gary Gilliland, T. N. Bhat, Helge Weissig, Ilya N. Shindyalov, and Philip E. Bourne. 2000. The Protein Data Bank. Nucleic Acids Research 28, 1 (2000), 235--242.Google ScholarCross Ref
- Mary Ellen Bock, Guido M Cortelazzo, Carlo Ferrari, and Concettina Guerra. 2005. Identifying similar surface patches on proteins using a Spin-Image surface representation. In Combinatorial Pattern Matching, Alberto Apostolico, Maxime Crochemore, and Kunsoo Park (Eds.). Lecture Notes in Computer Science, Vol. 3537. Springer Berlin Heidelberg, Berlin, Heidelberg, 417--428. Google ScholarDigital Library
- J.E. Bresenham. 1965. Algorithm for computer control of a digital plotter. IBM Systems Journal 4, 1 (1965), 25--30. Google ScholarDigital Library
- M. L. Connolly. 1983. Analytical molecular surface calculation. Journal of Applied Crystallography 16, 5 (Oct 1983), 548--558.Google ScholarCross Ref
- Olivier Cuisenaire. 1997. Region growing Euclidean Distance Transforms. In Image Analysis and Processing, Alberto Bimbo (Ed.). Lecture Notes in Computer Science, Vol. 1310. Springer Berlin Heidelberg, Berlin, Heidelberg, 263--270. Google ScholarDigital Library
- Sebastian Daberdaku and Carlo Ferrari. 2015. Parallel computation of voxelized macromolecular surfaces by spatial slicing. In 2015 IEEE Trustcom/BigDataSE/ISPA, Vol. 3. IEEE, IEEE Computer Society, Washington, DC, 184--189. Google ScholarDigital Library
- Sebastian Daberdaku and Carlo Ferrari. 2016. Computing discrete fine-grained representations of protein surfaces. In Computational Intelligence Methods for Bioinformatics and Biostatistics - 12th International Meeting, CIBB 2015, Naples, Italy, September 10-12, 2015, Revised Selected Papers, Claudia Angelini, Paola MV Rancoita, and Stefano Rovetta (Eds.). Lecture Notes in Bioinformatics, Vol. 9874. Springer International Publishing, Cham, Chapter 14, 180--195.Google Scholar
- Sebastian Daberdaku and Carlo Ferrari. 2018. Computing voxelised representations of macromolecular surfaces: A parallel approach. The International Journal of High Performance Computing Applications 32, 3 (2018), 407--432.Google ScholarCross Ref
- Sebastian Daberdaku and Carlo Ferrari. 2018. Exploring the potential of 3D Zernike descriptors and SVM for protein--protein interface prediction. BMC Bioinformatics 19, 1 (06 Feb 2018), 35.Google Scholar
- Sergio Decherchi, José Colmenares, Chiara Eva Catalano, Michela Spagnuolo, Emil Alexov, and Walter Rocchia. 2013. Between algorithm and model: Different molecular surface definitions for the Poisson-Boltzmann based electrostatic characterization of biomolecules in solution. Computer Physics Communications 13 (2013), 61.Google ScholarCross Ref
- Zainab Abu Deeb, Donald A. Adjeroh, and Bing-Hua Jiang. 2011. Protein surface characterization using an invariant descriptor. International Journal of Biomedical Imaging 2011 (2011), 15. Google ScholarDigital Library
- Luigi Di Costanzo, Guadalupe Sabio, Alfonso Mora, Paulo C Rodriguez, Augusto C Ochoa, Francisco Centeno, and David W Christianson. 2005. Crystal structure of human arginase I at 1.29-Å resolution and exploration of inhibition in the immune response. Proceedings of the National Academy of Sciences of the United States of America 102, 37 (2005), 13058--13063.Google ScholarCross Ref
- Juan Esquivel-Rodriguez, Vianney Filos-Gonzalez, Bin Li, and Daisuke Kihara. 2014. Pairwise and multimeric protein-protein docking using the LZerD program suite. In Protein Structure Prediction, Daisuke Kihara (Ed.). Methods in Molecular Biology, Vol. 1137. Springer New York, New York, NY, 209--234.Google Scholar
- Scott Grandison, Carl Roberts, and Richard J. Morris. 2009. The application of 3D Zernike moments for the description of "model-free" molecular structure, functional motion, and structural reliability. Journal of Computational Biology 16, 3 (2009), 487--500.Google ScholarCross Ref
- Bingjie Hu, Xiaolei Zhu, Lyman Monroe, Mark G Bures, and Daisuke Kihara. 2014. PL-PatchSurfer: A novel molecular local surface-based method for exploring protein--ligand interactions. International Journal of Molecular Sciences 15, 9 (2014), 15122.Google ScholarCross Ref
- Bingding Huang and Michael Schroeder. 2006. LIGSITEcsc: Predicting ligand binding sites using the Connolly surface and degree of conservation. BMC Structural Biology 6, 1 (24 Sep 2006), 19.Google Scholar
- Sheng-You Huang. 2015. Exploring the potential of global protein--protein docking: an overview and critical assessment of current programs for automatic ab initio docking. Drug Discov Today 20, 8 (2015), 969--977.Google ScholarCross Ref
- Daisuke Kihara, Lee Sael, Rayan Chikhi, and Juan Esquivel-Rodriguez. 2011. Molecular surface representation using 3D Zernike descriptors for protein shape comparison and docking. Current Protein and Peptide Science 12, 6 (2011), 520--533.Google ScholarCross Ref
- Kengo Kinoshita and Haruki Nakamura. 2005. Identification of the ligand binding sites on the molecular surface of proteins. Protein Science 14, 3 (2005), 711--718.Google ScholarCross Ref
- B. Lee and F.M. Richards. 1971. The interpretation of protein structures: Estimation of static accessibility. Journal of Molecular Biology 55, 3 (1971), 379-IN4.Google ScholarCross Ref
- Bin Li, Srinivasan Turuvekere, Manish Agrawal, David La, Karthik Ramani, and Daisuke Kihara. 2008. Characterization of local geometry of protein surfaces with the visibility criterion. Proteins: Structure, Function, and Bioinformatics 71, 2 (2008), 670--683.Google ScholarCross Ref
- A. D. MacKerell, D. Bashford, M. Bellott, R. L. Dunbrack, J. D. Evanseck, M. J. Field, S. Fischer, J. Gao, H. Guo, S. Ha, D. Joseph-McCarthy, L. Kuchnir, K. Kuczera, F. T. K. Lau, C. Mattos, S. Michnick, T. Ngo, D. T. Nguyen, B. Prodhom, W. E. Reiher, B. Roux, M. Schlenkrich, J. C. Smith, R. Stote, J. Straub, M. Watanabe, J. Wiórkiewicz-Kuczera, D. Yin, and M. Karplus. 1998. All-atom empirical potential for molecular modeling and dynamics studies of proteins. Journal of Physical Chemistry B 102, 18 (1998), 3586--3616.Google ScholarCross Ref
- Massimo Paoli, Robert Liddington, Jeremy Tame, Anthony Wilkinson, and Guy Dodson. 1996. Crystal Structure of T State Haemoglobin with Oxygen Bound At All Four Haems. Journal of Molecular Biology 256, 4 (1996), 775--792.Google ScholarCross Ref
- Mathias Rask-Andersen, Markus Sällman Almén, and Helgi B Schiöth. 2011. Trends in the exploitation of novel drug targets. Nature reviews Drug discovery 10, 8 (2011), 579.Google Scholar
- David W Ritchie. 2008. Recent progress and future directions in protein--protein docking. Current Protein and Peptide Science 9, 1 (2008), 1--15.Google ScholarCross Ref
- Lee Sael, David La, Bin Li, Raif Rustamov, and Daisuke Kihara. 2008. Rapid comparison of properties on protein surface. Proteins 73, 1 (2008), 1--10.Google ScholarCross Ref
- Michel F. Sanner, Arthur J. Olson, and Jean-Claude Spehner. 1995. Fast and robust computation of molecular surfaces. In Proceedings of the Eleventh Annual Symposium on Computational Geometry (SCG '95). ACM, New York, NY, USA, 406--407. Google ScholarDigital Library
- Molly B. Schmid. 2002. Structural proteomics: The potential of high-throughput structure determination. Trends in Microbiology 10, 10 (2002), s27--s31.Google ScholarCross Ref
- Stefan Schmitt, Daniel Kuhn, and Gerhard Klebe. 2002. A new method to detect related function among proteins independent of sequence and fold homology. Journal of Molecular Biology 323, 2 (2002), 387--406.Google ScholarCross Ref
- Allegra Via, Fabrizio Ferre, Barbara Brannetti, and Manuela Helmer-Citterich. 2000. Protein surface similarities: A survey of methods to describe and compare protein surfaces. Cellular and Molecular Life Sciences 57, 13-14 (2000), 1970--1977.Google ScholarCross Ref
- Martin Weisel, Ewgenij Proschak, and Gisbert Schneider. 2007. PocketPicker: analysis of ligand binding-sites with shape descriptors. Chemistry Central Journal 1, 1 (2007), 7.Google ScholarCross Ref
- Shuangye Yin, Elizabeth A Proctor, Alexey A Lugovskoy, and Nikolay V Dokholyan. 2009. Fast screening of protein surfaces using geometric invariant fingerprints. Proceedings of the National Academy of Sciences of the United States of America 106, 39 (2009), 16622--16626.Google ScholarCross Ref
- Wei-Wei Yu, Fei He, and Ping Xi. 2010. A rapid 3D seed-filling algorithm based on scan slice. Computers & Graphics 34, 4 (2010), 449 -- 459. Procedural Methods in Computer Graphics Illustrative Visualization. Google ScholarDigital Library
- Yongjie Zhang, Guoliang Xu, and Chandrajit Bajaj. 2006. Quality meshing of implicit solvation models of biomolecular structures. Computer Aided Geometric Design 23, 6 (2006), 510--530. Google ScholarDigital Library
- Xiaolei Zhu, Yi Xiong, and Daisuke Kihara. 2015. Large-scale binding ligand prediction by improved patch-based method Patch-Surfer2.0. Bioinformatics 31, 5 (2015), 707--713.Google ScholarCross Ref
Index Terms
- Parallel Computation of Voxelised Protein Surfaces with OpenMP
Recommendations
Accelerating the computation of triangulated molecular surfaces with OpenMP
The ability to compute mesh representations of molecular surfaces in an accurate and timely manner is of great importance for applications such as implicit solvent modelling. The efficient computation of high-resolution mesh surfaces for large ...
Residue interaction networks of K-Ras protein with water molecules identifies the potential role of switch II and P-loop
AbstractThe mutant K-Ras with aberrant signaling is the primary cause of several cancers. The proposed study investigated the influence of water molecules in K-Ras crystal structure, where they have a significant function by understanding their residue ...
Graphical abstractDisplay Omitted
Highlights- We determined the interaction properties of Residue interaction networks (RINs) of K-Ras with and without water molecules.
- Firstly, we explored the RINs and determined hotspot residues with water and without water K-Ras-GTP.
- ...
Parallel protein secondary structure prediction schemes using Pthread and OpenMP over hyper-threading technology
AbstractProtein secondary structure prediction has a fundamental influence on today’s bioinformatics research. In this work, tertiary classifiers for the protein secondary structure prediction are implemented on Denoeux Belief Neural Network (DBNN) ...
Comments