Abstract
The use of GPUs to accelerate the factoring of large sparse symmetric matrices shows the potential of yielding important benefits to a large group of widely used applications. This paper examines how a multifrontal sparse solver performs when exploiting both the GPU and its multi-core host. It demonstrates that the GPU can dramatically accelerate the solver relative to one host CPU. Furthermore, the solver can profitably exploit both the GPU to factor its larger frontal matrices and multiple threads on the host to handle the smaller frontal matrices.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Heath, M., Ng, E., Peyton, B.: Parallel algorithms for sparse linear systems. Society for Industrial and Applied Mathematics Review 33, 420–460 (1991)
Charlesworth, A., Gustafson, J.: Introducing Replicated VLSI to Supercomputing: the FPS-164/MAX Scientific Computer. IEEE Computer 19(3), 10–23 (1986)
Pham, D.C., Aipperspach, T., Boerstler, D., Bolliger, M., Chaudhry, R., Cox, D., Harvey, P., Harvey, P.M., Hofstee, H.P., Johns, C., Kahle, J., Kameyama, A., Keaty, J., Masubuchi, Y., Pham, M., Pille, J., Posluszny, S., Riley, M., Stasiak, D.L., Suzuoki, M., Takahashi, O., Warnock, J., Weitzel, S., Wendel, D., Yazawa, K.: Overview of the Architecture, Circuit Design, and Physical Implementation of a First-Generation Cell Processor. IEEE Journal of Solid State Circuits 41(1) (January 2006)
Lastra, A., Lin, M., Minocha, D.: ACM Workshop on General Purpose Computations on Graphics Processors (2004)
Duff, I., Reid, J.: The Multifrontal Solution of Indefinite Sparse Symmetric Linear Systems. ACM Transactions on Mathematical Software 9, 302–335 (1983)
Dongarra, J.J., Du Croz, J., Hammarling, S., Duff, I.S.: A Set of Level 3 Basic Linear Algebra Subprograms. ACM Transactions on Mathematical Software 16(1), 1–17 (1990)
Scott Larson, E., McAllister, D.: Fast matrix multiplies using graphics hardware. In: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing, p. 55. ACM Press, New York (2001)
Fatahalian, K., Sugarman, J., Hanrahan, P.: Understanding the Efficiency of GPU Algorithms for Matrix-Matrix Multiplication. In: Proceedings of the ACM Sigraph/Eurographics Conference on Graphics Hardware. Eurographics Association, pp. 133–138 (2004)
Govindaraju, N., Manocha, D.: Cache-Efficient Numerical Algorithms Using Graphics Hardware, University of North Carolina Technical Report (2007)
Lucas, R.F.: GPU-Enhanced Linear Solver Results. In: The Proceedings of Parallel Processing for Scientific Computing. SIAM, Philadelphia (2008)
Private communication with Gene Poole, ANSYS Inc., at SC|2008, Austin, TX (November 2008)
Karypis, G., Kumar, V.: A fast and high quality multilevel scheme for partitioning irregular graphs. In: Haridi, S., Ali, K., Magnusson, P. (eds.) Euro-Par 1995. LNCS, vol. 966, pp. 113–122. Springer, Heidelberg (1995)
Ashcraft, C., Grimes, R.: The Influence of Relaxed Supernode Partitions on the Multifrontal Method. ACM Transactions in Mathematical Software 15, 291–309 (1989)
Ashcraft, C., Lucas, R.: A Stackless Multifrontal Method. In: Tenth SIAM Conference on Parallel Processing for Scientific Computing (March 2001)
Arnold, M.G., Bailey, T.A., Cowles, J.R., Winkel, M.D.: Applying Features of IEEE 754 to Sign/Logarithm Arithmetic. IEEE Transactions on Computers 41(8), 1040–1050 (1992)
Buck, I.: GPU Computing: Programming a Massively Parallel Processor. In: International Symposium on Code Generation and Optimization, San Jose, California
Duff, I.: Parallel Implementation of Multifrontal Schemes. Parallel Computing 3, 193–204 (1986)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lucas, R.F., Wagenbreth, G., Davis, D.M., Grimes, R. (2011). Multifrontal Computations on GPUs and Their Multi-core Hosts. In: Palma, J.M.L.M., Daydé, M., Marques, O., Lopes, J.C. (eds) High Performance Computing for Computational Science – VECPAR 2010. VECPAR 2010. Lecture Notes in Computer Science, vol 6449. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19328-6_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-19328-6_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19327-9
Online ISBN: 978-3-642-19328-6
eBook Packages: Computer ScienceComputer Science (R0)