Dynamic Core Binding for Load Balancing of Applications Parallelized with MPI/OpenMP

Kawai, Masatoshi; Ida, Akihiro; Hanawa, Toshihiro; Nakajima, Kengo

doi:10.1007/978-3-031-36024-4_30

Masatoshi Kawai¹³,
Akihiro Ida¹⁴,
Toshihiro Hanawa¹⁵ &
…
Kengo Nakajima^15,16

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 10475))

Included in the following conference series:

International Conference on Computational Science

526 Accesses

Abstract

Load imbalance is a critical problem that degrades the performance of parallelized applications in massively parallel processing. Although an MPI/OpenMP implementation is widely used for parallelization, users must maintain load balancing at the process level and thread (core) level for effective parallelization. In this paper, we propose dynamic core binding (DCB) to processes for reducing the computation time and energy consumption of applications. Using the DCB approach, an unequal number of cores is bound to each process, and load imbalance among processes is mitigated at the core level. This approach is not only improving parallel performance but also reducing power consumption by reducing the number of using cores without increasing the computational time. Although load balancing among nodes cannot be handled by DCB, we also examine how to solve this problem by mapping processes to nodes. In our numerical evaluations, we implemented a DCB library and applied it to the lattice \(\mathcal {H}\)-matrixes. Based on the numerical evaluations, we achieved a 58% performance improvement and 77% energy consumption reduction for the applications using the lattice \(\mathcal {H}\)-matrix.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Attiya, I., et al.: Job scheduling in cloud computing using a modified Harris hawks optimization and simulated annealing algorithm. Comput. Intell. Neurosci. 2020 (2020)
Google Scholar
Corbalan, J., et al.: Dynamic load balancing of MPI+OpenMP applications. In: 2004 International Conference on Parallel Processing, ICPP 2004, vol. 1, pp. 195–202 (2004)
Google Scholar
Curtis-Maury, M., et al.: Online power-performance adaptation of multithreaded programs using hardware event-based prediction. In: Proceedings of the 20th Annual International Conference on Supercomputing, pp. 157–166 (2006)
Google Scholar
Curtis-Maury, M., et al.: Prediction-based power-performance adaptation of multithreaded scientific codes. IEEE Trans. Parallel Distrib. Syst. 19(10), 1396–1410 (2008)
Article Google Scholar
Garcia, M., Corbalan, J., Badia, R.M., Labarta, J.: A dynamic load balancing approach with SMPSuperscalar and MPI. In: Keller, R., Kramer, D., Weiss, J.-P. (eds.) Facing the Multicore - Challenge II. LNCS, vol. 7174, pp. 10–23. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30397-5_2
Chapter Google Scholar
Garza-Santisteban, F., et al.: A simulated annealing hyper-heuristic for job shop scheduling problems. In: 2019 IEEE Congress on Evolutionary Computation (CEC), pp. 57–64 (2019)
Google Scholar
Grant, R.E., et al.: Standardizing power monitoring and control at exascale. Computer 49(10), 38–46 (2016)
Article Google Scholar
Hiroshi, N., et al.: Third generation digital annealer technology (2021). https://www.fujitsu.com/jp/documents/digitalannealer/researcharticles/DA_WP_EN_20210922. pdf
Ida, A.: Lattice \(\cal{H} \)-matrices on distributed-memory systems. In: IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 389–398 (2018)
Google Scholar
Iwashita, T., et al.: Software framework for parallel BEM analyses with H-matrices using MPI and OpenMP. Procedia Comput. Sci. 108, 2200–2209 (2017)
Article Google Scholar
Klinkenberg, J., et al.: CHAMELEON: reactive load balancing for hybrid MPI+ OpenMP task-parallel applications. J. Parallel Distrib. Comput. 138, 55–64 (2020)
Article Google Scholar
Korte, B.H., et al.: J. Comb. Optim. 1 (2011)
Google Scholar
Li, D., et al.: Strategies for energy-efficient resource management of hybrid programming models. IEEE Trans. Parallel Distrib. Syst. 24(1), 144–157 (2012)
Article Google Scholar
Nakajima, K., et al.: Communication-computation overlapping with dynamic loop scheduling for preconditioned parallel iterative solvers on multicore and manycore clusters. In: 2017 46th International Conference on Parallel Processing Workshops (ICPPW), pp. 210–219 (2017)
Google Scholar
Suleman, M.A., et al.: Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on CMPs. ACM SIGPLAN Not. 43(3), 277–286 (2008)
Article Google Scholar
Zaman, M., et al.: PyQUBO: Python library for mapping combinatorial optimization problems to QUBO form. IEEE Trans. Comput. 71(4), 838–850 (2022)
Article Google Scholar

Download references

Acknowledgment

This work was supported by JSPS KAKENHI Grant Number 18K18059, 21H03447, and 19H05662. This work is also supported by “Joint Usage/Research Center for Interdisciplinary Large-scale Information Infrastructures (JHPCN)” in Japan (Project ID: jh230058).

Author information

Authors and Affiliations

Nagoya University, Nagoya, Japan
Masatoshi Kawai
Research Institute for Value-Added-Information Generation (VAiG), Yokosuka, Japan
Akihiro Ida
The University of Tokyo, Tokyo, Japan
Toshihiro Hanawa & Kengo Nakajima
Riken, Wakō, Japan
Kengo Nakajima

Authors

Masatoshi Kawai
View author publications
You can also search for this author in PubMed Google Scholar
Akihiro Ida
View author publications
You can also search for this author in PubMed Google Scholar
Toshihiro Hanawa
View author publications
You can also search for this author in PubMed Google Scholar
Kengo Nakajima
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Masatoshi Kawai .

Editor information

Editors and Affiliations

Czech Technical University in Prague, Prague, Czech Republic
Jiří Mikyška
University of Amsterdam, Amsterdam, The Netherlands
Clélia de Mulatier
AGH University of Science and Technology, Krakow, Poland
Maciej Paszynski
University of Amsterdam, Amsterdam, The Netherlands
Valeria V. Krzhizhanovskaya
University of Tennessee at Knoxville, Knoxville, TN, USA
Jack J. Dongarra
University of Amsterdam, Amsterdam, The Netherlands
Peter M.A. Sloot

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kawai, M., Ida, A., Hanawa, T., Nakajima, K. (2023). Dynamic Core Binding for Load Balancing of Applications Parallelized with MPI/OpenMP. In: Mikyška, J., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M. (eds) Computational Science – ICCS 2023. ICCS 2023. Lecture Notes in Computer Science, vol 10475. Springer, Cham. https://doi.org/10.1007/978-3-031-36024-4_30

Download citation

DOI: https://doi.org/10.1007/978-3-031-36024-4_30
Published: 26 June 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36023-7
Online ISBN: 978-3-031-36024-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Dynamic Core Binding for Load Balancing of Applications Parallelized with MPI/OpenMP