Efficient synchronization of multiprocessors with shared memory

Authors:
Clyde P. Kruskal, Univ. of Maryland, College Park
Larry Rudolph, Hebrew Univ., Jerusalem, Israel
Marc Snir, IBM T. J. Watson Research Center, Yorktown Heights, NY

ACM Transactions on Programming Languages and Systems, Volume 10, Issue 4, pp. 579–601
https://doi.org/10.1145/48022.48024

Published: 01 October 1988

Abstract

A new formalism is given for read-modify-write (RMW) synchronization operations. This formalism is used to extend the memory reference combining mechanism introduced in the NYU Ultracomputer, to arbitrary RMW operations. A formal correctness proof of this combining mechanism is given. General requirements for the practicality of combining are discussed. Combining is shown to be practical for many useful memory access operations. This includes memory updates of the form mem_val := mem_val op val, where op need not be associative, and a variety of synchronization primitives. The computation involved is shown to be closely related to parallel prefix evaluation.
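
The combining idea in the abstract can be illustrated with a small Python model. This is a sketch under stated assumptions, not the paper's own construction: it assumes an RMW request carries a mapping f, stores f(old) in the cell, and replies with the old value; that two requests with mappings f and g combine into one request with mapping "f, then g"; and that when the reply v comes back, the first request receives v and the second receives f(v). The names `fetch_and_op`, `combine_tree`, `run_combined`, and `run_serial` are hypothetical.

```python
from typing import Callable, List, Tuple

Mapping = Callable[[int], int]


def fetch_and_op(op: Callable[[int, int], int], val: int) -> Mapping:
    """Mapping for the update mem_val := mem_val op val (op may be non-associative)."""
    return lambda old: op(old, val)


def combine_tree(reqs: List[Mapping], lo: int, hi: int):
    """Combine reqs[lo:hi] pairwise, as a tree of switches might (assumed model).

    Returns (combined mapping, decombine), where decombine(v) turns the single
    reply v from memory into one reply per original request.
    """
    if hi - lo == 1:
        return reqs[lo], lambda v: [v]
    mid = (lo + hi) // 2
    f, reply_left = combine_tree(reqs, lo, mid)
    g, reply_right = combine_tree(reqs, mid, hi)
    combined = lambda x: g(f(x))  # left half is serialized before right half
    # The left half sees the old value v; the right half sees f(v).
    decombine = lambda v: reply_left(v) + reply_right(f(v))
    return combined, decombine


def run_combined(mem: int, reqs: List[Mapping]) -> Tuple[int, List[int]]:
    """One memory access serves all requests; returns (new value, replies)."""
    if not reqs:
        return mem, []
    combined, decombine = combine_tree(reqs, 0, len(reqs))
    return combined(mem), decombine(mem)


def run_serial(mem: int, reqs: List[Mapping]) -> Tuple[int, List[int]]:
    """Reference behavior: requests applied one at a time, in order."""
    replies = []
    for f in reqs:
        replies.append(mem)
        mem = f(mem)
    return mem, replies


if __name__ == "__main__":
    # Subtraction is not associative, yet the combined run matches the serial one.
    reqs = [fetch_and_op(lambda a, b: a - b, v) for v in [1, 2, 3, 4]]
    print(run_serial(100, reqs))
    print(run_combined(100, reqs))
```

In this model the replies are the running prefixes of the mappings applied to the initial value, which is one way to read the abstract's remark that the computation is closely related to parallel prefix evaluation. The sketch stores mappings as closures; a hardware switch would presumably need a compact representation instead (for `x - v1` followed by `x - v2`, the single operand `v1 + v2` would do).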


Reviews

Reviewer: Patricia Mainwaring Samwell

The subject of this paper is the contention that arises when several processors in a shared-memory multiprocessor attempt to access the same memory location at the same time. This form of memory contention can cause catastrophic performance degradation in multiprocessor machines. The paper develops a generalized formal description of read-modify-write (RMW) operations and discusses their implementation for shared-memory multiprocessors. It describes how contending memory requests may be combined, with the combining distributed across the processor/memory switch, and discusses the semantics of interleaved sequences of memory accesses generated by different processes. It goes on to explore the implications of combining for different sets of operations, e.g., load/store, logical operations, and arithmetic operations. The combining mechanism is also formalized. Reading this research paper requires some mathematical facility and an understanding of implementation mechanisms for multiprocessors. The paper identifies the often subtle process and processor interactions that determine the correctness and efficiency of multiprocessor systems, and it presents a very useful generalization and formalization of the mechanisms involved. It is an interesting and well-written paper, notable for its clarity of exposition.
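
To make the point about sets of operations concrete, here is a hedged Python sketch of one family that stays compact under combining: loads, stores, and bitwise logical operations on a single word. The encoding x -> (x AND a) XOR b is an assumption chosen for illustration, since the review does not describe the paper's representation. The names `BitMap`, `then`, `serve_pair`, and the 8-bit word size are likewise hypothetical.

```python
from dataclasses import dataclass

WORD = 0xFF  # assumed 8-bit memory word, for illustration only


@dataclass(frozen=True)
class BitMap:
    """Encodes the mapping x -> (x & a) ^ b on one memory word."""
    a: int
    b: int

    def __call__(self, x: int) -> int:
        return ((x & self.a) ^ self.b) & WORD

    def then(self, g: "BitMap") -> "BitMap":
        """Single mapping equivalent to applying self first, then g.

        ((x & a1) ^ b1) & a2 ^ b2  ==  (x & (a1 & a2)) ^ ((b1 & a2) ^ b2)
        """
        return BitMap(self.a & g.a, (self.b & g.a) ^ g.b)


def load() -> BitMap:            # leaves the cell unchanged
    return BitMap(WORD, 0)

def store(c: int) -> BitMap:     # overwrites the cell with c
    return BitMap(0, c & WORD)

def and_(m: int) -> BitMap:      # x & m
    return BitMap(m & WORD, 0)

def or_(m: int) -> BitMap:       # x | m, written as (x & ~m) ^ m
    return BitMap(~m & WORD, m & WORD)

def xor_(m: int) -> BitMap:      # x ^ m
    return BitMap(WORD, m & WORD)


def serve_pair(mem: int, f: BitMap, g: BitMap):
    """Serve two combined requests with one memory access.

    Returns (new cell value, reply to f's issuer, reply to g's issuer),
    with f serialized before g.
    """
    combined = f.then(g)
    return combined(mem), mem, f(mem)


if __name__ == "__main__":
    print(serve_pair(0b1100, or_(0b0011), and_(0b1010)))
    print(load().then(store(5)) == store(5))
```

The design point is that `then` always returns another two-field `BitMap`, so a combined request needs no more space than a single one, however many requests it represents. A load followed by a store collapses to the store, and a store followed by a load also collapses to the store, while the replies still reflect the serialized order.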


Published in

ACM Transactions on Programming Languages and Systems Volume 10, Issue 4
Oct. 1988
128 pages
ISSN: 0164-0925
EISSN: 1558-4593
DOI: 10.1145/48022

Copyright © 1988 ACM
Publisher
Association for Computing Machinery
New York, NY, United States