ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Purchase PDF (660 K)

Article Toolbox
 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1006/jpdc.1999.1554    
How to Cite or Link Using DOI (Opens New Window)

Copyright © 1999 Academic Press. All rights reserved.

Regular Article

Communication Optimizations for Parallel C Programs*1

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Yingchun Zhu and Laurie J. Hendren

School of Computer Science, McGill University, Montreal, Quebec, Canada, H3A 2A7f1


Received 1 August 1998; 
revised 15 March 1999; 
accepted 9 April 1999. ;
Available online 27 March 2002.

Abstract

This paper presents algorithms for reducing the communication overhead for parallel C programs that use dynamically allocated data structures. The framework consists of an analysis phase called possible-placement analysis, and a transformation phase called communication selection. The fundamental idea of possible-placement analysis is to find all possible points for insertion of remote memory operations. Remote reads are propagated upwards, whereas remote writes are propagated downwards. Based on the results of the possible-placement analysis, the communication selection transformation selects the “best” place for inserting the communication and determines if pipelining or blocking of communication should be performed. The framework has been implemented in the EARTH-McCAT optimizing C compiler, and experimental results are presented for five pointer-intensive benchmarks running on the EARTH-MANNA distributed-memory parallel processor. These experiments show that the communication optimization can provide performance improvements of up to 16% over the unoptimized benchmarks.

Author Keywords: communication optimizations; multithreaded architectures; compiling for parallel architectures

*1 Work supported in part by NSERC and FCAR.

f1 E-mail: ying@cs.mcgill.ca, hendren@cs.mcgill.ca


 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.