ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Journal of Parallel and Distributed Computing
Volume 64, Issue 1, January 2004, Pages 68-78
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Purchase PDF (361 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.jpdc.2003.06.003    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2003 Elsevier Inc. All rights reserved.

A fine-grain load-adaptive algorithm of the 2D discrete wavelet transform for multithreaded architectures

Parimala ThulasiramanCorresponding Author Contact Information, E-mail The Corresponding Author, a, Ashfaq A. KhokharE-mail The Corresponding Author, b, Gerd HeberE-mail The Corresponding Author, c and Guang R. GaoE-mail The Corresponding Author, d

a Department of Computer Science, University of Manitoba, Winnipeg, Manitoba R3T 2N2, Canada b Department of EECS, University of Illinois at Chicago, Chicago, IL 60607, USA c Cornell Theory Center, Cornell University, 638 Rhodes Hall, Ithaca, NY 14853, USA d Department of ECE, University of Delaware, Newark, DE 19716, USA

Received 27 November 2001; 
revised 19 May 2003. 
Available online 28 October 2003.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

In this paper we develop a load-adaptive multithreaded algorithm to compute 2D Discrete Wavelet Transform (DWT) and its implementation on a fine-grain multithreading platform. In a 2D DWT computation, the problem sizes reduces at every decomposition level and the length of the emerging computation paths also vary. The parallel algorithm proposed in this paper, dynamically scales itself to the varying problem size. During any iteration, the ratio of the number of local threads to the number of remote threads issued by a processor can be adjusted to be greater than 1 by controlling the algorithm parameters. This approach provides an opportunity to interleave computation and communication without explicitly introducing idle cycles on waiting for the remote threads to finish. Experimental results are reported based on the implementations of the proposed algorithm on a 20 node emulated multithreaded platform, EARTH-MANNA, specifically designed for fine-grain multithreaded paradigms. We show that multithreading implementations of the proposed algorithm are at least 2 times faster than the MPI-based message passing implementations reported in the literature, assuming the same processor speed. We further show that the proposed algorithm and implementations scale linearly with respect to problem and machine sizes.

Author Keywords: Wavelet transform; Parallel algorithms; Image processing; Fine grain; Multithreading


 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.