ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Microprocessors and Microsystems
Volume 31, Issue 1, 12 February 2007, Pages 25-37
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (2440 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.micpro.2006.07.004    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2006 Elsevier B.V. All rights reserved.

Multiplier-less VLSI architecture for real-time computation of multi-dimensional convolution

Ming Z. Zhanga, E-mail The Corresponding Author, Hau T. Ngoa, E-mail The Corresponding Author and Vijayan K. AsariCorresponding Author Contact Information, a, E-mail The Corresponding Author

aComputational Intelligence and Machine Vision Laboratory, Department of Electrical and Computer Engineering, Old Dominion University, Norfolk, VA 23529, USA

Available online 28 August 2006.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

A VLSI efficient multiplier-less architecture for real-time computation of multi-dimensional convolution is presented in this paper. The new architecture performs computations in the logarithmic domain by utilizing novel multiplier-less log2 and inverse-log2 modules which are capable of converting the fraction numbers currently not available in the literature. An effective data handling strategy is developed in conjunction with the logarithmic modules to eliminate the necessity of multipliers in the architecture. The proposed approach reduces hardware resources significantly compared to other approaches maintaining a high degree of accuracy. The architecture is developed as a combined systolic-pipelined design that produces an output in every clock cycle after an initial latency of 93.19 uSec. The architecture is capable of operating with a clock frequency of 99 MHz based on Xilinx’s Virtex II 2v2000ff896-4 FPGA and the throughput of the system is observed as 99 MOPS (million outputs per second). Error analysis performed with the FPGA-based system in the image processing examples of edge detection and noise filtering shows that the proposed architecture produces outputs similar to that obtained by software simulation using Matlab.

Keywords: Multi-dimensional convolution; Multiplier-less architecture; Logarithmic domain computation; Systolic-pipelines architecture; FPGA-based implementation

Article Outline

1. Introduction
2. Theory of log-based convolution operation
2.1. log2 and inverse-log2 approximations
3. Architecture for log-based 2-D convolution
3.1. Line buffer module
3.2. Processing elements (PEs)
3.3. log2 architecture
3.4. Inverse-log2 architecture
3.5. Optional padding architecture
4. Performance evaluation
4.1. Set up of simulation parameters
4.2. Edge detection by Laplacian kernel
4.3. Noise filtering by Gaussian kernel
4.4. Performance analysis
5. Conclusion
References
















 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.