DOI: 10.1145/1529282.1529704
research-article

Optimizing techniques for saturated arithmetic with first-order linear recurrence

Published: 08 March 2009

ABSTRACT

Saturated arithmetic is a typical operation in multimedia applications, and most multimedia extensions in the instruction set architecture (ISA) of modern processors provide saturation instructions for it. Extensive research has therefore focused on how to use saturation instructions to optimize programs. Previous algorithms mainly target purely saturated arithmetic; in real-life applications, however, saturated arithmetic is often mingled with first-order linear recurrence (FOLR). When an FOLR pattern appears in the program, previous algorithms cannot identify the saturated arithmetic.

In fact, saturated arithmetic with FOLR (SAWF) is a new and significant pattern; in particular, SAWF with a coefficient of one is frequently used in multimedia applications. Hence, a method for vectorizing this pattern efficiently is needed. This paper discusses how to vectorize SAWF, explores an efficient method for vectorizing SAWF with a coefficient of one, evaluates that method, and implements the optimizing technique as a library. Such an implementation makes it easier for compilers to exploit the technique. The experimental results show that the optimizing technique achieves a speedup of 1.19 to 1.46 on a Pentium IV processor. At the same time, the optimizing techniques in this paper can also be used to develop a library for SAWF, so programmers can benefit even without changing the compiler.
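
To make the pattern concrete, the following is a minimal scalar sketch in C. It assumes that SAWF with a coefficient of one takes the form x[i] = sat(x[i-1] + b[i]) on signed 16-bit values. The function names, the 16-bit element type, and the explicit initial value `x0` are illustrative assumptions, not taken from the paper, and the sketch shows only the unvectorized pattern, not the paper's optimizing technique.

```c
#include <stddef.h>
#include <stdint.h>

/* Saturating signed 16-bit add: clamps the result to
 * [INT16_MIN, INT16_MAX] instead of wrapping around. */
static int16_t sat_add_i16(int16_t a, int16_t b)
{
    int32_t sum = (int32_t)a + (int32_t)b; /* widen so the add cannot overflow */
    if (sum > INT16_MAX) return INT16_MAX;
    if (sum < INT16_MIN) return INT16_MIN;
    return (int16_t)sum;
}

/* Assumed form of SAWF with coefficient one:
 *   x[i] = sat(x[i-1] + b[i]), with x[-1] = x0.
 * Each iteration needs the previous saturated result,
 * so the loop carries a dependence from one iteration to the next. */
static void sawf_coeff_one(int16_t x0, const int16_t *b, int16_t *x, size_t n)
{
    int16_t prev = x0;
    for (size_t i = 0; i < n; ++i) {
        prev = sat_add_i16(prev, b[i]);
        x[i] = prev;
    }
}
```

For example, with `x0 = 0` and `b = {30000, 10000, -5, -32767}`, the loop yields `{30000, 32767, 32762, -5}`, whereas clamping an ordinary prefix sum of the same inputs would give `{30000, 32767, 32767, 7228}`. The difference shows that each step depends on the previous saturated value, so the recurrence cannot simply be replaced by a plain running sum followed by a saturation step.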

    • Published in

      SAC '09: Proceedings of the 2009 ACM symposium on Applied Computing
      March 2009
      2347 pages
      ISBN:9781605581668
      DOI:10.1145/1529282

      Copyright © 2009 ACM


      Publisher

      Association for Computing Machinery

      New York, NY, United States


      Acceptance Rates

      Overall Acceptance Rate: 1,650 of 6,669 submissions, 25%