
The cache performance and optimizations of blocked algorithms

Published: 01 April 1991

References

  1. W. Abu-Sufah, D. J. Kuck, and D. H. Lawrie. Automatic program transformations for virtual memory computers. Proc. of the 1979 National Computer Conference, pages 969-974, June 1979.
  2. E. Anderson and J. Dongarra. LAPACK working note 18, implementation guide for LAPACK. Technical Report CS-90-101, University of Tennessee, Apr 1990.
  3. D. Callahan, S. Carr, and K. Kennedy. Improving register allocation for subscripted variables. In Proceedings of the ACM SIGPLAN '90 Conference on Programming Language Design and Implementation, June 1990.
  4. J. Dongarra, J. Du Croz, S. Hammarling, and I. Duff. A set of level 3 basic linear algebra subprograms. ACM Transactions on Mathematical Software, pages 1-17, March 1990.
  5. K. Gallivan, W. Jalby, U. Meier, and A. Sameh. The impact of hierarchical memory systems on linear algebra algorithm design. Technical Report UIUCSRD 625, University of Illinois, 1987.
  6. D. Gannon and W. Jalby. The influence of memory hierarchy on algorithm organization: Programming FFTs on a vector multiprocessor. In The Characteristics of Parallel Algorithms. MIT Press, 1987.
  7. D. Gannon, W. Jalby, and K. Gallivan. Strategies for cache and local memory management by global program transformation. Journal of Parallel and Distributed Computing, 5:587-616, 1988.
  8. G. H. Golub and C. F. Van Loan. Matrix Computations. Johns Hopkins University Press, 1989.
  9. J.-W. Hong and H. T. Kung. I/O complexity: The red-blue pebble game. In Proceedings of the Thirteenth Annual ACM Symposium on Theory of Computing, pages 326-333. ACM SIGACT, May 1981.
  10. A. C. McKellar and E. G. Coffman. The organization of matrices and matrix operations in a paged multiprogramming environment. CACM, 12(3):153-165, 1969.
  11. A. Porterfield. Software Methods for Improvement of Cache Performance on Supercomputer Applications. PhD thesis, Rice University, May 1989.
  12. M. E. Wolf and M. S. Lam. A data locality optimizing algorithm. Submitted for publication, 1990.


        • Published in

          ACM SIGPLAN Notices, Volume 26, Issue 4
          Special issue of a journal and a proceedings
          Apr. 1991
          305 pages
          ISSN:0362-1340
          EISSN:1558-1160
          DOI:10.1145/106973
          • ACM Conferences
            ASPLOS IV: Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
            April 1991
            320 pages
            ISBN:0897913809
            DOI:10.1145/106972

          Copyright © 1991 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 April 1991


          Qualifiers

          • article
