skip to main content
10.1145/207110.207162acmconferencesArticle/Chapter ViewAbstractPublication PagespldiConference Proceedingsconference-collections
Article
Free Access

Tile size selection using cache organization and data layout

Published:01 June 1995Publication History

ABSTRACT

When dense matrix computations are too large to fit in cache, previous research proposes tiling to reduce or eliminate capacity misses. This paper presents a new algorithm for choosing problem-size dependent tile sizes based on the cache size and cache line size for a direct-mapped cache. The algorithm eliminates both capacity and self-interference misses and reduces cross-interference misses. We measured simulated miss rates and execution times for our algorithm and two others on a variety of problem sizes and cache organizations. At higher set associativity, our algorithm does not always achieve the best performance. However on direct-mapped caches, our algorithm improves simulated miss rates and measured execution times when compared with previous work.

References

  1. BJWE92.F. Bodin, W. Jalby, D. Windheiser, and C. Eisenbeis. A quantitative algorithm for data locality optimzation. In Code Generation-Concepts, Tools, Techniques. Springer-Verlag, 1992.Google ScholarGoogle Scholar
  2. CK92.S. Carr and K. Kennedy. Compiler blockability of numerical algorithms. In Proceedings of Supercomputing '92, Minneapolis, MN, November 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. CL95.S. Carr and R. B. Lehoucq, A compiler-blockable algorithm for QR decomposition. In Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing, San Francisco, CA, February 1995.Google ScholarGoogle Scholar
  4. CMT94.S. Carr, K. S. MCKinley, and C, Tseng, Compiler optimizations for improving data locahty. In Proceedings of the Sixth hzternationaI Conference on Architectural Support for Programming Languages and Operating Systems, San Jose, CA, October 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Col94.S. Coleman. Selecting tile sizes based on cache and data organization. Master's thes~s, Dept. of Computer Science, University of Massachusetts, Amherst, September 1994.Google ScholarGoogle Scholar
  6. Ess93.K. Esseghir. Improving data locality for caches. Master's thesis, Dept. of Computer Science, Rice University, September 1993.Google ScholarGoogle Scholar
  7. GJG88.D. Gannon, W. Jalby, and K. Gallivan. Strategies for cache and local memory management by global program transformation. Journal of Parallel and Dis* tributed Computing, 5(5):587-616, October 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. IT88.F. Irigoin and R. Triolet. Supernode partitioning. In Proceedings of the Fifteenth Annual A CM Symposium on the Principles of Programming Languages, San Diego, CA, January 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Kob87.Neal Koblitz. Graduate Texts in Mathematics. Springer-Verlag, New York, 1987.Google ScholarGoogle Scholar
  10. LRW91.M. Lam, E. Rothberg, and M. E. Wolf. The cache performance and opt~mizations of blocked algorithms. In Proceedings of the Fourth bzternational Conference on Architectural Support for Programming Languages and Operating Systems, Santa Clara, CA, April 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. NJL94.J.J. Navarro, T Juan, and T Lang. Mob forms: A class of multilevel block algorithms for dense linear algebra operations. In Proceedings of the !994 A CM International Conference on Supercomputing, pages 354-363, Manchester, England, June 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Smi82.A.J. Smith. Cache memories. Computing Surveys, 14(3):473-530, September 1982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. TGJ93.O. Temam, E. Granston, and W. Jalby. To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts. In Proceedings of Supercomputing '93, Portland, OR, November 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. WL91.M.E. Wolf and M. Lam. A data locality optimizing algorithm. In Proceedings of the SIGPLAN '91 Conference on Programming Language Design and implementation, Toronto, Canada, June 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Wol89.M.J. Wolfe. More iteration space tiling. In Proceedings of Supercomputing '89, pages 655-664, Reno, NV, November 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Tile size selection using cache organization and data layout

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        PLDI '95: Proceedings of the ACM SIGPLAN 1995 conference on Programming language design and implementation
        June 1995
        335 pages
        ISBN:0897916972
        DOI:10.1145/207110

        Copyright © 1995 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 June 1995

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        PLDI '95 Paper Acceptance Rate28of105submissions,27%Overall Acceptance Rate406of2,067submissions,20%

        Upcoming Conference

        PLDI '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader