2005 | Original Paper | Book chapter
A Data Transformations Based Approach for Optimizing Memory and Cache Locality on Distributed Memory Multiprocessors
Authors: Xia Jun, Xue-Jun Yang
Published in: Advanced Parallel Processing Technologies
Publisher: Springer Berlin Heidelberg
Data locality is one of the key factors affecting the performance of parallel programs running on distributed memory multiprocessors. This paper presents an approach for optimizing the memory locality and cache locality of perfect or non-perfect loop nests on distributed memory multiprocessors using linear data transformations. The approach optimizes memory locality with the data space fusion technique and cache locality with the projection-delamination technique, and combines the two techniques to keep the overheads of both remote and local memory accesses as low as possible. We conduct experiments with nine programs, and the results show that the approach is effective in optimizing memory locality and cache locality simultaneously.
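As a generic illustration of what a linear data transformation does, the following minimal sketch remaps array indices with a matrix and compares the memory stride of an inner loop before and after. It is not the paper's data space fusion or projection-delamination technique. The helper names (`linear_offset`, `transform`, `inner_stride`), the square N x N array, the row-major layout, and the access pattern `A[j][i]` are all assumptions chosen for this example.

```python
def linear_offset(index, shape):
    """Row-major memory offset of a multi-dimensional index."""
    offset = 0
    for i, n in zip(index, shape):
        offset = offset * n + i
    return offset


def transform(index, matrix):
    """Apply a linear data transformation: new_index = matrix * index."""
    return tuple(sum(m * x for m, x in zip(row, index)) for row in matrix)


def inner_stride(access, matrix, shape):
    """Offset difference between two consecutive inner-loop iterations."""
    first = linear_offset(transform(access(0, 0), matrix), shape)
    second = linear_offset(transform(access(0, 1), matrix), shape)
    return second - first


N = 4                      # hypothetical array extent (array is N x N)
IDENTITY = [[1, 0], [0, 1]]  # original layout
SWAP = [[0, 1], [1, 0]]      # transformed layout: the two dimensions are exchanged


def access(i, j):
    """Assumed loop body: reads A[j][i], with j as the inner loop index."""
    return (j, i)


stride_before = inner_stride(access, IDENTITY, (N, N))
stride_after = inner_stride(access, SWAP, (N, N))
print(stride_before, stride_after)
```

Under these assumptions the inner loop jumps a full row (N elements) per iteration in the original layout and moves to the adjacent element after the transformation, which is the kind of spatial locality improvement that data transformations aim for. The loop nest itself is left unchanged; only the array layout is remapped.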