Abstract
Trace-driven memory simulation tools such as MetaSim Tracer [1] capture the address stream of an application during an instrumented program run. Various statistics can be measured using the in-flight address stream including anticipated cache hit rates. This paper reports on performance improvements of MetaSim Tracer gained by using techniques developed in SimPoint [2]. Concurrent research [3] addresses techniques that can be used to reduce the instrumentation overhead involved in memory tracing, and this work addresses a technique that can be used to decrease the amount of cache simulation that is required on top of this. The result is a tool for trace driven cache simulation that is practical to use for memory performance studies of full sized scientific applications.
- Snavely, A., Wolter, N. and Carrington, L. Modeling Application Performance by Convolving Machine Signatures with Application Profiles. IEEE 4th Workshop on Workload Characterization, December 2001.]] Google ScholarDigital Library
- Sherwood, T., Perelman, E., Hamerly, G. and Calder, B. Automatically Characterizing Large Scale Program Behavior. International Conference on Architectural Support for Programming Languages and Operating Systems, October 2002.]] Google ScholarDigital Library
- Gao, X., Simon, B. and Snavely, A. ALITER: An Asynchronous Lightweight Instrumentation Tool for Event Recording. Workshop on Binary Instrumentation and Applications, September 2005.]]Google ScholarDigital Library
- Carrington, L., Snavely, A., Gao, X. and Wolter, N. A Performance Prediction Framework for Scientific Applications. Workshop on Performance Modeling, June 2003.]]Google ScholarCross Ref
- Srivastava, A. and Eustace, A. ATOM: A Flexible Interface for Building High Performance Program Analysis Tools. USENIX Winter Conference, January 1995.]] Google ScholarDigital Library
- Gao, X. and Snavely, A. Exploiting Stability to Reduce Time-Space Cost for Memory Tracing. Workshop on Performance Modeling - ICCS, June 2003.]]Google ScholarCross Ref
- Lemieux User Information Page. http://www.psc.edu/machines/tcs/lemieux.html, July 2004.]]Google Scholar
- Bailey, D. The NAS Parallel Benchmarks. Intl. Journal of Supercomputer Applications, vol. 5, no. 3, Fall 1991.]]Google Scholar
- Snavely, A., Carrington, L., Wolter, N., Labarta, J., Badia, R. and Purkayastha, A. A Framework for Application Performance Modeling and Prediction. Supercomputing, November 2002.]] Google ScholarDigital Library
- Ramamurti, R. and Lohner, R. A Parallel Implicit Incompressible Flow Solver Using Unstructured Meshes. Computers and Fluids, Vol. 25, No. 2, pp. 119--132, 1996.]]Google Scholar
- Pin User Manual. http://rogue.colorado.edu/Pin/, November 2003.]]Google Scholar
- Buck, B. and Hollingsworth, J. An API for Runtime Code Patching. Journal of High Performance Computing Applications 14 (4), Winter 2000.]] Google ScholarDigital Library
Index Terms
- Low cost trace-driven memory simulation using SimPoint
Recommendations
Synthetic Trace-Driven Simulation of Cache Memory
AINAW '07: Proceedings of the 21st International Conference on Advanced Information Networking and Applications Workshops - Volume 01The widening gap between CPU and memory speed has made caches an integral feature of modern high-performance processors. The high degree of configurability of cache memory can require extensive design space exploration and is generally performed using ...
Trace-driven memory simulation: a survey
As the gap between processor and memory speeds continues to widen, methods for evaluating memory system designs before they are implemented in hardware are becoming increasingly important. One such method, trace-driven memory simulation, has been the ...
Comments