Skip to main content
Top

2017 | OriginalPaper | Chapter

Runtime-Assisted Shared Cache Insertion Policies Based on Re-reference Intervals

Authors : Vladimir Dimić, Miquel Moretó, Marc Casas, Mateo Valero

Published in: Euro-Par 2017: Parallel Processing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Processor speed is improving at a faster rate than the speed of main memory, which makes memory accesses increasingly expensive. One way to solve this problem is to reduce miss ratio of the processor’s last level cache by improving its replacement policy. We approach the problem by co-designing the runtime system and hardware and exploiting the semantics of the applications written in data-flow task-based programming models to provide hardware with information about the task types and task data-dependencies. We propose the Task-Type aware Insertion Policy, TTIP, which uses the runtime system to dynamically determine the best probability per task type for bimodal insertion in the recency stack and the static Dependency-Type aware Insertion Policy, DTIP, that inserts cache lines in the optimal position taking into account the dependency types of the current task. TTIP and DTIP perform similarly or better than state-of-the-art replacement policies, while requiring less hardware.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Belady, L.A.: A study of replacement algorithms for a virtual-storage computer. IBM Syst. J. 5, 78–101 (1966)CrossRef Belady, L.A.: A study of replacement algorithms for a virtual-storage computer. IBM Syst. J. 5, 78–101 (1966)CrossRef
2.
go back to reference Bienia, C.: Benchmarking modern multiprocessors. Ph.D. thesis, Princeton (2011) Bienia, C.: Benchmarking modern multiprocessors. Ph.D. thesis, Princeton (2011)
3.
go back to reference Blumofe, R., Joerg, C., Kuszmaul, B., et al.: Cilk: an efficient multithreaded runtime system. J. Parallel Distrib. Comput. 37, 55–69 (1995)CrossRef Blumofe, R., Joerg, C., Kuszmaul, B., et al.: Cilk: an efficient multithreaded runtime system. J. Parallel Distrib. Comput. 37, 55–69 (1995)CrossRef
5.
go back to reference Chamberlain, B., Callahan, D., Zima, H.: Parallel programmability and the Chapel language. Int. J. High Perform. Comput. Appl. 21, 291–312 (2007–2008) Chamberlain, B., Callahan, D., Zima, H.: Parallel programmability and the Chapel language. Int. J. High Perform. Comput. Appl. 21, 291–312 (2007–2008)
6.
go back to reference Chasapis, D., Casas, M., Moretó, M., Vidal, R., Ayguadé, E., Labarta, J., Valero, M.: PARSECSs: evaluating the impact of task parallelism in the PARSEC benchmark suite. In: TACO (2015) Chasapis, D., Casas, M., Moretó, M., Vidal, R., Ayguadé, E., Labarta, J., Valero, M.: PARSECSs: evaluating the impact of task parallelism in the PARSEC benchmark suite. In: TACO (2015)
7.
go back to reference Chen, W., Liu, P., Stelzer, K.: Implementation of a pseudo-LRU algorithm in a partitioned cache, US Patent 7,069,390 (2006) Chen, W., Liu, P., Stelzer, K.: Implementation of a pseudo-LRU algorithm in a partitioned cache, US Patent 7,069,390 (2006)
8.
go back to reference Davis, T., Hu, Y.: The University of Florida sparse matrix collection. ACM Trans. Math. Softw. 38(1), 1 (2011)MathSciNetMATH Davis, T., Hu, Y.: The University of Florida sparse matrix collection. ACM Trans. Math. Softw. 38(1), 1 (2011)MathSciNetMATH
9.
go back to reference Jaleel, A., Theobald, K.B., Steely Jr., S.C., Emer, J.: High performance cache replacement using re-reference interval prediction (RRIP). SIGARCH Comput. Arch. News 38, 60–71 (2010)CrossRef Jaleel, A., Theobald, K.B., Steely Jr., S.C., Emer, J.: High performance cache replacement using re-reference interval prediction (RRIP). SIGARCH Comput. Arch. News 38, 60–71 (2010)CrossRef
10.
go back to reference Jaulmes, L., Casas, M., Moretó, M., et al.: Exploiting asynchrony from exact forward recovery for due in iterative solvers. In: SC (2015) Jaulmes, L., Casas, M., Moretó, M., et al.: Exploiting asynchrony from exact forward recovery for due in iterative solvers. In: SC (2015)
11.
go back to reference Kale, L.V., Krishnan, S.: CHARM++: a portable concurrent object oriented system based on C++. In: OOPSLA (1993) Kale, L.V., Krishnan, S.: CHARM++: a portable concurrent object oriented system based on C++. In: OOPSLA (1993)
12.
go back to reference Manivannan, M., Papaefstathiou, V., Pericas, M., Stenstrom, P.: RADAR: runtime-assisted dead region management for last-level caches. In: HPCA (2016) Manivannan, M., Papaefstathiou, V., Pericas, M., Stenstrom, P.: RADAR: runtime-assisted dead region management for last-level caches. In: HPCA (2016)
13.
go back to reference Manivannan, M., Stenstrom, P.: Runtime-guided cache coherence optimizations in multi-core architectures. In: IPDPS (2014) Manivannan, M., Stenstrom, P.: Runtime-guided cache coherence optimizations in multi-core architectures. In: IPDPS (2014)
14.
go back to reference Sun Microsystems: UltraSPARC T2 supplement to the UltraSPARC architecture 2007, draft D1.4.3 (2007) Sun Microsystems: UltraSPARC T2 supplement to the UltraSPARC architecture 2007, draft D1.4.3 (2007)
15.
go back to reference OpenMP Arch. Rev. Board: OpenMP Application Program Interface, v4.0 (2013) OpenMP Arch. Rev. Board: OpenMP Application Program Interface, v4.0 (2013)
16.
go back to reference Pan, A., Pai, V.S.: Runtime-driven shared last-level cache management for task-parallel programs. In: SC (2015) Pan, A., Pai, V.S.: Runtime-driven shared last-level cache management for task-parallel programs. In: SC (2015)
17.
go back to reference Papaefstathiou, V., Katevenis, M.G., Nikolopoulos, D.S., Pnevmatikatos, D.: Prefetching and cache management using task lifetimes. In: ICS (2013) Papaefstathiou, V., Katevenis, M.G., Nikolopoulos, D.S., Pnevmatikatos, D.: Prefetching and cache management using task lifetimes. In: ICS (2013)
18.
go back to reference Qureshi, M., Jaleel, A., Patt, Y., Steely, S., Emer, J.: Set-dueling-controlled adaptive insertion for high-performance caching. In: Micro. IEEE (2008) Qureshi, M., Jaleel, A., Patt, Y., Steely, S., Emer, J.: Set-dueling-controlled adaptive insertion for high-performance caching. In: Micro. IEEE (2008)
19.
go back to reference Qureshi, M.K., Jaleel, A., Patt, Y.N., Steely, S.C., Emer, J.: Adaptive insertion policies for high performance caching. In: ISCA (2007) Qureshi, M.K., Jaleel, A., Patt, Y.N., Steely, S.C., Emer, J.: Adaptive insertion policies for high performance caching. In: ISCA (2007)
20.
go back to reference Qureshi, M.K., Lynch, D.N., Mutlu, O., Patt, Y.N.: A case for MLP-aware cache replacement. In: ISCA (2006) Qureshi, M.K., Lynch, D.N., Mutlu, O., Patt, Y.N.: A case for MLP-aware cache replacement. In: ISCA (2006)
21.
go back to reference Reinders, J.: Intel Threading Building Blocks. First edn. (2007) Reinders, J.: Intel Threading Building Blocks. First edn. (2007)
22.
go back to reference Rico, A., Duran, A., Cabarcas, F., Etsion, Y., Ramirez, A., Valero, M.: Trace-driven simulation of multithreaded applications. In: ISPASS (2011) Rico, A., Duran, A., Cabarcas, F., Etsion, Y., Ramirez, A., Valero, M.: Trace-driven simulation of multithreaded applications. In: ISPASS (2011)
23.
go back to reference Rico, A., Cabarcas, F., Villavieja, C., et al.: On the simulation of large-scale architectures using multiple application abstraction levels. In: TACO (2012) Rico, A., Cabarcas, F., Villavieja, C., et al.: On the simulation of large-scale architectures using multiple application abstraction levels. In: TACO (2012)
24.
go back to reference Shewchuk, J.R.: An introduction to the conjugate gradient method without the agonizing pain. Technical report (1994) Shewchuk, J.R.: An introduction to the conjugate gradient method without the agonizing pain. Technical report (1994)
25.
go back to reference Teruel, X.: OmpSs quick overview, a practical approach (2013) Teruel, X.: OmpSs quick overview, a practical approach (2013)
26.
go back to reference Valero, M., Moreto, M., Casas, M., Ayguade, E., Labarta, J.: Runtime-aware architectures: a first approach. Supercomp. Front. Innov. 1, 29–44 (2014) Valero, M., Moreto, M., Casas, M., Ayguade, E., Labarta, J.: Runtime-aware architectures: a first approach. Supercomp. Front. Innov. 1, 29–44 (2014)
27.
go back to reference Wu, C.J., Jaleel, A., Hasenplaugh, W., et al.: SHiP: signature-based hit predictor for high performance caching. In: MICRO (2011) Wu, C.J., Jaleel, A., Hasenplaugh, W., et al.: SHiP: signature-based hit predictor for high performance caching. In: MICRO (2011)
28.
go back to reference Wulf, W.A., McKee, S.A.: Hitting the memory wall: implications of the obvious. SIGARCH Comput. Arch. News 23, 20–24 (1995)CrossRef Wulf, W.A., McKee, S.A.: Hitting the memory wall: implications of the obvious. SIGARCH Comput. Arch. News 23, 20–24 (1995)CrossRef
Metadata
Title
Runtime-Assisted Shared Cache Insertion Policies Based on Re-reference Intervals
Authors
Vladimir Dimić
Miquel Moretó
Marc Casas
Mateo Valero
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-64203-1_18

Premium Partner