Skip to main content
Top

2018 | OriginalPaper | Chapter

A Post-link Prefetching Based on Event Sampling

Authors : Hongmei Wei, Fei Wang, Zhongsheng Li

Published in: Advanced Computer Architecture

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Data prefetching is an effective approach to improve performance by hiding long memory latency. Existing profiling feedback optimizations can do well in pointer-based linked data structure prefetching. However, these optimizations, which instrument and optimize source code during compiling or post link, usually incur tremendous overhead at profiling stage. Furthermore, it is a mission impossible for these methods to do optimization without source code. This work designs and implements an Event Sampling based Prefetching Optimizer, which is a post-link prefetching based on hardware performance counters event sampling. Evaluation on SW26010 processor shows that with the proposed prefetching approach, 9 out of 29 programs of SPEC2006 can be speeded up by about 4.3% on average with only less than 10% sampling overhead on average.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Manikantan, R.: Performance oriented prefetching enhancements using commit stalls. J. Instr. Level Parallelism 13, 1–28 (2011) Manikantan, R.: Performance oriented prefetching enhancements using commit stalls. J. Instr. Level Parallelism 13, 1–28 (2011)
2.
go back to reference Mowry, T.C.: Tolerating latency through software-controlled data prefetching, Ph.D. thesis. Stanford University, March 1994 Mowry, T.C.: Tolerating latency through software-controlled data prefetching, Ph.D. thesis. Stanford University, March 1994
3.
go back to reference Bernstein, D., Cohen, D., Freund, A., Maydan, D.E.: Compiler techniques for data prefetching on the PowerPC. In: Proceedings of the 1995 International Conference on Parallel Architectures and Compilation Techniques, June 1995 Bernstein, D., Cohen, D., Freund, A., Maydan, D.E.: Compiler techniques for data prefetching on the PowerPC. In: Proceedings of the 1995 International Conference on Parallel Architectures and Compilation Techniques, June 1995
4.
go back to reference Chilimbi, T.M., Hirzel, M.: Dynamic hot data stream prefetching for general-purposes programs. In: Proceedings of the 2002 ACM SIGPLAN Conference on Programming Language Design and Implementation, June 2002 Chilimbi, T.M., Hirzel, M.: Dynamic hot data stream prefetching for general-purposes programs. In: Proceedings of the 2002 ACM SIGPLAN Conference on Programming Language Design and Implementation, June 2002
6.
go back to reference Luk, C.-K., Muth, R., Patil, H., Lowney, P.G., Cohn, R., Weiss, R.: Profile-guided post-link stride prefetching. In: Proceedings of 2002 International Conference on Supercomputing, pp. 167–178, June 2002 Luk, C.-K., Muth, R., Patil, H., Lowney, P.G., Cohn, R., Weiss, R.: Profile-guided post-link stride prefetching. In: Proceedings of 2002 International Conference on Supercomputing, pp. 167–178, June 2002
7.
go back to reference Zou, Q., Li, X.F., Zhang, L.B.: Runtime engine for dynamic profile guided stride prefetching. J. Comput. Sci. Technol. 23(4), 633–643 (2008) Zou, Q., Li, X.F., Zhang, L.B.: Runtime engine for dynamic profile guided stride prefetching. J. Comput. Sci. Technol. 23(4), 633–643 (2008)
8.
go back to reference Adl-Tabatabai, A.R., Hudson, R.L., Serrano, M.J., Subramoney, S.: Prefetch injection based on hardware monitoring and objects metadata. In: Proceedings of the ACM SIGPLAN 2004 Conference on Programming Language Design and Implementation (2004) Adl-Tabatabai, A.R., Hudson, R.L., Serrano, M.J., Subramoney, S.: Prefetch injection based on hardware monitoring and objects metadata. In: Proceedings of the ACM SIGPLAN 2004 Conference on Programming Language Design and Implementation (2004)
9.
go back to reference Lu, J., Chen, H., Yew, P.-C., Hsu, w.-C.: Design and implementation of a lightweight dynamic optimization system. J. Instr. Level Parallelism 6, 1–24 (2004) Lu, J., Chen, H., Yew, P.-C., Hsu, w.-C.: Design and implementation of a lightweight dynamic optimization system. J. Instr. Level Parallelism 6, 1–24 (2004)
10.
go back to reference Beyler, J.C., Clavss, P.: Performance driven data cache prefetching in a dynamic software optimization system. In: Proceedings of the 36th International Conference on Supercomputing, pp. 202–209 (2007) Beyler, J.C., Clavss, P.: Performance driven data cache prefetching in a dynamic software optimization system. In: Proceedings of the 36th International Conference on Supercomputing, pp. 202–209 (2007)
11.
go back to reference Luk, C.-K., Muth, R., Patil, H., Cohn, R., Lowney, G.: Ispike: a post-link optimizer for the intel itanium architecture. In: Proceedings of the International Symposium on Code Generation and Optimization (2004) Luk, C.-K., Muth, R., Patil, H., Cohn, R., Lowney, G.: Ispike: a post-link optimizer for the intel itanium architecture. In: Proceedings of the International Symposium on Code Generation and Optimization (2004)
12.
go back to reference Collins, J., et al.: Speculative precomputation: long-range prefetching of delinquent loads. In: Proceedings of the International Symposium on Computer Architecture, July 2001 Collins, J., et al.: Speculative precomputation: long-range prefetching of delinquent loads. In: Proceedings of the International Symposium on Computer Architecture, July 2001
13.
go back to reference Kamruzzaman, Md., Swanson, S., Tullsen, D.M.: Inter-core prefetching for multicore processors using migrating helper threads. In: ASPLOS 2011, 5–11 March 2011 Kamruzzaman, Md., Swanson, S., Tullsen, D.M.: Inter-core prefetching for multicore processors using migrating helper threads. In: ASPLOS 2011, 5–11 March 2011
14.
go back to reference Mehta, S., Fang, Z., Zhai, A., Yew, P.-C.: Multi-stage coordinated prefetching for present-day processors. In: ICS 2014, pp. 73–82 (2014) Mehta, S., Fang, Z., Zhai, A., Yew, P.-C.: Multi-stage coordinated prefetching for present-day processors. In: ICS 2014, pp. 73–82 (2014)
15.
go back to reference Weifeng, Z., Calder, B., Tullsen, D.M.: A self-repairing prefetcher in an event-driven dynamic optimization framework. In: Proceedings of the International Symposium on Code Generation and Optimization, pp. 50–64. IEEE Computer Society (2006) Weifeng, Z., Calder, B., Tullsen, D.M.: A self-repairing prefetcher in an event-driven dynamic optimization framework. In: Proceedings of the International Symposium on Code Generation and Optimization, pp. 50–64. IEEE Computer Society (2006)
16.
go back to reference Qi, F.B., Wang, F., Li, Z.S.: Feedback directed prefetching optimization for linked data structure. J. Softw. 20(Suppl.), 34 − 39 2009. (in Chinese) Qi, F.B., Wang, F., Li, Z.S.: Feedback directed prefetching optimization for linked data structure. J. Softw. 20(Suppl.), 34 − 39 2009. (in Chinese)
17.
go back to reference Wang, F., Wei, H.M., Qi, F.B.: Prefetching optimization based on profiling compilation. High Perform. Comput. Technol. 186 (2007). (in Chinese) Wang, F., Wei, H.M., Qi, F.B.: Prefetching optimization based on profiling compilation. High Perform. Comput. Technol. 186 (2007). (in Chinese)
18.
go back to reference Zou, Q., Wu, M., Hu, W.W., Zhang, L.B.: An instrument-analysis framework for adaptive prefetch optimization in JVM. J. Softw. 19(7), 1581–1589 (2008). (in Chinese)CrossRef Zou, Q., Wu, M., Hu, W.W., Zhang, L.B.: An instrument-analysis framework for adaptive prefetch optimization in JVM. J. Softw. 19(7), 1581–1589 (2008). (in Chinese)CrossRef
19.
go back to reference Fu, H., Liao, J., Yang, J., et al.: The sunway taihulight supercomputer: system and applications. Sci. China Inf. Sci. 59(7) (2016) Fu, H., Liao, J., Yang, J., et al.: The sunway taihulight supercomputer: system and applications. Sci. China Inf. Sci. 59(7) (2016)
Metadata
Title
A Post-link Prefetching Based on Event Sampling
Authors
Hongmei Wei
Fei Wang
Zhongsheng Li
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-13-2423-9_5