Skip to main content

2018 | OriginalPaper | Buchkapitel

A Comparison of ARM Against x86 for Distributed Machine Learning Workloads

verfasst von : Sebastian Kmiec, Jonathon Wong, Hans-Arno Jacobsen, Da Qi Ren

Erschienen in: Performance Evaluation and Benchmarking for the Analytics Era

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The rise of Machine Learning (ML) in the last decade has created an unprecedented surge in demand for new and more powerful hardware. Various hardware approaches exist to take on these large demands motivating the need for hardware performance benchmarks to compare these diverse hardware systems. In this paper, we present a comprehensive analysis and comparison of available benchmark suites in the field of ML and related fields. The analysis of these benchmarks is used to discuss the potential of ARM processors within the context of ML deployments. Our paper concludes with a brief hardware performance comparison of modern, server-grade ARM and x86 processors using a benchmark suite selected from our survey.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
4.
Zurück zum Zitat Barroso, L.A., Hölzle, U.: The case for energy-proportional computing. Computer 40(12), 33–37 (2007)CrossRef Barroso, L.A., Hölzle, U.: The case for energy-proportional computing. Computer 40(12), 33–37 (2007)CrossRef
5.
Zurück zum Zitat Bienia, C.: Benchmarking modern multiprocessors. Ph.D. thesis, Princeton University, Princeton (2011). aAI3445564 Bienia, C.: Benchmarking modern multiprocessors. Ph.D. thesis, Princeton University, Princeton (2011). aAI3445564
6.
Zurück zum Zitat Blem, E., Menon, J., Sankaralingam, K.: Power struggles: revisiting the RISC vs. CISC debate on contemporary ARM and x86 architectures. In: 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA), pp. 1–12, February 2013 Blem, E., Menon, J., Sankaralingam, K.: Power struggles: revisiting the RISC vs. CISC debate on contemporary ARM and x86 architectures. In: 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA), pp. 1–12, February 2013
8.
Zurück zum Zitat Che, S., Sheaffer, J.W., Boyer, M., Szafaryn, L.G., Wang, L., Skadron, K.: A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads. In: Proceedings of the IEEE International Symposium on Workload Characterization (IISWC 2010), pp. 1–11. IEEE Computer Society, Washington, DC (2010). http://dx.doi.org/10.1109/IISWC.2010.5650274 Che, S., Sheaffer, J.W., Boyer, M., Szafaryn, L.G., Wang, L., Skadron, K.: A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads. In: Proceedings of the IEEE International Symposium on Workload Characterization (IISWC 2010), pp. 1–11. IEEE Computer Society, Washington, DC (2010). http://​dx.​doi.​org/​10.​1109/​IISWC.​2010.​5650274
9.
Zurück zum Zitat Danalis, A., Marin, G., McCurdy, C., Meredith, J.S., Roth, P.C., Spafford, K., Tipparaju, V., Vetter, J.S.: The scalable heterogeneous computing (SHOC) benchmark suite. In: Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units (GPGPU-3), pp. 63–74. ACM, New York (2010). http://doi.acm.org/10.1145/1735688.1735702 Danalis, A., Marin, G., McCurdy, C., Meredith, J.S., Roth, P.C., Spafford, K., Tipparaju, V., Vetter, J.S.: The scalable heterogeneous computing (SHOC) benchmark suite. In: Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units (GPGPU-3), pp. 63–74. ACM, New York (2010). http://​doi.​acm.​org/​10.​1145/​1735688.​1735702
13.
Zurück zum Zitat George, A.D.: An overview of RISC vs. CISC. In: Proceedings of the Twenty-Second Southeastern Symposium on System Theory, pp. 436–438, March 1990 George, A.D.: An overview of RISC vs. CISC. In: Proceedings of the Twenty-Second Southeastern Symposium on System Theory, pp. 436–438, March 1990
14.
Zurück zum Zitat Hauswald, J., Kang, Y., Laurenzano, M.A., Chen, Q., Li, C., Mudge, T., Dreslinski, R.G., Mars, J., Tang, L.: Djinn and Tonic: DNN as a service and its implications for future warehouse scale computers. In: Proceedings of the 42nd Annual International Symposium on Computer Architecture (ISCA 2015), pp. 27–40. ACM, New York (2015). http://doi.acm.org/10.1145/2749469.2749472 Hauswald, J., Kang, Y., Laurenzano, M.A., Chen, Q., Li, C., Mudge, T., Dreslinski, R.G., Mars, J., Tang, L.: Djinn and Tonic: DNN as a service and its implications for future warehouse scale computers. In: Proceedings of the 42nd Annual International Symposium on Computer Architecture (ISCA 2015), pp. 27–40. ACM, New York (2015). http://​doi.​acm.​org/​10.​1145/​2749469.​2749472
15.
Zurück zum Zitat Huang, S., Huang, J., Dai, J., Xie, T., Huang, B.: The HiBench benchmark suite: characterization of the MapReduce-based data analysis. In: 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW), pp. 41–51. IEEE (2010) Huang, S., Huang, J., Dai, J., Xie, T., Huang, B.: The HiBench benchmark suite: characterization of the MapReduce-based data analysis. In: 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW), pp. 41–51. IEEE (2010)
17.
Zurück zum Zitat Jundt, A., Cauble-Chantrenne, A., Tiwari, A., Peraza, J., Laurenzano, M.A., Carrington, L.: Compute bottlenecks on the new 64-bit ARM. In: Proceedings of the 3rd International Workshop on Energy Efficient Supercomputing (E2SC 2015), pp. 6:1–6:7. ACM, New York (2015). http://doi.acm.org/10.1145/2834800.2834806 Jundt, A., Cauble-Chantrenne, A., Tiwari, A., Peraza, J., Laurenzano, M.A., Carrington, L.: Compute bottlenecks on the new 64-bit ARM. In: Proceedings of the 3rd International Workshop on Energy Efficient Supercomputing (E2SC 2015), pp. 6:1–6:7. ACM, New York (2015). http://​doi.​acm.​org/​10.​1145/​2834800.​2834806
19.
Zurück zum Zitat Laurenzano, M.A., Tiwari, A., Jundt, A., Peraza, J., Ward, W.A., Campbell, R., Carrington, L.: Characterizing the performance-energy tradeoff of small ARM cores in HPC computation. In: Silva, F., Dutra, I., Santos Costa, V. (eds.) Euro-Par 2014. LNCS, vol. 8632, pp. 124–137. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-09873-9_11 Laurenzano, M.A., Tiwari, A., Jundt, A., Peraza, J., Ward, W.A., Campbell, R., Carrington, L.: Characterizing the performance-energy tradeoff of small ARM cores in HPC computation. In: Silva, F., Dutra, I., Santos Costa, V. (eds.) Euro-Par 2014. LNCS, vol. 8632, pp. 124–137. Springer, Cham (2014). https://​doi.​org/​10.​1007/​978-3-319-09873-9_​11
20.
Zurück zum Zitat Li, M., Tan, J., Wang, Y., Zhang, L., Salapura, V.: SparkBench: A comprehensive benchmarking suite for in memory data analytic platform Spark. In: Proceedings of the 12th ACM International Conference on Computing Frontiers (CF 2015), pp. 53:1–53:8. ACM, New York (2015). http://doi.acm.org/10.1145/2742854.2747283 Li, M., Tan, J., Wang, Y., Zhang, L., Salapura, V.: SparkBench: A comprehensive benchmarking suite for in memory data analytic platform Spark. In: Proceedings of the 12th ACM International Conference on Computing Frontiers (CF 2015), pp. 53:1–53:8. ACM, New York (2015). http://​doi.​acm.​org/​10.​1145/​2742854.​2747283
22.
Zurück zum Zitat Saponara, S., Fanucci, L., Coppola, M.: Many-core platform with NoC interconnect for low cost and energy sustainable cloud server-on-chip. In: 2012 Sustainable Internet and ICT for Sustainability (SustainIT), pp. 1–5, October 2012 Saponara, S., Fanucci, L., Coppola, M.: Many-core platform with NoC interconnect for low cost and energy sustainable cloud server-on-chip. In: 2012 Sustainable Internet and ICT for Sustainability (SustainIT), pp. 1–5, October 2012
23.
Zurück zum Zitat Svanfeldt-Winter, O., Lafond, S., Lilius, J.: Cost and energy reduction evaluation for ARM based web servers. In: Proceedings of the 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing (DASC 2011), pp. 480–487. IEEE Computer Society, Washington, DC (2011). http://dx.doi.org/10.1109/DASC.2011.93 Svanfeldt-Winter, O., Lafond, S., Lilius, J.: Cost and energy reduction evaluation for ARM based web servers. In: Proceedings of the 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing (DASC 2011), pp. 480–487. IEEE Computer Society, Washington, DC (2011). http://​dx.​doi.​org/​10.​1109/​DASC.​2011.​93
24.
Zurück zum Zitat Venkata, S.K., Ahn, I., Jeon, D., Gupta, A., Louie, C., Garcia, S., Belongie, S., Taylor, M.B.: SD-VBS: The San Diego vision benchmark suite. In: Proceedings of the 2009 IEEE International Symposium on Workload Characterization (IISWC 2009), pp. 55–64. IEEE Computer Society, Washington, DC (2009). http://dx.doi.org/10.1109/IISWC.2009.5306794 Venkata, S.K., Ahn, I., Jeon, D., Gupta, A., Louie, C., Garcia, S., Belongie, S., Taylor, M.B.: SD-VBS: The San Diego vision benchmark suite. In: Proceedings of the 2009 IEEE International Symposium on Workload Characterization (IISWC 2009), pp. 55–64. IEEE Computer Society, Washington, DC (2009). http://​dx.​doi.​org/​10.​1109/​IISWC.​2009.​5306794
25.
Zurück zum Zitat Wang, L., Zhan, J., Luo, C., Zhu, Y., Yang, Q., He, Y., Gao, W., Jia, Z., Shi, Y., Zhang, S., Zheng, C., Lu, G., Zhan, K., Li, X., Qiu, B.: BigDataBench: a Big Data benchmark suite from internet services. CoRR abs/1401.1406 (2014). http://arxiv.org/abs/1401.1406 Wang, L., Zhan, J., Luo, C., Zhu, Y., Yang, Q., He, Y., Gao, W., Jia, Z., Shi, Y., Zhang, S., Zheng, C., Lu, G., Zhan, K., Li, X., Qiu, B.: BigDataBench: a Big Data benchmark suite from internet services. CoRR abs/1401.1406 (2014). http://​arxiv.​org/​abs/​1401.​1406
Metadaten
Titel
A Comparison of ARM Against x86 for Distributed Machine Learning Workloads
verfasst von
Sebastian Kmiec
Jonathon Wong
Hans-Arno Jacobsen
Da Qi Ren
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-72401-0_12

Neuer Inhalt