Skip to main content

2019 | OriginalPaper | Buchkapitel

A Near-Data Processing Server Architecture and Its Impact on Data Center Applications

verfasst von : Xiaojia Song, Tao Xie, Stephen Fischer

Erschienen in: High Performance Computing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Existing near-data processing (NDP) techniques have demonstrated their strength for some specific data-intensive applications. However, they might be inadequate for a data center server, which normally needs to perform a diverse range of applications from data-intensive to compute-intensive. How to develop a versatile NDP-powered server to support various data center applications remains an open question. Further, a good understanding of the impact of NDP on data center applications is still missing. For example, can a compute-intensive application also benefit from NDP? Which type of NDP engine is a better choice, an FPGA-based engine or an ARM-based engine? To address these issues, we first propose a new NDP server architecture that tightly couples each SSD with a dedicated NDP engine to fully exploit the data transfer bandwidth of an SSD array. Based on the architecture, two NDP servers ANS (ARM-based NDP Server) and FNS (FPGA-based NDP Server) are introduced. Next, we implement a single-engine prototype for each of them. Finally, we measure performance, energy efficiency, and cost/performance ratio of six typical data center applications running on the two prototypes. Some new findings have been observed.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ahn, J., Hong, S., Yoo, S., Mutlu, O., Choi, K.: A scalable processing-in-memory accelerator for parallel graph processing. ACM SIGARCH Comput. Architect. News 43(3), 105–117 (2016)CrossRef Ahn, J., Hong, S., Yoo, S., Mutlu, O., Choi, K.: A scalable processing-in-memory accelerator for parallel graph processing. ACM SIGARCH Comput. Architect. News 43(3), 105–117 (2016)CrossRef
2.
Zurück zum Zitat Asanovic, K., Patterson, D.: Firebox: a hardware building block for 2020 warehouse-scale computers. In: USENIX FAST, vol. 13 (2014) Asanovic, K., Patterson, D.: Firebox: a hardware building block for 2020 warehouse-scale computers. In: USENIX FAST, vol. 13 (2014)
3.
Zurück zum Zitat Cho, S., Park, C., Oh, H., Kim, S., Yi, Y., Ganger, G.R.: Active disk meets flash: a case for intelligent SSDs. In: Proceedings of the 27th International ACM Conference on International Conference on Supercomputing, pp. 91–102. ACM (2013) Cho, S., Park, C., Oh, H., Kim, S., Yi, Y., Ganger, G.R.: Active disk meets flash: a case for intelligent SSDs. In: Proceedings of the 27th International ACM Conference on International Conference on Supercomputing, pp. 91–102. ACM (2013)
4.
Zurück zum Zitat CNXSoft: Allwinner A64 a quad core 64-bit ARM cortex A53 SoC for tablets (2015) CNXSoft: Allwinner A64 a quad core 64-bit ARM cortex A53 SoC for tablets (2015)
5.
Zurück zum Zitat Davidson, G.S., Cowie, J.R., Helmreich, S.C., Zacharski, R.A., Boyack, K.W.: Data-centric computing with the netezza architecture. Technical report, Sandia National Laboratories (2006) Davidson, G.S., Cowie, J.R., Helmreich, S.C., Zacharski, R.A., Boyack, K.W.: Data-centric computing with the netezza architecture. Technical report, Sandia National Laboratories (2006)
6.
Zurück zum Zitat De, A., Gokhale, M., Gupta, R., Swanson, S.: Minerva: accelerating data analysis in next-generation SSDs. In: FCCM, pp. 9–16. IEEE (2013) De, A., Gokhale, M., Gupta, R., Swanson, S.: Minerva: accelerating data analysis in next-generation SSDs. In: FCCM, pp. 9–16. IEEE (2013)
7.
Zurück zum Zitat Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: CVPR. IEEE (2009) Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: CVPR. IEEE (2009)
10.
Zurück zum Zitat Gao, M., Ayers, G., Kozyrakis, C.: Practical near-data processing for in-memory analytics frameworks. In: 2015 International Conference on PACT, pp. 113–124 (2015) Gao, M., Ayers, G., Kozyrakis, C.: Practical near-data processing for in-memory analytics frameworks. In: 2015 International Conference on PACT, pp. 113–124 (2015)
11.
Zurück zum Zitat Gu, B., et al.: Biscuit: a framework for near-data processing of big data workloads. In: ISCA, pp. 153–165. IEEE (2016) Gu, B., et al.: Biscuit: a framework for near-data processing of big data workloads. In: ISCA, pp. 153–165. IEEE (2016)
12.
Zurück zum Zitat He, H., Guo, H.: The realization of FFT algorithm based on FPGA co-processor. In: Second International Symposium on Intelligent Information Technology Application, IITA 2008, vol. 3, pp. 239–243. IEEE (2008) He, H., Guo, H.: The realization of FFT algorithm based on FPGA co-processor. In: Second International Symposium on Intelligent Information Technology Application, IITA 2008, vol. 3, pp. 239–243. IEEE (2008)
15.
Zurück zum Zitat István, Z., Sidler, D., Alonso, G.: Caribou: intelligent distributed storage. Proc. VLDB Endowment 10(11), 1202–1213 (2017)CrossRef István, Z., Sidler, D., Alonso, G.: Caribou: intelligent distributed storage. Proc. VLDB Endowment 10(11), 1202–1213 (2017)CrossRef
16.
Zurück zum Zitat Jo, I., et al.: YourSQL: a high-performance database system leveraging in-storage computing. Proc. VLDB Endowment 9(12), 924–935 (2016)CrossRef Jo, I., et al.: YourSQL: a high-performance database system leveraging in-storage computing. Proc. VLDB Endowment 9(12), 924–935 (2016)CrossRef
17.
Zurück zum Zitat Jun, S.W., Liu, M., Lee, S., Hicks, et al.: BlueDBM: an appliance for big data analytics. In: Computer Architecture (ISCA), pp. 1–13 (2015) Jun, S.W., Liu, M., Lee, S., Hicks, et al.: BlueDBM: an appliance for big data analytics. In: Computer Architecture (ISCA), pp. 1–13 (2015)
18.
Zurück zum Zitat Koo, G., et al.: Summarizer: trading communication with computing near storage. In: Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, pp. 219–231. ACM (2017) Koo, G., et al.: Summarizer: trading communication with computing near storage. In: Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, pp. 219–231. ACM (2017)
19.
Zurück zum Zitat Mayhew, D., Krishnan, V.: PCI express and advanced switching: evolutionary path to building next generation interconnects. In: Proceedings of the High Performance Interconnects, pp. 21–29 (2003) Mayhew, D., Krishnan, V.: PCI express and advanced switching: evolutionary path to building next generation interconnects. In: Proceedings of the High Performance Interconnects, pp. 21–29 (2003)
20.
Zurück zum Zitat Nurvitadhi, E., Sheffield, D., Sim, J., Mishra, A., Venkatesh, G., Marr, D.: Accelerating binarized neural networks: comparison of FPGA, CPU, GPU, and ASIC. In: FPT, pp. 77–84. IEEE (2016) Nurvitadhi, E., Sheffield, D., Sim, J., Mishra, A., Venkatesh, G., Marr, D.: Accelerating binarized neural networks: comparison of FPGA, CPU, GPU, and ASIC. In: FPT, pp. 77–84. IEEE (2016)
21.
Zurück zum Zitat Rodinia: accelerating compute-intensive applications with accelerators (2009) Rodinia: accelerating compute-intensive applications with accelerators (2009)
23.
Zurück zum Zitat Talbot, J., Yoo, R.M., Kozyrakis, C.: Phoenix++: modular mapreduce for shared-memory systems. In: Proceedings of the Second International Workshop on MapReduce and its Applications, pp. 9–16. ACM (2011) Talbot, J., Yoo, R.M., Kozyrakis, C.: Phoenix++: modular mapreduce for shared-memory systems. In: Proceedings of the Second International Workshop on MapReduce and its Applications, pp. 9–16. ACM (2011)
24.
Zurück zum Zitat Tiwari, D., et al.: Active flash: towards energy-efficient, in-situ data analytics on extreme-scale machines. In: FAST, pp. 119–132 (2013) Tiwari, D., et al.: Active flash: towards energy-efficient, in-situ data analytics on extreme-scale machines. In: FAST, pp. 119–132 (2013)
27.
Zurück zum Zitat Woods, L., István, Z., Alonso, G.: Ibex: an intelligent storage engine with support for advanced SQL offloading. Proc. VLDB Endowment 7(11), 963–974 (2014)CrossRef Woods, L., István, Z., Alonso, G.: Ibex: an intelligent storage engine with support for advanced SQL offloading. Proc. VLDB Endowment 7(11), 963–974 (2014)CrossRef
29.
Zurück zum Zitat Yoshimi, M., Oge, Y., Yoshinaga, T.: Pipelined parallel join and its FPGA-based acceleration. TRETS 10(4), 28 (2017)CrossRef Yoshimi, M., Oge, Y., Yoshinaga, T.: Pipelined parallel join and its FPGA-based acceleration. TRETS 10(4), 28 (2017)CrossRef
30.
Zurück zum Zitat Zhang, D., Jayasena, N., Lyashevsky, A., Greathouse, J.L., Xu, L., Ignatowski, M.: TOP-PIM: throughput-oriented programmable processing in memory. In: HPDC, pp. 85–98. ACM (2014) Zhang, D., Jayasena, N., Lyashevsky, A., Greathouse, J.L., Xu, L., Ignatowski, M.: TOP-PIM: throughput-oriented programmable processing in memory. In: HPDC, pp. 85–98. ACM (2014)
Metadaten
Titel
A Near-Data Processing Server Architecture and Its Impact on Data Center Applications
verfasst von
Xiaojia Song
Tao Xie
Stephen Fischer
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-20656-7_5

Premium Partner