Skip to main content

Tipp

Weitere Artikel dieser Ausgabe durch Wischen aufrufen

09.09.2019 | Schwerpunktbeitrag

Evaluating the Vector Supercomputer SX-Aurora TSUBASA as a Co-Processor for In-Memory Database Systems

verfasst von: Johannes Pietrzyk, Dirk Habich, Patrick Damme, Erich Focht, Wolfgang Lehner

Erschienen in: Datenbank-Spektrum | Ausgabe 3/2019

Einloggen, um Zugang zu erhalten
share
TEILEN

Abstract

In-memory column-store database systems are state of the art for the efficient processing of analytical workloads. In these systems, data compression as well as vectorization play an important role. Currently, the vectorized processing is done using regular SIMD (Single Instruction Multiple Data) extensions of modern processors. For example, Intel’s latest SIMD extension supports 512-bit vector registers which allows the parallel processing of 8× 64-bit values. From a database system perspective, this vectorization technique is not only very interesting for compression and decompression to reduce the computational overhead, but also for all database operators like joins, scan, as well as groupings. In contrast to these SIMD extensions, NEC Corporation has recently introduced a novel pure vector engine (supercomputer) as a co-processor called SX-Aurora TSUBASA. This vector engine features a vector length of 16.384 bits with the world’s highest bandwidth of up to 1.2 TB/s, which perfectly fits to data-intensive applications like in-memory database systems. Therefore, we describe the unique architecture and properties of this novel vector engine in this paper. Moreover, we present selected in-memory column-store-specific evaluation results to show the benefits of this vector engine compared to regular SIMD extensions. Finally, we conclude the paper with an outlook on our ongoing research activities in this direction.

Sie möchten Zugang zu diesem Inhalt erhalten? Dann informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 69.000 Bücher
  • über 500 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt 90 Tage mit der neuen Mini-Lizenz testen!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 58.000 Bücher
  • über 300 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko





Jetzt 90 Tage mit der neuen Mini-Lizenz testen!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 50.000 Bücher
  • über 380 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe



 


Jetzt 90 Tage mit der neuen Mini-Lizenz testen!

Weitere Produktempfehlungen anzeigen
Fußnoten
1
High Bandwidth Memory Version 2.
 
2
In comparison, the vector length of Intel’s latest vector extension AVX-512 is limited to 8 elements with 8 Byte per element.
 
Literatur
1.
Zurück zum Zitat Abadi D, Boncz PA, Harizopoulos S, Idreos S, Madden S (2013) The design and implementation of modern column-oriented database systems. Found Trends Databases 5(3):197–280 CrossRef Abadi D, Boncz PA, Harizopoulos S, Idreos S, Madden S (2013) The design and implementation of modern column-oriented database systems. Found Trends Databases 5(3):197–280 CrossRef
2.
Zurück zum Zitat Abadi DJ, Madden S, Ferreira M (2006) Integrating compression and execution in column-oriented database systems. In: SIGMOD, pp 671–682. ACM: New York Abadi DJ, Madden S, Ferreira M (2006) Integrating compression and execution in column-oriented database systems. In: SIGMOD, pp 671–682. ACM: New York
3.
Zurück zum Zitat Binnig C, Hildenbrand S, Färber F (2009) Dictionary-based order-preserving string compression for main memory column stores. In: SIGMOD, pp 283–296. ACM: New York Binnig C, Hildenbrand S, Färber F (2009) Dictionary-based order-preserving string compression for main memory column stores. In: SIGMOD, pp 283–296. ACM: New York
4.
Zurück zum Zitat Boncz PA, Kersten ML, Manegold S (2008) Breaking the memory wall in monetdb. Commun ACM 51(12):77–85 CrossRef Boncz PA, Kersten ML, Manegold S (2008) Breaking the memory wall in monetdb. Commun ACM 51(12):77–85 CrossRef
5.
Zurück zum Zitat Chen Z, Gehrke J, Korn F (2001) Query optimization in compressed database systems. In: SIGMOD, pp 271–282. ACM: New York Chen Z, Gehrke J, Korn F (2001) Query optimization in compressed database systems. In: SIGMOD, pp 271–282. ACM: New York
6.
Zurück zum Zitat Copeland GP, Khoshafian S (1985) A decomposition storage model. In: SIGMOD, pp 268–279. ACM: New York Copeland GP, Khoshafian S (1985) A decomposition storage model. In: SIGMOD, pp 268–279. ACM: New York
7.
Zurück zum Zitat Damme P (2017) Query processing based on compressed intermediates. VLDB PhD Workshop. Munich, 28.08.2017 Damme P (2017) Query processing based on compressed intermediates. VLDB PhD Workshop. Munich, 28.08.2017
8.
Zurück zum Zitat Damme P, Habich D, Hildebrandt J, Lehner W (2017) Lightweight data compression algorithms: An experimental survey (experiments and analyses). In: EDBT, pp 72–83. Venice, 21–24.03.2017 Damme P, Habich D, Hildebrandt J, Lehner W (2017) Lightweight data compression algorithms: An experimental survey (experiments and analyses). In: EDBT, pp 72–83. Venice, 21–24.03.2017
9.
Zurück zum Zitat Damme P, Habich D, Lehner W (2015) Direct transformation techniques for compressed data: General approach and application scenarios. In: ADBIS, pp 151–165. Springer Damme P, Habich D, Lehner W (2015) Direct transformation techniques for compressed data: General approach and application scenarios. In: ADBIS, pp 151–165. Springer
10.
Zurück zum Zitat Damme P, Ungethüm A, Hildebrandt J, Habich D, Lehner W (2019) From a comprehensive experimental survey to a cost-based selection strategy for lightweight integer compression algorithms. ACM Trans Database Syst 44(3):9:1–9:46 CrossRef Damme P, Ungethüm A, Hildebrandt J, Habich D, Lehner W (2019) From a comprehensive experimental survey to a cost-based selection strategy for lightweight integer compression algorithms. ACM Trans Database Syst 44(3):9:1–9:46 CrossRef
11.
Zurück zum Zitat Faerber F, Kemper A, Larson P, Levandoski JJ, Neumann T, Pavlo A (2017) Main memory database systems. Found Trends Databases 8(1-2):1–130 CrossRef Faerber F, Kemper A, Larson P, Levandoski JJ, Neumann T, Pavlo A (2017) Main memory database systems. Found Trends Databases 8(1-2):1–130 CrossRef
12.
Zurück zum Zitat Habich D, Damme P, Ungethüm A, Lehner W (2018) Make larger vector register sizes new challenges?: Lessons learned from the area of vectorized lightweight compression algorithms. In: DBTest@SIGMOD, pp 8:1–8:6. ACM: New York Habich D, Damme P, Ungethüm A, Lehner W (2018) Make larger vector register sizes new challenges?: Lessons learned from the area of vectorized lightweight compression algorithms. In: DBTest@SIGMOD, pp 8:1–8:6. ACM: New York
13.
Zurück zum Zitat Habich D, Damme P, Ungethüm A, Pietrzyk J, Krause A, Hildebrandt J, Lehner W (2019) Morphstore – in-memory query processing based on morphing compressed intermediates LIVE. In: SIGMOD, pp 1917–1920. ACM: New York Habich D, Damme P, Ungethüm A, Pietrzyk J, Krause A, Hildebrandt J, Lehner W (2019) Morphstore – in-memory query processing based on morphing compressed intermediates LIVE. In: SIGMOD, pp 1917–1920. ACM: New York
14.
Zurück zum Zitat He J, Zhang S, He B (2014) In-cache query co-processing on coupled CPU-GPU architectures. PVLDB 8(4):329–340 He J, Zhang S, He B (2014) In-cache query co-processing on coupled CPU-GPU architectures. PVLDB 8(4):329–340
15.
Zurück zum Zitat Hildebrandt J, Habich D, Damme P, Lehner W (2016) Compression-aware in-memory query processing: Vision, system design and beyond. In: ADMS, pp 40–56. Springer Hildebrandt J, Habich D, Damme P, Lehner W (2016) Compression-aware in-memory query processing: Vision, system design and beyond. In: ADMS, pp 40–56. Springer
16.
Zurück zum Zitat Idreos S, Groffen F, Nes N, Manegold S, Mullender KS, Kersten ML (2012) Monetdb: two decades of research in column-oriented database architectures. IEEE Data Eng Bull 35(1):40–45 Idreos S, Groffen F, Nes N, Manegold S, Mullender KS, Kersten ML (2012) Monetdb: two decades of research in column-oriented database architectures. IEEE Data Eng Bull 35(1):40–45
17.
Zurück zum Zitat Karnagel T, Habich D, Lehner W (2017) Adaptive work placement for query processing on heterogeneous computing resources. PVLDB 10(7):733–744 Karnagel T, Habich D, Lehner W (2017) Adaptive work placement for query processing on heterogeneous computing resources. PVLDB 10(7):733–744
18.
Zurück zum Zitat Karnagel T, Müller R, Lohman GM (2015) Optimizing gpu-accelerated group-by and aggregation. In: ADMS, pp 13–24. Springer Karnagel T, Müller R, Lohman GM (2015) Optimizing gpu-accelerated group-by and aggregation. In: ADMS, pp 13–24. Springer
19.
Zurück zum Zitat Kissinger T, Schlegel B, Habich D, Lehner W (2013) QPPT: query processing on prefix trees. In: CIDR. Asilomar, 06.–09.01.2013 Kissinger T, Schlegel B, Habich D, Lehner W (2013) QPPT: query processing on prefix trees. In: CIDR. Asilomar, 06.–09.01.2013
20.
Zurück zum Zitat Komatsu K, Momose S, Isobe Y, Watanabe O, Musa A, Yokokawa M, Aoyama T, Sato M, Kobayashi H (2018) Performance evaluation of a vector supercomputer sx-aurora TSUBASA. In: SC, pp 54:1–54:12. IEEE/ACM: New York Komatsu K, Momose S, Isobe Y, Watanabe O, Musa A, Yokokawa M, Aoyama T, Sato M, Kobayashi H (2018) Performance evaluation of a vector supercomputer sx-aurora TSUBASA. In: SC, pp 54:1–54:12. IEEE/ACM: New York
21.
Zurück zum Zitat Lang H, Kipf A, Passing L, Boncz PA, Neumann T, Kemper A (2018) Make the most out of your SIMD investments: counter control flow divergence in compiled query pipelines. In: DaMoN@SIGMOD, pp 5:1–5:8. ACM: New York Lang H, Kipf A, Passing L, Boncz PA, Neumann T, Kemper A (2018) Make the most out of your SIMD investments: counter control flow divergence in compiled query pipelines. In: DaMoN@SIGMOD, pp 5:1–5:8. ACM: New York
22.
Zurück zum Zitat Lang H, Mühlbauer T, Funke F, Boncz PA, Neumann T, Kemper A (2016) Data blocks: Hybrid OLTP and OLAP on compressed storage using both vectorization and compilation. In: SIGMOD, pp 311–326. ACM: New York CrossRef Lang H, Mühlbauer T, Funke F, Boncz PA, Neumann T, Kemper A (2016) Data blocks: Hybrid OLTP and OLAP on compressed storage using both vectorization and compilation. In: SIGMOD, pp 311–326. ACM: New York CrossRef
23.
Zurück zum Zitat Lee J et al (2014) Joins on encoded and partitioned data. PVLDB 7(13):1355–1366 Lee J et al (2014) Joins on encoded and partitioned data. PVLDB 7(13):1355–1366
24.
Zurück zum Zitat Lemire D, Boytsov L (2015) Decoding billions of integers per second through vectorization. Softw Pract Exper 45(1):1–29 CrossRef Lemire D, Boytsov L (2015) Decoding billions of integers per second through vectorization. Softw Pract Exper 45(1):1–29 CrossRef
25.
Zurück zum Zitat Li F, Das S, Syamala M, Narasayya VR (2016) Accelerating relational databases by leveraging remote memory and RDMA. In: SIGMOD, pp 355–370. ACM: New York CrossRef Li F, Das S, Syamala M, Narasayya VR (2016) Accelerating relational databases by leveraging remote memory and RDMA. In: SIGMOD, pp 355–370. ACM: New York CrossRef
26.
Zurück zum Zitat Li Y, Patel JM (2013) Bitweaving: Fast scans for main memory data processing. In: SIGMOD, pp 289–300. ACM: New York Li Y, Patel JM (2013) Bitweaving: Fast scans for main memory data processing. In: SIGMOD, pp 289–300. ACM: New York
27.
Zurück zum Zitat Lisa NJ, Ungethüm A, Habich D, Lehner W, Nguyen TDA, Kumar A (2018) Column scan acceleration in hybrid CPU-FPGA systems. In: ADMS@VLDB, pp 22–33. Rio de Janeiro, 27.08.2018 Lisa NJ, Ungethüm A, Habich D, Lehner W, Nguyen TDA, Kumar A (2018) Column scan acceleration in hybrid CPU-FPGA systems. In: ADMS@VLDB, pp 22–33. Rio de Janeiro, 27.08.2018
28.
Zurück zum Zitat Oukid I, Booss D, Lespinasse A, Lehner W, Willhalm T, Gomes G (2017) Memory management techniques for large-scale persistent-main-memory systems. PVLDB 10(11):1166–1177 Oukid I, Booss D, Lespinasse A, Lehner W, Willhalm T, Gomes G (2017) Memory management techniques for large-scale persistent-main-memory systems. PVLDB 10(11):1166–1177
29.
Zurück zum Zitat Pietrzyk J, Habich D, Damme P, Lehner W (2019) First investigations of the vector supercomputer sx-aurora TSUBASA as a co-processor for database systems. In: BTW Workshopband, pp 33–50. GI: Bonn Pietrzyk J, Habich D, Damme P, Lehner W (2019) First investigations of the vector supercomputer sx-aurora TSUBASA as a co-processor for database systems. In: BTW Workshopband, pp 33–50. GI: Bonn
30.
Zurück zum Zitat Pietrzyk J, Ungethüm A, Habich D, Lehner W (2019) Fighting the duplicates in hashing: conflict detection-aware vectorization of linear probing. In: BTW, pp 35–53. GI: Bonn Pietrzyk J, Ungethüm A, Habich D, Lehner W (2019) Fighting the duplicates in hashing: conflict detection-aware vectorization of linear probing. In: BTW, pp 35–53. GI: Bonn
31.
Zurück zum Zitat Pirk H, Moll O, Zaharia M, Madden S (2016) Voodoo – A vector algebra for portable database performance on modern hardware. PVLDB 9(14):1707–1718 Pirk H, Moll O, Zaharia M, Madden S (2016) Voodoo – A vector algebra for portable database performance on modern hardware. PVLDB 9(14):1707–1718
32.
Zurück zum Zitat Polychroniou O, Raghavan A, Ross KA (2015) Rethinking SIMD vectorization for in-memory databases. In: SIGMOD, pp 1493–1508. ACM: New York Polychroniou O, Raghavan A, Ross KA (2015) Rethinking SIMD vectorization for in-memory databases. In: SIGMOD, pp 1493–1508. ACM: New York
33.
Zurück zum Zitat Stonebraker M, Abadi DJ, Batkin A, Chen X, Cherniack M, Ferreira M, Lau E, Lin A, Madden S, O’Neil EJ, O’Neil PE, Rasin A, Tran N, Zdonik SB (2005) C‑store: a column-oriented DBMS. In: VLDB, pp 553–564. ACM: New York Stonebraker M, Abadi DJ, Batkin A, Chen X, Cherniack M, Ferreira M, Lau E, Lin A, Madden S, O’Neil EJ, O’Neil PE, Rasin A, Tran N, Zdonik SB (2005) C‑store: a column-oriented DBMS. In: VLDB, pp 553–564. ACM: New York
34.
Zurück zum Zitat Ungethüm A, Pietrzyk J, Damme P, Habich D, Lehner W (2018) Conflict detection-based run-length encoding – AVX-512 CD instruction set in action. In: ICDE Workshops, pp 96–101. IEEE Computer Society: Washington D.C. Ungethüm A, Pietrzyk J, Damme P, Habich D, Lehner W (2018) Conflict detection-based run-length encoding – AVX-512 CD instruction set in action. In: ICDE Workshops, pp 96–101. IEEE Computer Society: Washington D.C.
35.
Zurück zum Zitat Zukowski M, Héman S, Nes N, Boncz PA (2006) Super-scalar RAM-CPU cache compression. In: ICDE, p 59. IEEE Computer Society: Washington D.C. Zukowski M, Héman S, Nes N, Boncz PA (2006) Super-scalar RAM-CPU cache compression. In: ICDE, p 59. IEEE Computer Society: Washington D.C.
36.
Zurück zum Zitat Zukowski M, van de Wiel M, Boncz PA (2012) Vectorwise: a vectorized analytical DBMS. In: ICDE, pp 1349–1350. IEEE Computer Society: Washington D.C. Zukowski M, van de Wiel M, Boncz PA (2012) Vectorwise: a vectorized analytical DBMS. In: ICDE, pp 1349–1350. IEEE Computer Society: Washington D.C.
Metadaten
Titel
Evaluating the Vector Supercomputer SX-Aurora TSUBASA as a Co-Processor for In-Memory Database Systems
verfasst von
Johannes Pietrzyk
Dirk Habich
Patrick Damme
Erich Focht
Wolfgang Lehner
Publikationsdatum
09.09.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Datenbank-Spektrum / Ausgabe 3/2019
Print ISSN: 1618-2162
Elektronische ISSN: 1610-1995
DOI
https://doi.org/10.1007/s13222-019-00323-w

Premium Partner