Skip to main content
Erschienen in: The Journal of Supercomputing 11/2020

19.02.2020

ScrimpCo: scalable matrix profile on commodity heterogeneous processors

verfasst von: Jose C. Romero, Antonio Vilches, Andrés Rodríguez, Angeles Navarro, Rafael Asenjo

Erschienen in: The Journal of Supercomputing | Ausgabe 11/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The discovery of time series motifs and discords is considered a paramount and challenging problem regarding time series analysis. In this work, we present ScrimpCo, a heterogeneous implementation of a previous algorithm called SCRIMP that excels at finding relevant subsequences in time series. We propose and evaluate several static, dynamic and adaptive partition strategies targeting commodity processors, on both homogeneous (CPU multicore) and heterogeneous (CPU + GPU) architectures. For the CPU + GPU implementation, we explore a heterogeneous parallel_reduce pattern that computes part of the computation onto an OpenCL capable GPU, whereas the CPU cores take care of the other part. Our heterogeneous scheduler, built on top of TBB, pays special attention to appropriately balance the computational load among the GPU and CPU cores. The experimental results show that our homogeneous implementation scales linearly and that our heterogeneous implementation allows us to reach near-ideal performance on commodity processors that feature an on-chip GPU.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Yoon CE, O’Reilly O, Bergen KJ, Beroza GC (2015) Earthquake detection through computationally efficient similarity search. Sci Adv 1(11):e1501057CrossRef Yoon CE, O’Reilly O, Bergen KJ, Beroza GC (2015) Earthquake detection through computationally efficient similarity search. Sci Adv 1(11):e1501057CrossRef
2.
Zurück zum Zitat Rhodes J, Cole W, Upshaw C, Edgar T, Webber M (2014) Clustering analysis of residential electricity demand profiles. Appl Energy 135:461–471CrossRef Rhodes J, Cole W, Upshaw C, Edgar T, Webber M (2014) Clustering analysis of residential electricity demand profiles. Appl Energy 135:461–471CrossRef
3.
Zurück zum Zitat Kolb I, Talei Franzesi G, Wang M, Kodandaramaiah SB, Forest CR, Boyden ES, Singer AC (2018) Evidence for long-timescale patterns of synaptic inputs in ca1 of awake behaving mice. J Neurosci 38(7):1821–1834CrossRef Kolb I, Talei Franzesi G, Wang M, Kodandaramaiah SB, Forest CR, Boyden ES, Singer AC (2018) Evidence for long-timescale patterns of synaptic inputs in ca1 of awake behaving mice. J Neurosci 38(7):1821–1834CrossRef
4.
Zurück zum Zitat Balázs S, Ajinkya D, Barbara W (2015) Searching for motifs in the behaviour of larval Drosophila melanogaster and Caenorhabditis elegans reveals continuity between behavioural states. J R Soc Interface 12(113):20150899CrossRef Balázs S, Ajinkya D, Barbara W (2015) Searching for motifs in the behaviour of larval Drosophila melanogaster and Caenorhabditis elegans reveals continuity between behavioural states. J R Soc Interface 12(113):20150899CrossRef
5.
Zurück zum Zitat Yeh CCM, Zhu Y, Ulanova L, Begum N, Ding Y, Dau HA, Silva DF, Mueen A, Keogh E (2016) Matrix profile i: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), IEEE 1317–1322 Yeh CCM, Zhu Y, Ulanova L, Begum N, Ding Y, Dau HA, Silva DF, Mueen A, Keogh E (2016) Matrix profile i: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), IEEE 1317–1322
6.
Zurück zum Zitat Zhu Y, Zimmerman Z, Senobari NS, Yeh CM, Funning G, Mueen A, Brisk P, Keogh E (2016) Matrix profile ii: Exploiting a novel algorithm and GPUs to break the one hundred million barrier for time series motifs and joins. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), pp 739–748 Zhu Y, Zimmerman Z, Senobari NS, Yeh CM, Funning G, Mueen A, Brisk P, Keogh E (2016) Matrix profile ii: Exploiting a novel algorithm and GPUs to break the one hundred million barrier for time series motifs and joins. In: 2016 IEEE 16th International Conference on Data Mining (ICDM), pp 739–748
7.
Zurück zum Zitat Zhu Y, Yeh CCM, Zimmerman ZF, Kamgar K, Keogh E (2018) Matrix Profile XI: SCRIMP++: time series motif discovery at interactive speeds. In: 2018 IEEE International Conference on Data Mining (ICDM), pp 837–846 Zhu Y, Yeh CCM, Zimmerman ZF, Kamgar K, Keogh E (2018) Matrix Profile XI: SCRIMP++: time series motif discovery at interactive speeds. In: 2018 IEEE International Conference on Data Mining (ICDM), pp 837–846
8.
Zurück zum Zitat Pfeilschifter G (2019) Time series analysis with matrix profile on HPC systems. Master’s thesis, Department of Informatics, Technical University of Munich, Germany Pfeilschifter G (2019) Time series analysis with matrix profile on HPC systems. Master’s thesis, Department of Informatics, Technical University of Munich, Germany
9.
Zurück zum Zitat Zhu Y, Zimmerman Z, Shakibay Senobari N, Yeh CCM, Funning G, Mueen A, Brisk P, Keogh E (2018) Exploiting a novel algorithm and GPUs to break the ten quadrillion pairwise comparisons barrier for time series motifs and joins. Knowl Inf Syst 54(1):203–236CrossRef Zhu Y, Zimmerman Z, Shakibay Senobari N, Yeh CCM, Funning G, Mueen A, Brisk P, Keogh E (2018) Exploiting a novel algorithm and GPUs to break the ten quadrillion pairwise comparisons barrier for time series motifs and joins. Knowl Inf Syst 54(1):203–236CrossRef
10.
Zurück zum Zitat Fernandez I, Villegas A, Gutierrez E, Plata O (2019) Accelerating time series motif discovery in the Intel Xeon Phi KNL processor. J Supercomput 75(11):7053–7075CrossRef Fernandez I, Villegas A, Gutierrez E, Plata O (2019) Accelerating time series motif discovery in the Intel Xeon Phi KNL processor. J Supercomput 75(11):7053–7075CrossRef
11.
Zurück zum Zitat Voss M, Asenjo R, Reinders J (2019) Pro TBB: C++ parallel programming with threading building blocks. Apress, New YorkCrossRef Voss M, Asenjo R, Reinders J (2019) Pro TBB: C++ parallel programming with threading building blocks. Apress, New YorkCrossRef
12.
Zurück zum Zitat Navarro A, Corbera F, Rodriguez A, Vilches A, Asenjo R (2019) Heterogeneous parallel_for template for CPU-GPU chips. Int J Parallel Program 47(2):213–233CrossRef Navarro A, Corbera F, Rodriguez A, Vilches A, Asenjo R (2019) Heterogeneous parallel_for template for CPU-GPU chips. Int J Parallel Program 47(2):213–233CrossRef
13.
Zurück zum Zitat Paparrizos J, Gravano L (2016) k-shape: Efficient and accurate clustering of time series. SIGMOD Rec 45(1):69–76CrossRef Paparrizos J, Gravano L (2016) k-shape: Efficient and accurate clustering of time series. SIGMOD Rec 45(1):69–76CrossRef
14.
Zurück zum Zitat Echihabi K, Zoumpatianos K, Palpanas T, Benbrahim H (2018) The lernaean hydra of data series similarity search: An experimental evaluation of the state of the art. PVLDB 12(2):112–127 Echihabi K, Zoumpatianos K, Palpanas T, Benbrahim H (2018) The lernaean hydra of data series similarity search: An experimental evaluation of the state of the art. PVLDB 12(2):112–127
15.
Zurück zum Zitat Makridakis S, Spiliotis E, Assimakopoulos V (2019) The m4 competition: 100,000 time series and 61 forecasting methods. Int J Forecast 36(1):54–74CrossRef Makridakis S, Spiliotis E, Assimakopoulos V (2019) The m4 competition: 100,000 time series and 61 forecasting methods. Int J Forecast 36(1):54–74CrossRef
16.
Zurück zum Zitat Ahmed NK, Atiya AF, Gayar NE, El-Shishiny H (2010) An empirical comparison of machine learning models for time series forecasting. Econom Rev 29(5–6):594–621MathSciNetCrossRef Ahmed NK, Atiya AF, Gayar NE, El-Shishiny H (2010) An empirical comparison of machine learning models for time series forecasting. Econom Rev 29(5–6):594–621MathSciNetCrossRef
17.
Zurück zum Zitat McGovern A, Rosendahl DH, Brown RA, Droegemeier KK (2011) Identifying predictive multi-dimensional time series motifs: an application to severe weather prediction. Data Min Knowl Discov 22(1):232–258CrossRef McGovern A, Rosendahl DH, Brown RA, Droegemeier KK (2011) Identifying predictive multi-dimensional time series motifs: an application to severe weather prediction. Data Min Knowl Discov 22(1):232–258CrossRef
18.
Zurück zum Zitat Torkamani S, Lohweg V (2017) Survey on time series motif discovery. Data Min Knowl Discov 7(2):1199CrossRef Torkamani S, Lohweg V (2017) Survey on time series motif discovery. Data Min Knowl Discov 7(2):1199CrossRef
20.
Zurück zum Zitat Zymbler M, Polyakov A, Kipnis M (2019) Time series discord discovery on intel many-core systems. In: Sokolinsky L, Zymbler M (eds) Parallel computational technologies. Springer, Cham, pp 168–182CrossRef Zymbler M, Polyakov A, Kipnis M (2019) Time series discord discovery on intel many-core systems. In: Sokolinsky L, Zymbler M (eds) Parallel computational technologies. Springer, Cham, pp 168–182CrossRef
21.
Zurück zum Zitat Przymus P, Kaczmarski K (2014) Time series queries processing with GPU support. New trends in databases and information systems. Springer, Cham, pp 53–60CrossRef Przymus P, Kaczmarski K (2014) Time series queries processing with GPU support. New trends in databases and information systems. Springer, Cham, pp 53–60CrossRef
22.
Zurück zum Zitat Bueno J, Planas J, Duran A, Badia RM, Martorell X, Ayguadé E, Labarta J (2012) Productive programming of GPU clusters with OmpSs. In: International Parallel and Distributed Processing Symposium, pp 557–568 Bueno J, Planas J, Duran A, Badia RM, Martorell X, Ayguadé E, Labarta J (2012) Productive programming of GPU clusters with OmpSs. In: International Parallel and Distributed Processing Symposium, pp 557–568
23.
Zurück zum Zitat Belviranli M, Bhuyan L, Gupta R (2013) A dynamic self-scheduling scheme for heterogeneous multiprocessor architectures. ACM Trans Archit Code Optim (TACO) 9(4):1–20CrossRef Belviranli M, Bhuyan L, Gupta R (2013) A dynamic self-scheduling scheme for heterogeneous multiprocessor architectures. ACM Trans Archit Code Optim (TACO) 9(4):1–20CrossRef
24.
Zurück zum Zitat Pandit P, Govindarajan R (2014) Fluidic kernels: cooperative execution of OpenCL programs on multiple heterogeneous devices. In: International Symposium on Code Generation and Optimization, CGO ’14 Pandit P, Govindarajan R (2014) Fluidic kernels: cooperative execution of OpenCL programs on multiple heterogeneous devices. In: International Symposium on Code Generation and Optimization, CGO ’14
25.
Zurück zum Zitat Kaleem R, Barik R, Shpeisman T, Hu C, Lewis BT, Pingali K (Aug 2014) Adaptive heterogeneous scheduling for integrated GPUs. In: International Conference on Parallel Architecture and Compilation Techniques (PACT), pp 151–162 Kaleem R, Barik R, Shpeisman T, Hu C, Lewis BT, Pingali K (Aug 2014) Adaptive heterogeneous scheduling for integrated GPUs. In: International Conference on Parallel Architecture and Compilation Techniques (PACT), pp 151–162
26.
Zurück zum Zitat Pérez B, Bosque JL, Beivide R (2016) Simplifying programming and load balancing of data parallel applications on Heterogeneous Systems. In: Proceedings of the 9th Annual Workshop on General Purpose Processing GPU. GPGPU ’16, pp 42–51 Pérez B, Bosque JL, Beivide R (2016) Simplifying programming and load balancing of data parallel applications on Heterogeneous Systems. In: Proceedings of the 9th Annual Workshop on General Purpose Processing GPU. GPGPU ’16, pp 42–51
27.
Zurück zum Zitat Blumofe RD, Leiserson CE (1999) Scheduling multithreaded computations by work stealing. J ACM 46(5):720–748MathSciNetCrossRef Blumofe RD, Leiserson CE (1999) Scheduling multithreaded computations by work stealing. J ACM 46(5):720–748MathSciNetCrossRef
30.
Zurück zum Zitat Nelson CR, Plosser CR (1982) Trends and random walks in macroeconomic time series: some evidence and implications. J Monet Econ 10(2):139–162CrossRef Nelson CR, Plosser CR (1982) Trends and random walks in macroeconomic time series: some evidence and implications. J Monet Econ 10(2):139–162CrossRef
Metadaten
Titel
ScrimpCo: scalable matrix profile on commodity heterogeneous processors
verfasst von
Jose C. Romero
Antonio Vilches
Andrés Rodríguez
Angeles Navarro
Rafael Asenjo
Publikationsdatum
19.02.2020
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 11/2020
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03199-w

Weitere Artikel der Ausgabe 11/2020

The Journal of Supercomputing 11/2020 Zur Ausgabe