Skip to main content
Erschienen in: Data Mining and Knowledge Discovery 6/2021

16.08.2021

Early abandoning and pruning for elastic distances including dynamic time warping

verfasst von: Matthieu Herrmann, Geoffrey I. Webb

Erschienen in: Data Mining and Knowledge Discovery | Ausgabe 6/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Nearest neighbor search under elastic distances is a key tool for time series analysis, supporting many applications. However, straightforward implementations of distances require \(O(n^2)\) space and time complexities, preventing these applications from scaling to long series. Much work has been devoted to speeding up the NN search process, mostly with the development of lower bounds, allowing to avoid costly distance computations when a given threshold is exceeded. This threshold, provided by the similarity search process, also allows to early abandon the computation of a distance itself. Another approach, is to prune parts of the computation. All these techniques are orthogonal to each other. In this work, we develop a new generic strategy, “EAPruned”, that tightly integrates pruning with early abandoning. We apply it to six elastic distance measures: DTW, CDTW, WDTW, ERP, MSM and TWE, showing substantial speedup in NN search applications. Pruning alone also shows substantial speedup for some distances, benefiting applications beyond the scope of NN search (e.g. requiring all pairwise distances), and hence where early abandoning is not applicable. We release our implementation as part of a new C++ library for time series classification, along with easy to use Python/Numpy bindings.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
In ERP, the gap cost is given by \({{\,\mathrm{cost}\,}}(s_i, g)\). If S is translated, the gap cost also changes while the canonical alignment cost remains unchanged, making ERP translation-sensitive.
 
2
Similar to discard points, instructing the next rows to skip their column.
 
Literatur
Zurück zum Zitat Bagnall A, Flynn M, Large J, Lines J, Middlehurst M (2020) A tale of two toolkits, report the third: On the usage and performance of HIVE-COTE v1.0. arXiv:2004.06069 Bagnall A, Flynn M, Large J, Lines J, Middlehurst M (2020) A tale of two toolkits, report the third: On the usage and performance of HIVE-COTE v1.0. arXiv:​2004.​06069
Zurück zum Zitat Benkabou SE, Benabdeslem K, Canitia B (2018) Unsupervised outlier detection for time series by entropy and dynamic time warping. Knowl Inf Syst 54(2):463–486CrossRef Benkabou SE, Benabdeslem K, Canitia B (2018) Unsupervised outlier detection for time series by entropy and dynamic time warping. Knowl Inf Syst 54(2):463–486CrossRef
Zurück zum Zitat Chen L, Ng R (2004) On the marriage of lp-norms and edit distance. In: Proceedings 2004 VLDB Conference, pp 792 – 803 Chen L, Ng R (2004) On the marriage of lp-norms and edit distance. In: Proceedings 2004 VLDB Conference, pp 792 – 803
Zurück zum Zitat Dau HA, Bagnall A, Kamgar K, Yeh CCM, Zhu Y, Gharghabi S, Ratanamahatana CA, Keogh E (2019) The UCR Time Series Archive. arXiv:1810.07758 Dau HA, Bagnall A, Kamgar K, Yeh CCM, Zhu Y, Gharghabi S, Ratanamahatana CA, Keogh E (2019) The UCR Time Series Archive. arXiv:​1810.​07758
Zurück zum Zitat Itakura F (1975) Minimum prediction residual principle applied to speech recognition. IEEE Trans Acoust Speech Signal Process 23(1):67–72CrossRef Itakura F (1975) Minimum prediction residual principle applied to speech recognition. IEEE Trans Acoust Speech Signal Process 23(1):67–72CrossRef
Zurück zum Zitat Rakthanmanon T, Campana B, Mueen A, Batista G, Westover B, Zhu Q, Zakaria J, Keogh E (2012) Searching and mining trillions of time series subsequences under dynamic time warping. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD ’12, ACM Press, Beijing, China, p 262, https://doi.org/10.1145/2339530.2339576 Rakthanmanon T, Campana B, Mueen A, Batista G, Westover B, Zhu Q, Zakaria J, Keogh E (2012) Searching and mining trillions of time series subsequences under dynamic time warping. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD ’12, ACM Press, Beijing, China, p 262, https://​doi.​org/​10.​1145/​2339530.​2339576
Zurück zum Zitat Sakoe H, Chiba S (1971) A dynamic programming approach to continuous speech recognition. Proceedings of the Seventh International Congress on Acoustics, Budapest, Akadémiai Kiadó, Budapest 3:65–69 Sakoe H, Chiba S (1971) A dynamic programming approach to continuous speech recognition. Proceedings of the Seventh International Congress on Acoustics, Budapest, Akadémiai Kiadó, Budapest 3:65–69
Zurück zum Zitat Shifaz A, Pelletier C, Petitjean F, Webb GI (2020) TS-CHIEF: A Scalable and Accurate Forest Algorithm for Time Series Classification. arXiv:190610329 [cs, stat] Shifaz A, Pelletier C, Petitjean F, Webb GI (2020) TS-CHIEF: A Scalable and Accurate Forest Algorithm for Time Series Classification. arXiv:​190610329 [cs, stat]
Zurück zum Zitat Zhu Q, Batista G, Rakthanmanon T, Keogh E (2012) A Novel Approximation to Dynamic Time Warping allows Anytime Clustering of Massive Time Series Datasets. In: Proceedings of the 2012 SIAM International Conference on Data Mining, Society for Industrial and Applied Mathematics, pp 999–1010, https://doi.org/10.1137/1.9781611972825.86 Zhu Q, Batista G, Rakthanmanon T, Keogh E (2012) A Novel Approximation to Dynamic Time Warping allows Anytime Clustering of Massive Time Series Datasets. In: Proceedings of the 2012 SIAM International Conference on Data Mining, Society for Industrial and Applied Mathematics, pp 999–1010, https://​doi.​org/​10.​1137/​1.​9781611972825.​86
Metadaten
Titel
Early abandoning and pruning for elastic distances including dynamic time warping
verfasst von
Matthieu Herrmann
Geoffrey I. Webb
Publikationsdatum
16.08.2021
Verlag
Springer US
Erschienen in
Data Mining and Knowledge Discovery / Ausgabe 6/2021
Print ISSN: 1384-5810
Elektronische ISSN: 1573-756X
DOI
https://doi.org/10.1007/s10618-021-00782-4

Weitere Artikel der Ausgabe 6/2021

Data Mining and Knowledge Discovery 6/2021 Zur Ausgabe

Premium Partner