skip to main content
research-article

Anticipatory DTW for efficient similarity search in time series databases

Published:01 August 2009Publication History
Skip Abstract Section

Abstract

Time series arise in many different applications in the form of sensor data, stocks data, videos, and other time-related information. Analysis of this data typically requires searching for similar time series in a database. Dynamic Time Warping (DTW) is a widely used high-quality distance measure for time series. As DTW is computationally expensive, efficient algorithms for fast computation are crucial.

In this paper, we propose a novel filter-and-refine DTW algorithm called Anticipatory DTW. Existing algorithms aim at efficiently finding similar time series by filtering the database and computing the DTW in the refinement step. Unlike these algorithms, our approach exploits previously unused information from the filter step during the refinement, allowing for faster rejection of false candidates. We characterize a class of applicable filters for our approach, which comprises state-of-the-art lower bounds of the DTW.

Our novel anticipatory pruning incurs hardly any over-head and no false dismissals. We demonstrate substantial efficiency improvements in thorough experiments on synthetic and real world time series databases and show that our technique is highly scalable to multivariate, long time series and wide DTW bands.

References

  1. J. Aach and G. M. Church. Aligning gene expression time series with time warping algorithms. Bioinformatics, 17(6):495--508, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  2. I. Assent and H. Kremer. Robust adaptable video copy detection. In SSTD. Springer, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. I. Assent, A. Wenning, and T. Seidl. Approximation techniques for indexing the Earth Mover's Distance in multimedia databases. In ICDE, page 11, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. V. Athitsos, P. Papapetrou, M. Potamias, G. Kollios, and D. Gunopulos. Approximate embedding-based subsequence matching of time series. In SIGMOD, pages 365--378, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. J. Berndt and J. Clifford. Using dynamic time warping to find patterns in time series. In AAAI Workshop on KDD, pages 229--248, 1994.Google ScholarGoogle Scholar
  6. F. K.-P. Chan, A. W.-C. Fu, and C. Yu. Haar wavelets for efficient similarity search of time-series: With and without time warping. TKDE, 15(3):686--705, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A.-P. Chen, S.-F. Lin, and Y.-C. Cheng. Time registration of two image sequences by dynamic time warping. In Proc. ICNSC, pages 418--423, 2004.Google ScholarGoogle Scholar
  8. L. Chen and R. Ng. On the marriage of Lp-norms and Edit Distance. In VLDB, pages 792--803, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Chu, E. J. Keogh, D. Hart, and M. J. Pazzani. Iterative deepening dynamic time warping for time series. In SDM, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  10. H. Ding, G. Trajcevski, P. Scheuermann, X. Wang, and E. Keogh. Querying and mining of time series data: Experimental comparison of representations and distance measures. In VLDB, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. Faloutsos. Searching Multimedia Databases by Content. Kluwer Academic Publishers, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. F. Itakura. Minimum prediction residual principle applied to speech recognition. IEEE Trans. Acoust., Speech, Signal Processing, 23(1):67--72, 1975.Google ScholarGoogle ScholarCross RefCross Ref
  13. M. W. Kadous. Australian sign language data set 1, 1999. www.cse.unsw.edu.au/~waleed/tml/data/.Google ScholarGoogle Scholar
  14. E. J. Keogh. Exact indexing of dynamic time warping. In VLDB, pages 406--417, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. E. J. Keogh, K. Chakrabarti, M. Pazzani, and S. Mehrotra. Locally adaptive dimensionality reduction for indexing large time series databases. SIGMOD Rec., 30(2):151--162, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. E. J. Keogh, K. Chakrabarti, M. J. Pazzani, and S. Mehrotra. Dimensionality reduction for fast similarity search in large time series databases. KAIS, 3(3):263--286, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  17. E. J. Keogh, L. Wei, X. Xi, S. Lee, and M. Vlachos. LB_Keogh supports exact indexing of shapes under rotation invariance with arbitrary representations and distance measures. In VLDB, pages 882--893, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. S. Kim, S. Park, and W. Chu. An index-based approach for similarity search supporting time warping in large sequence databases. In ICDE, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. C. A. Ratanamahatana and E. J. Keogh. Making time-series classification more accurate using learned constraints. In SDM, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  20. T. M. Rath and R. Manmatha. Lower-bounding of dynamic time warping distances for multivariate time series, 2003.Google ScholarGoogle Scholar
  21. H. Sakoe and S. Chiba. Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust., Speech, Signal Processing, 26(1):43--49, 1978.Google ScholarGoogle ScholarCross RefCross Ref
  22. Y. Sakurai, C. Faloutsos, and M. Yamamuro. Stream monitoring under the time warping distance. In ICDE, pages 1046--1055, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  23. Y. Sakurai, M. Yoshikawa, and C. Faloutsos. FTW: fast similarity search under the time warping distance. In PODS, pages 326--337, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Salvador and P. Chan. FastDTW: Toward accurate dynamic time warping in linear time and space. In KDD TDM, 2004.Google ScholarGoogle Scholar
  25. S. Salvador and P. Chan. Toward accurate dynamic time warping in linear time and space. Intelligent Data Analysis, 11(5):561--580, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. T. Seidl and H.-P. Kriegel. Optimal multi-step k-nearest neighbor search. In SIGMOD, pages 154--165. ACM, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In Proc. MIR, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. M. Vlachos, M. Hadjieleftheriou, D. Gunopulos, and E. J. Keogh. Indexing multi-dimensional time-series with support for multiple distance measures. In SIGKDD, pages 216--225, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. B. K. Yi, H. V. Jagadish, and C. Faloutsos. Efficient retrieval of similar time sequences under time warping. In ICDE, pages 201--208, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Zhou and M. H. Wong. Boundary-based lower-bound functions for dynamic time warping and their indexing. In ICDE, pages 1307--1311, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  31. M. Zhou and M. H. Wong. Efficient online subsequence searching in data streams under dynamic time warping distance. In ICDE, pages 686--695, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Y. Zhu and D. Shasha. Warping indexes with envelope transforms for query by humming. In SIGMOD, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Anticipatory DTW for efficient similarity search in time series databases

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader