skip to main content
10.1145/1835804.1835941acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Online discovery and maintenance of time series motifs

Published:25 July 2010Publication History

ABSTRACT

The detection of repeated subsequences, time series motifs, is a problem which has been shown to have great utility for several higher-level data mining algorithms, including classification, clustering, segmentation, forecasting, and rule discovery. In recent years there has been significant research effort spent on efficiently discovering these motifs in static offline databases. However, for many domains, the inherent streaming nature of time series demands online discovery and maintenance of time series motifs. In this paper, we develop the first online motif discovery algorithm which monitors and maintains motifs exactly in real time over the most recent history of a stream. Our algorithm has a worst-case update time which is linear to the window size and is extendible to maintain more complex pattern structures. In contrast, the current offline algorithms either need significant update time or require very costly pre-processing steps which online algorithms simply cannot afford.

Our core ideas allow useful extensions of our algorithm to deal with arbitrary data rates and discovering multidimensional motifs. We demonstrate the utility of our algorithms with a variety of case studies in the domains of robotics, acoustic monitoring and online compression.

Skip Supplemental Material Section

Supplemental Material

kdd2010_mueen_odmt_01.mov

mov

135.9 MB

References

  1. Agrawal, R., Faloutsos, C. and Swami, A.N. Efficient Similarity Search in Sequence Databases. FODO 1993: 69--84. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Beaudoin, P., Panne, M., Poulin, P. and Coros, S. Motion-Motif Graphs. ACM/EG Symposium on Computer Animation 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Bespamyatnikh, S. N. An Optimal Algorithm for Closest Pair Maintenance. ACM SCG '95, 152--161. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Bulut, A. and Singh, A.: SWAT: Hierarchical Stream Summarization in Large Networks. In Proceedings of the ICDE (2003).Google ScholarGoogle ScholarCross RefCross Ref
  5. Cardinal, J. and Eppstein, D. Lazy Algorithms for Dynamic Closest Pair with Arbitrary Distance Measures. ALENEX/ANALC 2004.Google ScholarGoogle Scholar
  6. Chiu, B., Keogh, E. and Lonardi, S. Probabilistic Discovery of Time Series Motifs. ACM SIGKDD 2003. pp 493--498. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Cummins, M. and Newman, P. FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance. The International Journal of Robotics Research, 27(6), 647--665, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Datar, M., Gionis, A., Indyk, P., and Motwani, R. Maintaining Stream Statistics over Sliding Windows. SIAM J. Comput. 31, 6 (Jun. 2002), 1794--1813. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Dawson, D. K. and Efford, M. G. Bird Population Density Estimated from Acoustic Signals. Journal of Applied Ecology. Volume 46 Issue 6, Pages 1201--1209.Google ScholarGoogle Scholar
  10. Ding, H., Trajcevski, G., Scheuermann, P., Wang, X. and Keogh, E. Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures. VLDB 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Dohnal, V., Gennaro C. and Zezula, P. Similarity Join in Metric Spaces Using eD-Index. Database and Expert Systems Applications, Volume 2736, pp. 484--493, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Eppstein, D. Fast Hierarchical Clustering and Other Applications of Dynamic Closest Pairs. ACM Journal of Experimental Algorithmics 5:1 (2000). Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Fuchs, E., Gruber, T., Nitschke, J. and Sick, B. On-line Motif Detection in Time Series with SwiftMotif. In: Pattern Recognition 42(11):3015--3031, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Keogh, E., Chu, S., Hart, D. and Pazzani, M. An Online Algorithm for Segmenting Time Series. ICDM, pp. 289--296, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lazaridis, I. and Mehrotra, S. Capturing Sensor-Generated Time Series with Quality Guarantees. ICDE 2003.Google ScholarGoogle Scholar
  16. Lin, J., Keogh, E., Lonardi, S. and Patel, P. Finding Motifs in Time Series, Workshop on Temporal Data Mining (KDD'02), 2002.Google ScholarGoogle Scholar
  17. Mueen, A., Keogh, E., Zhu, Q., Cash, S. and Westover, B. Exact Discovery of Time Series Motif. SDM 2009.Google ScholarGoogle Scholar
  18. N. Tatbul, N., Çetintemel, U., Zdonik, S., Cherniack, M. and Stonebraker. M. Load Shedding in a Data Stream Manager. VLDB 2003, pp. 309--320. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Nanopoulos A., Theodoridis Y. and Manolopoulos, Y. C2P: Clustering Based on Closest Pairs. VLDB, pp. 331--340, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Odlyzko, A.M. and Rains, E.M. On Longest Increasing Subsequences in Random Permutation, Analysis, Geometry, Number Theory: the Mathematics of Leon Ehrenpreis.439--451, Contemp. Math., 251, Amer. Math. Soc., Providence, RI, 2000.Google ScholarGoogle Scholar
  21. Ogras, Y. and Ferhatosmanoglu, H. Online Summarization of Dynamic Time Series Data. The VLDB Journal, 15(1):84--98, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Palpanas, T., Vlachos, M., Keogh, E., Gunopulos, D. and Truppel, W. Online Amnesic Approximation of Streaming Time Series. In ICDE 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Patel, P., Keogh, E., Lin, J. and Lonardi, S. Mining Motifs in Massive Time Series Databases. ICDM 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Patnaik, D., Marwah, M., Sharma, R.K. and Ramakrishnan, N. Sustainable Operation and Management of Data Center Chillers using Temporal Data Mining. KDD 2009: 1305--1314. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Penteriani, V. Variation in the Function of Eagle Owl Vocal Behaviour: Territorial Defence and Intra-Pair Communication? Ethol. Ecol. Evol. 14: 275--281.Google ScholarGoogle ScholarCross RefCross Ref
  26. Trifa, V.M., Girod, L., Collier, T., Blumstein, D.T. and Taylor, C.E. Automated Wildlife Monitoring Using Self-Configuring Sensor Networks Deployed in Natural Habitats. AROB 2007.Google ScholarGoogle Scholar
  27. Vahdatpour, A., Amini, N. and Sarrafzadeh, M. Towards Unsupervised Activity Discovery using Multi Dimensional Motif Detection in Time Series. 21st International Joint Conference on Artificial Intelligence (IJCAI) 2009, Pasadena, California. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Weber R., Schek, H-J. and Blott, S. A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces. VLDB, pp. 194--205, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Wurst, M., Morik, K. and Mierswa, I. Localized Alternative Cluster Ensembles for Collaborative Structuring. ECML 2006. pp. 485--496. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Yankov, D., Keogh, E., Medina, J., Chiu, B. and Zordan V. Detecting Motifs under Uniform Scaling. SIGKDD 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Supporting webpage containing Data, Code, Videos, Excel sheet and Presentation slides. Link: http://www.cs.ucr.edu/~mueen/OnlineMotif/index.htmlGoogle ScholarGoogle Scholar

Index Terms

  1. Online discovery and maintenance of time series motifs

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN:9781450300551
      DOI:10.1145/1835804

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 July 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader