skip to main content
10.1145/2623330.2623613acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Learning time-series shapelets

Published:24 August 2014Publication History

ABSTRACT

Shapelets are discriminative sub-sequences of time series that best predict the target variable. For this reason, shapelet discovery has recently attracted considerable interest within the time-series research community. Currently shapelets are found by evaluating the prediction qualities of numerous candidates extracted from the series segments. In contrast to the state-of-the-art, this paper proposes a novel perspective in terms of learning shapelets. A new mathematical formalization of the task via a classification objective function is proposed and a tailored stochastic gradient learning algorithm is applied. The proposed method enables learning near-to-optimal shapelets directly without the need to try out lots of candidates. Furthermore, our method can learn true top-K shapelets by capturing their interaction. Extensive experimentation demonstrates statistically significant improvement in terms of wins and ranks against 13 baselines over 28 time-series datasets.

Skip Supplemental Material Section

Supplemental Material

p392-sidebyside.mp4

mp4

233.5 MB

References

  1. K.-W. Chang, B. Deka, W. mei W. Hwu, and D. Roth. Efficient pattern-based time series classification on gpu. In M. J. Zaki, A. Siebes, J. X. Yu, B. Goethals, G. I. Webb, and X. Wu, editors, ICDM, pages 131--140. IEEE Computer Society, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Das and D. Kempe. Algorithms for subset selection in linear regression. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing, STOC '08, pages 45--54, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Demsar. Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res., 7:1--30, Dec. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. H. Ding, G. Trajcevski, P. Scheuermann, X. Wang, and E. J. Keogh. Querying and mining of time series data: experimental comparison of representations and distance measures. PVLDB, 1(2):1542--1552, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. B. Hartmann and N. Link. Gesture recognition with inertial sensors and optimized dtw prototypes. In IEEE International Conference on Systems Man and Cybernetics, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  6. B. Hartmann, I. Schwab, and N. Link. Prototype optimization for temporarily and spatially distorted time series. In the AAAI Spring Symposia, 2010.Google ScholarGoogle Scholar
  7. Q. He, F. Zhuang, T. Shang, Z. Shi, et al. Fast time series classification based on infrequent shapelets. In 11th IEEE International Conference on Machine Learning and Applications, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Hills, J. Lines, E. Baranauskas, J. Mapp, and A. Bagnall. Classification of time series by shapelet transformation. Data Mining and Knowledge Discovery, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Lines and A. Bagnall. Alternative quality measures for time series shapelets. In Intelligent Data Engineering and Automated Learning, volume 7435 of Lecture Notes in Computer Science, pages 475--483. 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Lines, L. Davis, J. Hills, and A. Bagnall. A shapelet transform for time series classification. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Mueen, E. Keogh, and N. Young. Logical-shapelets: an expressive primitive for time series classification. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. Rakthanmanon and E. Keogh. Fast shapelets: A scalable algorithm for discovering time series shapelets. Proceedings of the 13th SIAM International Conference on Data Mining, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  13. P. Sivakumar and T. Shajina. Human gait recognition and classification using time series shapelets. In IEEE International Conference on Advances in Computing and Communications, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. E. W. Wild. Optimization-based Machine Learning and Data Mining. ProQuest, 2008.Google ScholarGoogle Scholar
  15. Z. Xing, J. Pei, and P. Yu. Early classification on time series. Knowledge and information systems, 31(1):105--127, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Z. Xing, J. Pei, P. Yu, and K. Wang. Extracting interpretable features for early classification on time series. Proceedings of the 11th SIAM International Conference on Data Mining, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  17. L. Ye and E. Keogh. Time series shapelets: a new primitive for data mining. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. L. Ye and E. Keogh. Time series shapelets: a novel technique that allows accurate, interpretable and fast classification. Data Mining and Knowledge Discovery, 22(1):149--182, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Zakaria, A. Mueen, and E. Keogh. Clustering time series using unsupervised-shapelets. In Proceedings of the 12th IEEE International Conference on Data Mining, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Learning time-series shapelets

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2014
      2028 pages
      ISBN:9781450329569
      DOI:10.1145/2623330

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 August 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      KDD '14 Paper Acceptance Rate151of1,036submissions,15%Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader