Skip to main content
Top
Published in: World Wide Web 6/2020

21-05-2020

Extracting diverse-shapelets for early classification on time series

Authors: Wenhe Yan, Guiling Li, Zongda Wu, Senzhang Wang, Philip S. Yu

Published in: World Wide Web | Issue 6/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In recent years, early classification on time series has become increasingly important in time-sensitive applications. Existing shapelet based methods still cannot work well on this problem. First, the effectiveness of traditional shapelet based methods would be influenced by the number of shapelet candidates. Second, it is difficult for previous methods to obtain diverse shapelets in shapelet selection. In this paper, we propose an Improved Early Distinctive Shapelet Classification method named IEDSC. We first present a new method to more precisely measure the similarity between time series, which takes into account of the relative trend of time series. Second, in shapelet extraction, we propose a pruning technique to reduce the number of shapelets by predicting the starting positions of shapelets with good quality. In addition, a new shapelet selection method is also proposed to remove the similar shapelets, so as to maintain the diversity of shapelets. Finally, the experimental results on 16 benchmark datasets show that the proposed method outperforms state-of-the-art for early classification on time series.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Ando, S., Suzuki, E.: Minimizing response time in time series classification. Knowl. Inf. Syst. 46(2), 449–476 (2016)CrossRef Ando, S., Suzuki, E.: Minimizing response time in time series classification. Knowl. Inf. Syst. 46(2), 449–476 (2016)CrossRef
2.
go back to reference Bentley, J. L., Sedgewick, R.: Fast algorithms for sorting and searching strings. In: 8th Acm-Siam symposium on discrete algorithms, pp. 360–369 (1997) Bentley, J. L., Sedgewick, R.: Fast algorithms for sorting and searching strings. In: 8th Acm-Siam symposium on discrete algorithms, pp. 360–369 (1997)
3.
go back to reference Chiu, B., Keogh, E., Lonardi, S.: Probabilistic discovery of time series motifs. In: Proc.acm Sigkdd int.conf.on knowledge discovery & data mining, pp 493–498 (2003) Chiu, B., Keogh, E., Lonardi, S.: Probabilistic discovery of time series motifs. In: Proc.acm Sigkdd int.conf.on knowledge discovery & data mining, pp 493–498 (2003)
5.
go back to reference Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7(Jan), 1–30 (2006)MathSciNetMATH Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7(Jan), 1–30 (2006)MathSciNetMATH
6.
go back to reference Di Marzio, M., Taylor, C. C.: Kernel density classification and boosting: an l2 analysis. Stat. Comput. 15(2), 113–123 (2005)MathSciNetCrossRef Di Marzio, M., Taylor, C. C.: Kernel density classification and boosting: an l2 analysis. Stat. Comput. 15(2), 113–123 (2005)MathSciNetCrossRef
9.
go back to reference Ghalwash, M. F., Radosavljevic, V., Obradovic, Z.: Utilizing temporal patterns for estimating uncertainty in interpretable early decision making. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, pp. 402–411 (2014) Ghalwash, M. F., Radosavljevic, V., Obradovic, Z.: Utilizing temporal patterns for estimating uncertainty in interpretable early decision making. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, pp. 402–411 (2014)
10.
go back to reference Ghalwash, M. F., Ramljak, D., Obradovic, Z.: Early classification of multivariate time series using a hybrid hmm/svm model. In: Proceedings of the 2012 IEEE international conference on bioinformatics and biomedicine (BIBM), pp. 1–6 (2012) Ghalwash, M. F., Ramljak, D., Obradovic, Z.: Early classification of multivariate time series using a hybrid hmm/svm model. In: Proceedings of the 2012 IEEE international conference on bioinformatics and biomedicine (BIBM), pp. 1–6 (2012)
11.
go back to reference Grabocka, J., Schilling, N., Wistuba, M., Schmidt-Thieme, L.: Learning time-series shapelets. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’14, pp. 392–401 (2014) Grabocka, J., Schilling, N., Wistuba, M., Schmidt-Thieme, L.: Learning time-series shapelets. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’14, pp. 392–401 (2014)
12.
go back to reference Hartvigsen, T., Sen, C., Kong, X., Rundensteiner, E.: Adaptive-halting policy network for early classification. In: ACM SIGKDD international conference on knowledge discovery and data mining, pp. 101–110 (2019) Hartvigsen, T., Sen, C., Kong, X., Rundensteiner, E.: Adaptive-halting policy network for early classification. In: ACM SIGKDD international conference on knowledge discovery and data mining, pp. 101–110 (2019)
13.
go back to reference He, G., Duan, Y., Peng, R., Jing, X., Qian, T., Wang, L.: Early classification on multivariate time series. Neurocomputing 149, 777–787 (2015)CrossRef He, G., Duan, Y., Peng, R., Jing, X., Qian, T., Wang, L.: Early classification on multivariate time series. Neurocomputing 149, 777–787 (2015)CrossRef
14.
go back to reference He, G., Zhao, W., Xia, X., Peng, R., Wu, X.: An ensemble of shapelet-based classifiers on inter-class and intra-class imbalanced multivariate time series at the early stage. Soft Computing (2018) He, G., Zhao, W., Xia, X., Peng, R., Wu, X.: An ensemble of shapelet-based classifiers on inter-class and intra-class imbalanced multivariate time series at the early stage. Soft Computing (2018)
15.
go back to reference Jiang, L., Li, C., Cai, Z.: Learning decision tree for ranking. Knowl. Inf. Syst. 20(1), 123–135 (2009)CrossRef Jiang, L., Li, C., Cai, Z.: Learning decision tree for ranking. Knowl. Inf. Syst. 20(1), 123–135 (2009)CrossRef
16.
go back to reference Karlsson, I., Papapetrou, P., Boström, H.: Early random shapelet forest. In: Calders, T., Ceci, M., Malerba, D. (eds.) Discovery science, pp 261–276. Springer International Publishing, Cham (2016) Karlsson, I., Papapetrou, P., Boström, H.: Early random shapelet forest. In: Calders, T., Ceci, M., Malerba, D. (eds.) Discovery science, pp 261–276. Springer International Publishing, Cham (2016)
17.
go back to reference Keller, J. M., Gray, M. R., Givens, J. A.: A fuzzy k-nearest neighbor algorithm. IEEE Trans. Syst. Man Cybern. 4, 580–585 (1985)CrossRef Keller, J. M., Gray, M. R., Givens, J. A.: A fuzzy k-nearest neighbor algorithm. IEEE Trans. Syst. Man Cybern. 4, 580–585 (1985)CrossRef
18.
go back to reference Keogh, E., Jessica, L., Ada, F.: Hot sax: Finding the most unusual time series subsequence: Algorithms and applications. In: International conference on data mining, pp. 1–27 (2008) Keogh, E., Jessica, L., Ada, F.: Hot sax: Finding the most unusual time series subsequence: Algorithms and applications. In: International conference on data mining, pp. 1–27 (2008)
19.
go back to reference Li, G., Bräysy, O., Jiang, L., Wu, Z., Wang, Y.: Finding time series discord based on bit representation clustering. Knowl.-Based Syst. 54, 243–254 (2013)CrossRef Li, G., Bräysy, O., Jiang, L., Wu, Z., Wang, Y.: Finding time series discord based on bit representation clustering. Knowl.-Based Syst. 54, 243–254 (2013)CrossRef
20.
go back to reference Li, G., Yan, W., Wu, Z.: Discovering shapelets with key points in time series classification. Expert Syst. Appl. 132, 76–86 (2019)CrossRef Li, G., Yan, W., Wu, Z.: Discovering shapelets with key points in time series classification. Expert Syst. Appl. 132, 76–86 (2019)CrossRef
21.
go back to reference Lin, T. H., Kaminski, N., Bar-Joseph, Z.: Alignment and classification of time series gene expression in clinical studies. Bioinformatics 24(13), 147–155 (2008)CrossRef Lin, T. H., Kaminski, N., Bar-Joseph, Z.: Alignment and classification of time series gene expression in clinical studies. Bioinformatics 24(13), 147–155 (2008)CrossRef
22.
go back to reference Lines, J., Davis, L. M., Hills, J., Bagnall, A.: A shapelet transform for time series classification. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’12, pp. 289–297. ACM (2012) Lines, J., Davis, L. M., Hills, J., Bagnall, A.: A shapelet transform for time series classification. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’12, pp. 289–297. ACM (2012)
23.
go back to reference Ma, C., Weng, X., Shan, Z.: Early classification of multivariate time series based on piecewise aggregate approximation. In: Health information science, pp. 81–88 (2017) Ma, C., Weng, X., Shan, Z.: Early classification of multivariate time series based on piecewise aggregate approximation. In: Health information science, pp. 81–88 (2017)
24.
go back to reference Mori, U., Mendiburu, A., Dasgupta, S., Lozano, J. A.: Early classification of time series by simultaneously optimizing the accuracy and earliness. IEEE Transactions on Neural Networks and Learning Systems (2017) Mori, U., Mendiburu, A., Dasgupta, S., Lozano, J. A.: Early classification of time series by simultaneously optimizing the accuracy and earliness. IEEE Transactions on Neural Networks and Learning Systems (2017)
25.
go back to reference Mori, U., Mendiburu, A., Keogh, E., Lozano, J. A.: Reliable early classification of time series based on discriminating the classes over time. Data Min. Knowl. Disc. 31(1), 233–263 (2017)MathSciNetCrossRef Mori, U., Mendiburu, A., Keogh, E., Lozano, J. A.: Reliable early classification of time series based on discriminating the classes over time. Data Min. Knowl. Disc. 31(1), 233–263 (2017)MathSciNetCrossRef
26.
go back to reference Parrish, N., Anderson, H. S., Gupta, M. R., Hsiao, D. Y.: Classifying with confidence from incomplete information. J. Mach. Learn. Res. 14(1), 3561–3589 (2013)MathSciNetMATH Parrish, N., Anderson, H. S., Gupta, M. R., Hsiao, D. Y.: Classifying with confidence from incomplete information. J. Mach. Learn. Res. 14(1), 3561–3589 (2013)MathSciNetMATH
27.
go back to reference Romain, T., Simon, M.: Cost-aware early classification of time series. In: Machine learning and knowledge discovery in databases, pp. 632–647 (2016) Romain, T., Simon, M.: Cost-aware early classification of time series. In: Machine learning and knowledge discovery in databases, pp. 632–647 (2016)
28.
go back to reference Sangnier, M., Gauthier, J., Rakotomamonjy, A.: Early and reliable event detection using proximity space representation. In: Proceedings of the 33rd international conference on international conference on machine learning - vol. 48, ICML’16, pp. 2310–2319 (2016) Sangnier, M., Gauthier, J., Rakotomamonjy, A.: Early and reliable event detection using proximity space representation. In: Proceedings of the 33rd international conference on international conference on machine learning - vol. 48, ICML’16, pp. 2310–2319 (2016)
29.
go back to reference Schäfer, P., Leser, U.: Teaser: Early and accurate time series classification. arXiv:1908.03405 (2019) Schäfer, P., Leser, U.: Teaser: Early and accurate time series classification. arXiv:1908.​03405 (2019)
30.
go back to reference Song, W., Wang, L., Xiang, Y., Zomaya, A. Y.: Geographic spatiotemporal big data correlation analysis via the hilbert-huang transformation. J. Comput. Syst. Sci. 89, 130–141 (2017)MathSciNetMATHCrossRef Song, W., Wang, L., Xiang, Y., Zomaya, A. Y.: Geographic spatiotemporal big data correlation analysis via the hilbert-huang transformation. J. Comput. Syst. Sci. 89, 130–141 (2017)MathSciNetMATHCrossRef
31.
go back to reference Wang, S., Cao, J., Yu, P.S.: Deep learning for spatio-temporal data mining: A survey. arXiv:1906.04928 (2019) Wang, S., Cao, J., Yu, P.S.: Deep learning for spatio-temporal data mining: A survey. arXiv:1906.​04928 (2019)
32.
go back to reference Wang, W., Chen, C., Wang, W., Rai, P., Carin, L.: Earliness-aware deep convolutional networks for early time series classification. arXiv:1611.04578 (2016) Wang, W., Chen, C., Wang, W., Rai, P., Carin, L.: Earliness-aware deep convolutional networks for early time series classification. arXiv:1611.​04578 (2016)
33.
go back to reference Wu, J., Pan, S., Zhu, X., Cai, Z.: Boosting for multi-graph classification. IEEE Trans. Cybern. 45(3), 430–443 (2015)CrossRef Wu, J., Pan, S., Zhu, X., Cai, Z.: Boosting for multi-graph classification. IEEE Trans. Cybern. 45(3), 430–443 (2015)CrossRef
34.
go back to reference Xing, Z., Pei, J., Yu, P. S.: Early prediction on time series: A nearest neighbor approach. In: International jont conference on artifical intelligence, pp. 1297–1302 (2009) Xing, Z., Pei, J., Yu, P. S.: Early prediction on time series: A nearest neighbor approach. In: International jont conference on artifical intelligence, pp. 1297–1302 (2009)
35.
go back to reference Xing, Z., Pei, J., Yu, P. S.: Early classification on time series. Knowl. Inf. Syst. 31(1), 105–127 (2012)CrossRef Xing, Z., Pei, J., Yu, P. S.: Early classification on time series. Knowl. Inf. Syst. 31(1), 105–127 (2012)CrossRef
36.
go back to reference Xing, Z., Pei, J., Yu, P. S., Wang, K.: Extracting interpretable features for early classification on time series. In: 11th Siam international conference on data mining, SDM 2011, April 28-30, 2011, Mesa, Arizona, USA, pp. 247–258 (2011) Xing, Z., Pei, J., Yu, P. S., Wang, K.: Extracting interpretable features for early classification on time series. In: 11th Siam international conference on data mining, SDM 2011, April 28-30, 2011, Mesa, Arizona, USA, pp. 247–258 (2011)
37.
go back to reference Ye, L., Keogh, E.: Time series shapelets:a new primitive for data mining. In: ACM SIGKDD international conference on knowledge discovery and data mining, Paris, France, June 28 - July, pp. 947–956 (2009) Ye, L., Keogh, E.: Time series shapelets:a new primitive for data mining. In: ACM SIGKDD international conference on knowledge discovery and data mining, Paris, France, June 28 - July, pp. 947–956 (2009)
38.
go back to reference Yeh, C. C. M., Zhu, Y., Ulanova, L., Begum, N., Ding, Y., Dau, H. A., Zimmerman, Z., Silva, D. F., Mueen, A., Keogh, E.: Time series joins, motifs, discords and shapelets: A unifying view that exploits the matrix profile. Data Mining & Knowledge Discovery 32(1), 83–123 (2018)MathSciNetMATHCrossRef Yeh, C. C. M., Zhu, Y., Ulanova, L., Begum, N., Ding, Y., Dau, H. A., Zimmerman, Z., Silva, D. F., Mueen, A., Keogh, E.: Time series joins, motifs, discords and shapelets: A unifying view that exploits the matrix profile. Data Mining & Knowledge Discovery 32(1), 83–123 (2018)MathSciNetMATHCrossRef
39.
go back to reference Zalewski, W., Silva, F., Maletzke, A. G., Ferrero, C. A.: Exploring shapelet transformation for time series classification in decision trees. Knowl.-Based Syst. 112, 80–91 (2016)CrossRef Zalewski, W., Silva, F., Maletzke, A. G., Ferrero, C. A.: Exploring shapelet transformation for time series classification in decision trees. Knowl.-Based Syst. 112, 80–91 (2016)CrossRef
Metadata
Title
Extracting diverse-shapelets for early classification on time series
Authors
Wenhe Yan
Guiling Li
Zongda Wu
Senzhang Wang
Philip S. Yu
Publication date
21-05-2020
Publisher
Springer US
Published in
World Wide Web / Issue 6/2020
Print ISSN: 1386-145X
Electronic ISSN: 1573-1413
DOI
https://doi.org/10.1007/s11280-020-00820-z

Other articles of this Issue 6/2020

World Wide Web 6/2020 Go to the issue

Premium Partner