Skip to main content
Erschienen in: Knowledge and Information Systems 3/2019

30.01.2019 | Regular Paper

Mining frequent and top-K High Utility Time Interval-based Events with Duration patterns

verfasst von: Jen-Wei Huang, Bijay Prasad Jaysawal, Kuan-Ying Chen, Yong-Bin Wu

Erschienen in: Knowledge and Information Systems | Ausgabe 3/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Traditional frequent sequential pattern mining only considers the time point-based item or event in the patterns. However, in many application, the events may span over multiple time points and the relations among events are also important. Time interval-based pattern mining is proposed to mine the interesting patterns of events that span over some time periods and also by considering the relations among events. Previous works of time interval-based pattern mining focused on the relations between events without considering the duration of each event. However, the same event with different time duration may cause different results. In this work, we propose two algorithms, SARA and SARS, for mining frequent time interval-based events with duration, TIED, patterns. TIED patterns not only keep the relations between two events but also reveal the time periods when each event happens and ends. For the performance evaluation, we propose a naive algorithm and modify a previous algorithm along with the implementation of SARA and SARS. The experimental results show that SARA and SARS are more efficient in terms of execution time and memory usage than other two algorithms. Moreover, we extend this work by considering utility value or importance of event in each time stamp. Therefore, we propose another new High Utility Time Interval-based Events with Duration, HU-TIED, pattern. HU-TIED incorporates the concept of utility pattern mining and TIED pattern mining. We design an algorithm, LMSpan, to mine top-K HU-TIED patterns. For the performance evaluation, we design a baseline algorithm, GenerateNCheck to compare with LMSpan. LMSpan takes less time and memory and generates less candidates than GenerateNCheck.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th international conference on very large data base, pp 487–499 Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th international conference on very large data base, pp 487–499
2.
Zurück zum Zitat Agrawal R, Srikant R (1995) Mining sequential patterns. In: Proceedings of the 12nd IEEE international conference on data engineering, pp 3–14 Agrawal R, Srikant R (1995) Mining sequential patterns. In: Proceedings of the 12nd IEEE international conference on data engineering, pp 3–14
3.
Zurück zum Zitat Allen J (1983) Maintaining knowledge about temporal intervals. Commun ACM 26(11):832–843CrossRef Allen J (1983) Maintaining knowledge about temporal intervals. Commun ACM 26(11):832–843CrossRef
4.
Zurück zum Zitat Batal I, Fradkin D, Harrison J, Moerchen F, Hauskrecht M (2012) Mining recent temporal patterns for event detection in multivariate time series data. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 280–288 Batal I, Fradkin D, Harrison J, Moerchen F, Hauskrecht M (2012) Mining recent temporal patterns for event detection in multivariate time series data. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 280–288
5.
Zurück zum Zitat Bergen B, Chang N (2005) Embodied construction grammar in simulation-based language understanding. In: Construction grammars: cognitive grounding and theoretical extensions, vol 3. John Benjamins, Amsterdam, pp 147–190CrossRef Bergen B, Chang N (2005) Embodied construction grammar in simulation-based language understanding. In: Construction grammars: cognitive grounding and theoretical extensions, vol 3. John Benjamins, Amsterdam, pp 147–190CrossRef
6.
Zurück zum Zitat Chen KY, Jaysawal BP, Huang JW, Wu YB (2014) Mining frequent time interval-based event with duration patterns from temporal database. In: 2014 international conference on data science and advanced analytics (DSAA), pp 548–554 Chen KY, Jaysawal BP, Huang JW, Wu YB (2014) Mining frequent time interval-based event with duration patterns from temporal database. In: 2014 international conference on data science and advanced analytics (DSAA), pp 548–554
7.
Zurück zum Zitat Chen YC, Chen CC, Peng WC, Lee WC (2014) Mining correlation patterns among appliances in smart home environment. In: Proceedings of the 18th Pacific-Asia conference on knowledge discovery and data mining, pp 222–233CrossRef Chen YC, Chen CC, Peng WC, Lee WC (2014) Mining correlation patterns among appliances in smart home environment. In: Proceedings of the 18th Pacific-Asia conference on knowledge discovery and data mining, pp 222–233CrossRef
8.
Zurück zum Zitat Chen YC, Jiang JC, Peng WC, Lee SY (2010) An efficient algorithm for mining time interval-based patterns in large database. In: Proceedings of the 19th ACM international conference on information and knowledge management, pp 49–58 Chen YC, Jiang JC, Peng WC, Lee SY (2010) An efficient algorithm for mining time interval-based patterns in large database. In: Proceedings of the 19th ACM international conference on information and knowledge management, pp 49–58
9.
Zurück zum Zitat Cheng H, Yan XY, Han JW (2004) IncSpan: incremental mining of sequential patterns in large database. In: Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining, pp 527–532 Cheng H, Yan XY, Han JW (2004) IncSpan: incremental mining of sequential patterns in large database. In: Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining, pp 527–532
10.
Zurück zum Zitat Erwin A, Gopalan RP, Achuthan NR (2008) Efficient mining of high utility itemsets from large datasets. In: Proceedings of the 12th Pacific-Asia conference on knowledge discovery and data mining, pp 554–561 Erwin A, Gopalan RP, Achuthan NR (2008) Efficient mining of high utility itemsets from large datasets. In: Proceedings of the 12th Pacific-Asia conference on knowledge discovery and data mining, pp 554–561
11.
Zurück zum Zitat Guyet T, Quiniou R (2008) Mining temporal patterns with quantitative intervals. In: Proceedings of the 18th IEEE international conference on data mining workshops, pp 218–227 Guyet T, Quiniou R (2008) Mining temporal patterns with quantitative intervals. In: Proceedings of the 18th IEEE international conference on data mining workshops, pp 218–227
12.
Zurück zum Zitat Han JW, Pei J, Yin YW (2000) Mining frequent patterns without candidate generation. In: ACM special interest group on management of data, vol 29(2), pp 1–12CrossRef Han JW, Pei J, Yin YW (2000) Mining frequent patterns without candidate generation. In: ACM special interest group on management of data, vol 29(2), pp 1–12CrossRef
13.
14.
Zurück zum Zitat Huang JW, Tseng CY, Ou JC, Chen MS (2008) A general model for sequential pattern mining with a progressive database. IEEE Trans Knowl Data Eng 20(9):1153–1167CrossRef Huang JW, Tseng CY, Ou JC, Chen MS (2008) A general model for sequential pattern mining with a progressive database. IEEE Trans Knowl Data Eng 20(9):1153–1167CrossRef
17.
Zurück zum Zitat Liu J, Wang K, Fung BCM (2012) Direct discovery of high utility itemsets without candidate generation. In: 2012 IEEE 12th international conference on data mining, pp. 984–989 Liu J, Wang K, Fung BCM (2012) Direct discovery of high utility itemsets without candidate generation. In: 2012 IEEE 12th international conference on data mining, pp. 984–989
18.
Zurück zum Zitat Liu J, Wang K, Fung BCM (2016) Mining high utility patterns in one phase without generating candidates. IEEE Trans Knowl Data Eng 28(5):1245–1257CrossRef Liu J, Wang K, Fung BCM (2016) Mining high utility patterns in one phase without generating candidates. IEEE Trans Knowl Data Eng 28(5):1245–1257CrossRef
19.
Zurück zum Zitat Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM international conference on information and knowledge management, CIKM ’12. ACM, New York, pp 55–64 Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM international conference on information and knowledge management, CIKM ’12. ACM, New York, pp 55–64
20.
Zurück zum Zitat Mörchen F, Fradkin D (2010) Robust mining of time intervals with semi-interval partial order patterns. In: Proceedings of the 2010 SIAM international conference on data mining. SIAM, Philadelphia, pp 315–326 Mörchen F, Fradkin D (2010) Robust mining of time intervals with semi-interval partial order patterns. In: Proceedings of the 2010 SIAM international conference on data mining. SIAM, Philadelphia, pp 315–326
21.
Zurück zum Zitat Nakagaito F, Ozaki T, Ohkawa T (2009) Discovery of quantitative sequential patterns from event sequences. In: Proceedings of the 19th IEEE international conference on data mining workshops, pp 31–36 Nakagaito F, Ozaki T, Ohkawa T (2009) Discovery of quantitative sequential patterns from event sequences. In: Proceedings of the 19th IEEE international conference on data mining workshops, pp 31–36
22.
Zurück zum Zitat Papapetrou P, Kollios G, Sclaroff S (2009) Mining frequent arrangements of temporal intervals. Knowl Inf Syst 21(2):133–171CrossRef Papapetrou P, Kollios G, Sclaroff S (2009) Mining frequent arrangements of temporal intervals. Knowl Inf Syst 21(2):133–171CrossRef
23.
Zurück zum Zitat Papapetrou P, Kollios G, Sclaroff S, Gunopulos D (2005) Discovering frequent arrangements of temporal intervals. In: Proceedings of the 15th IEEE international conference on data mining, pp 354–361 Papapetrou P, Kollios G, Sclaroff S, Gunopulos D (2005) Discovering frequent arrangements of temporal intervals. In: Proceedings of the 15th IEEE international conference on data mining, pp 354–361
24.
Zurück zum Zitat Patel D, Hsu W, Lee ML (2008) Mining relationships among interval-based events for classification. In: Proceedings of the ACM special interest group on management of data, pp 393–404 Patel D, Hsu W, Lee ML (2008) Mining relationships among interval-based events for classification. In: Proceedings of the ACM special interest group on management of data, pp 393–404
25.
Zurück zum Zitat Pei J, Han J, Mortazavi-Asl B, Wang J, Pinto H, Chen Q, Dayal U, Hsu MC (2004) Mining sequential patterns by pattern-growth: the prefixspan approach. IEEE Trans Knowl Data Eng 16(11):1424–1440CrossRef Pei J, Han J, Mortazavi-Asl B, Wang J, Pinto H, Chen Q, Dayal U, Hsu MC (2004) Mining sequential patterns by pattern-growth: the prefixspan approach. IEEE Trans Knowl Data Eng 16(11):1424–1440CrossRef
26.
Zurück zum Zitat Ryang H, Yun U (2015) Top-k high utility pattern mining with effective threshold raising strategies. Knowl Based Syst 76:109–126CrossRef Ryang H, Yun U (2015) Top-k high utility pattern mining with effective threshold raising strategies. Knowl Based Syst 76:109–126CrossRef
27.
Zurück zum Zitat Tseng VS, Wu CW, Fournier-Viger P, Yu PS (2016) Efficient algorithms for mining top-k high utility itemsets. IEEE Trans Knowl Data Eng 28(1):54–67CrossRef Tseng VS, Wu CW, Fournier-Viger P, Yu PS (2016) Efficient algorithms for mining top-k high utility itemsets. IEEE Trans Knowl Data Eng 28(1):54–67CrossRef
28.
Zurück zum Zitat Tseng VS, Wu CW, Shie BE, Yu PS (2010) UP-Growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, pp 253–262 Tseng VS, Wu CW, Shie BE, Yu PS (2010) UP-Growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, pp 253–262
29.
Zurück zum Zitat Wu CW, Lin YF, Yu PS, Tseng VS (2013) Mining high utility episodes in complex event sequences. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’13. ACM, New York, pp 536–544 Wu CW, Lin YF, Yu PS, Tseng VS (2013) Mining high utility episodes in complex event sequences. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’13. ACM, New York, pp 536–544
30.
Zurück zum Zitat Wu CW, Shie BE, Tseng VS, Yu PS (2012) Mining top-K high utility itemsets. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 78–86 Wu CW, Shie BE, Tseng VS, Yu PS (2012) Mining top-K high utility itemsets. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 78–86
31.
Zurück zum Zitat Wu SY, Chen YL (2007) Mining nonambiguous temporal patterns for interval-based events. IEEE Trans Knowl Data Eng 19(6):742–758CrossRef Wu SY, Chen YL (2007) Mining nonambiguous temporal patterns for interval-based events. IEEE Trans Knowl Data Eng 19(6):742–758CrossRef
32.
Zurück zum Zitat Yao H, Hamilton HJ, Geng L (2006) A unified framework for utility-based measures for mining itemsets. In: Proceedings of 2nd workshop on utility-based data mining, joint with the 11th ACM SIGKDD international conference on knowledge discovery and data mining, pp 28–37 Yao H, Hamilton HJ, Geng L (2006) A unified framework for utility-based measures for mining itemsets. In: Proceedings of 2nd workshop on utility-based data mining, joint with the 11th ACM SIGKDD international conference on knowledge discovery and data mining, pp 28–37
33.
Zurück zum Zitat Yin JF, Zheng ZG, Cao LB (2012) USpan: an efficient algorithm for mining high utility sequential patterns. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 660–668 Yin JF, Zheng ZG, Cao LB (2012) USpan: an efficient algorithm for mining high utility sequential patterns. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 660–668
34.
Zurück zum Zitat Yin JF, Zheng ZG, Cao LB, Song Y, Wei W (2013) Efficiently mining top-K high utility sequential patterns. In: Proceedings of the 23th IEEE international conference on data mining, pp 1259–1264 Yin JF, Zheng ZG, Cao LB, Song Y, Wei W (2013) Efficiently mining top-K high utility sequential patterns. In: Proceedings of the 23th IEEE international conference on data mining, pp 1259–1264
Metadaten
Titel
Mining frequent and top-K High Utility Time Interval-based Events with Duration patterns
verfasst von
Jen-Wei Huang
Bijay Prasad Jaysawal
Kuan-Ying Chen
Yong-Bin Wu
Publikationsdatum
30.01.2019
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 3/2019
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-019-01333-6

Weitere Artikel der Ausgabe 3/2019

Knowledge and Information Systems 3/2019 Zur Ausgabe

Premium Partner