Skip to main content
Top
Published in: Knowledge and Information Systems 6/2023

03-02-2023 | Regular Paper

FCHM-stream: fast closed high utility itemsets mining over data streams

Authors: Muhang Li, Meng Han, Zhiqiang Chen, Hongxin Wu, Xilong Zhang

Published in: Knowledge and Information Systems | Issue 6/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The high-speed, continuous and endless characteristics of data streams make it a challenging task to quickly mine high utility itemsets in limited memory space. The sliding window model, which focuses only on the most recent data, has received extensive research and attention as it can effectively adapt to the data stream environment. However, the presence of many communal batches in adjacent sliding windows causes the algorithm to repeatedly generate a large number of identical itemsets, which reduces the spatiotemporal performance of the algorithm. In order to solve these problems and provide users with a concise and lossless resultset, a new closed high utility pattern mining algorithm over data stream is proposed, named FCHM-Stream. A new utility list structure based on batch division and a resultset maintenance strategy based on skip-list structure are designed to effectively reduce identical itemsets repeatedly generated and thus reduce the running time of the algorithm. Extensive experimental results show that the proposed algorithm has a large improvement in runtime compared to the state-of-the-art algorithms.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Zida S, Fournier-Viger P, Lin JC-W, Wu C-W, Tseng VS (2017) EFIM: a fast and memory efficient algorithm for high-utility itemset mining. Knowl Inf Syst 51(2):595–625CrossRef Zida S, Fournier-Viger P, Lin JC-W, Wu C-W, Tseng VS (2017) EFIM: a fast and memory efficient algorithm for high-utility itemset mining. Knowl Inf Syst 51(2):595–625CrossRef
2.
go back to reference Krishnamoorthy S (2017) HMiner: efficiently mining high utility itemsets. Expert Syst Appl 90:168–183CrossRef Krishnamoorthy S (2017) HMiner: efficiently mining high utility itemsets. Expert Syst Appl 90:168–183CrossRef
3.
go back to reference Dawar S, Sharma V, Goyal V (2017) Mining top-k high-utility itemsets from a data stream under sliding window model. Appl Intell 47(4):1240–1255CrossRef Dawar S, Sharma V, Goyal V (2017) Mining top-k high-utility itemsets from a data stream under sliding window model. Appl Intell 47(4):1240–1255CrossRef
4.
go back to reference Liu Y, Liao W-k and Choudhary A (2005) A two-phase algorithm for fast discovery of high utility itemsets. In Pacific-Asia conference on knowledge discovery and data mining. Springer Berlin Heidelberg, Berlin, pp. 689–695. Liu Y, Liao W-k and Choudhary A (2005) A two-phase algorithm for fast discovery of high utility itemsets. In Pacific-Asia conference on knowledge discovery and data mining. Springer Berlin Heidelberg, Berlin, pp. 689–695.
5.
go back to reference Tseng VS, Wu C-W, Shie B-E and Yu PS (2010) UP-Growth: an efficient algorithm for high utility itemset mining. Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. pp. 253–262. Tseng VS, Wu C-W, Shie B-E and Yu PS (2010) UP-Growth: an efficient algorithm for high utility itemset mining. Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. pp. 253–262.
6.
go back to reference Liu M and Qu J (2012) Mining high utility itemsets without candidate generation. Proceedings of the 21st ACM international conference on Information and knowledge management. pp. 55–64. Liu M and Qu J (2012) Mining high utility itemsets without candidate generation. Proceedings of the 21st ACM international conference on Information and knowledge management. pp. 55–64.
7.
go back to reference Krishnamoorthy S (2015) Pruning strategies for mining high utility itemsets. Expert Syst Appl 42(5):2371–2381CrossRef Krishnamoorthy S (2015) Pruning strategies for mining high utility itemsets. Expert Syst Appl 42(5):2371–2381CrossRef
8.
go back to reference Fournier-Viger P, Wu C-W, Zida S and Tseng VS (2014) FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. International symposium on methodologies for intelligent systems. Springer International Publishing, Cham, pp. 83–92. Fournier-Viger P, Wu C-W, Zida S and Tseng VS (2014) FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. International symposium on methodologies for intelligent systems. Springer International Publishing, Cham, pp. 83–92.
9.
go back to reference Duong Q-H, Fournier-Viger P, Ramampiaro H, Nørvåg K, Dam T-L (2018) Efficient high utility itemset mining using buffered utility-lists. Appl Intell 48(7):1859–1877CrossRef Duong Q-H, Fournier-Viger P, Ramampiaro H, Nørvåg K, Dam T-L (2018) Efficient high utility itemset mining using buffered utility-lists. Appl Intell 48(7):1859–1877CrossRef
10.
go back to reference Wu P, Niu X, Fournier-Viger P, Huang C, Wang B (2022) UBP-Miner: An efficient bit based high utility itemset mining algorithm. Knowl Based Syst 248:108865CrossRef Wu P, Niu X, Fournier-Viger P, Huang C, Wang B (2022) UBP-Miner: An efficient bit based high utility itemset mining algorithm. Knowl Based Syst 248:108865CrossRef
11.
go back to reference Sohrabi MK (2020) An efficient projection-based method for high utility itemset mining using a novel pruning approach on the utility matrix. Knowl Inf Syst 62(11):4141–4167CrossRef Sohrabi MK (2020) An efficient projection-based method for high utility itemset mining using a novel pruning approach on the utility matrix. Knowl Inf Syst 62(11):4141–4167CrossRef
12.
go back to reference Ahmed CF, Tanbeer SK, Jeong B-S, Choi H-J (2012) Interactive mining of high utility patterns over data streams. Expert Syst Appl 39(15):11979–11991CrossRef Ahmed CF, Tanbeer SK, Jeong B-S, Choi H-J (2012) Interactive mining of high utility patterns over data streams. Expert Syst Appl 39(15):11979–11991CrossRef
13.
go back to reference Baek Y, Yun U, Kim H, Nam H, Kim H, Lin JC-W, Vo B, Pedrycz W (2021) Rhups: Mining recent high utility patterns with sliding window–based arrival time control over data streams. ACM Trans Intell Syst Technol TIST 12(2):1–27CrossRef Baek Y, Yun U, Kim H, Nam H, Kim H, Lin JC-W, Vo B, Pedrycz W (2021) Rhups: Mining recent high utility patterns with sliding window–based arrival time control over data streams. ACM Trans Intell Syst Technol TIST 12(2):1–27CrossRef
14.
go back to reference Ryang H, Yun U (2016) High utility pattern mining over data streams with sliding window technique. Expert Syst Appl 57:214–231CrossRef Ryang H, Yun U (2016) High utility pattern mining over data streams with sliding window technique. Expert Syst Appl 57:214–231CrossRef
15.
go back to reference Jaysawal BP and Huang J-W (2020) Sohupds: a single-pass one-phase algorithm for mining high utility patterns over a data stream. In Proceedings of the 35th annual acm symposium on applied computing. pp. 490–497. Jaysawal BP and Huang J-W (2020) Sohupds: a single-pass one-phase algorithm for mining high utility patterns over a data stream. In Proceedings of the 35th annual acm symposium on applied computing. pp. 490–497.
16.
go back to reference Chen H, Han M, Zhang N, Li X, Wang L (2021) Closed high utility itemsets mining over data stream based on sliding window. J Comput Res Dev 58(11):1–15 Chen H, Han M, Zhang N, Li X, Wang L (2021) Closed high utility itemsets mining over data stream based on sliding window. J Comput Res Dev 58(11):1–15
17.
go back to reference Chen X, Zhai P and Fang Y (2021) High utility pattern mining based on historical data table over data streams. In 2021 4th International Conference on Data Science and Information Technology. pp. 368–376. Chen X, Zhai P and Fang Y (2021) High utility pattern mining based on historical data table over data streams. In 2021 4th International Conference on Data Science and Information Technology. pp. 368–376.
18.
go back to reference Wu C-W, Fournier-Viger P, Gu J-Y and Tseng VS (2015) Mining closed+ high utility itemsets without candidate generation. In 2015 conference on technologies and applications of artificial intelligence (TAAI). IEEE, pp. 187–194. Wu C-W, Fournier-Viger P, Gu J-Y and Tseng VS (2015) Mining closed+ high utility itemsets without candidate generation. In 2015 conference on technologies and applications of artificial intelligence (TAAI). IEEE, pp. 187–194.
19.
go back to reference Tseng VS, Wu C-W, Fournier-Viger P, Philip SY (2014) Efficient algorithms for mining the concise and lossless representation of high utility itemsets. IEEE Trans Knowl Data Eng 27(3):726–739CrossRef Tseng VS, Wu C-W, Fournier-Viger P, Philip SY (2014) Efficient algorithms for mining the concise and lossless representation of high utility itemsets. IEEE Trans Knowl Data Eng 27(3):726–739CrossRef
20.
go back to reference Fournier-Viger P, Zida S, Lin JC-W, Wu C-W, Tseng VS (2016) EFIM-closed: fast and memory efficient discovery of closed high-utility itemsets. In: Perner P (ed) Machine learning and data mining in pattern recognition. Springer, Cham, pp 199–213CrossRef Fournier-Viger P, Zida S, Lin JC-W, Wu C-W, Tseng VS (2016) EFIM-closed: fast and memory efficient discovery of closed high-utility itemsets. In: Perner P (ed) Machine learning and data mining in pattern recognition. Springer, Cham, pp 199–213CrossRef
21.
go back to reference Dam T-L, Li K, Fournier-Viger P, Duong Q-H (2019) CLS-Miner: efficient and effective closed high-utility itemset mining. Front Comp Sci 13(2):357–381CrossRef Dam T-L, Li K, Fournier-Viger P, Duong Q-H (2019) CLS-Miner: efficient and effective closed high-utility itemset mining. Front Comp Sci 13(2):357–381CrossRef
22.
go back to reference Nguyen LT, Vu VV, Lam MT, Duong TT, Manh LT, Nguyen TT, Vo B, Fujita H (2019) An efficient method for mining high utility closed itemsets. Inf Sci 495:78–99CrossRef Nguyen LT, Vu VV, Lam MT, Duong TT, Manh LT, Nguyen TT, Vo B, Fujita H (2019) An efficient method for mining high utility closed itemsets. Inf Sci 495:78–99CrossRef
23.
go back to reference Dam T-L, Ramampiaro H, Nørvåg K, Duong Q-H (2019) Towards efficiently mining closed high utility itemsets from incremental databases. Knowl Based Syst 165:13–29CrossRef Dam T-L, Ramampiaro H, Nørvåg K, Duong Q-H (2019) Towards efficiently mining closed high utility itemsets from incremental databases. Knowl Based Syst 165:13–29CrossRef
24.
go back to reference Yun U, Lee G, Yoon E (2017) Efficient high utility pattern mining for establishing manufacturing plans with sliding window control. IEEE Trans Indust Electron 9:7239–7249CrossRef Yun U, Lee G, Yoon E (2017) Efficient high utility pattern mining for establishing manufacturing plans with sliding window control. IEEE Trans Indust Electron 9:7239–7249CrossRef
25.
go back to reference Lee J, Yun U, Lee G, Yoon E (2018) Efficient incremental high utility pattern mining based on pre-large concept. Eng Appl Artif Intell 72:111–123CrossRef Lee J, Yun U, Lee G, Yoon E (2018) Efficient incremental high utility pattern mining based on pre-large concept. Eng Appl Artif Intell 72:111–123CrossRef
26.
go back to reference Fournier-Viger P, Lin JC-W, Gueniche T, Barhate P (2015) Efficient incremental high utility itemset mining. Proc ASE Big Data Soc Inform 2015:1–6 Fournier-Viger P, Lin JC-W, Gueniche T, Barhate P (2015) Efficient incremental high utility itemset mining. Proc ASE Big Data Soc Inform 2015:1–6
27.
go back to reference Yun U, Ryang H, Lee G, Fujita H (2017) An efficient algorithm for mining high utility patterns from incremental databases with one database scan. Knowl-Based Syst 124:188–206CrossRef Yun U, Ryang H, Lee G, Fujita H (2017) An efficient algorithm for mining high utility patterns from incremental databases with one database scan. Knowl-Based Syst 124:188–206CrossRef
28.
go back to reference Yun U, Nam H, Lee G, Yoon E (2019) Efficient approach for incremental high utility pattern mining with indexed list structure. Futur Gener Comput Syst 95:221–239CrossRef Yun U, Nam H, Lee G, Yoon E (2019) Efficient approach for incremental high utility pattern mining with indexed list structure. Futur Gener Comput Syst 95:221–239CrossRef
29.
go back to reference Kim H, Yun U, Baek Y, Kim H, Nam H, Lin JC-W, Fournier-Viger P (2021) Damped sliding based utility oriented pattern mining over stream data. Knowl-Based Syst 213:106653CrossRef Kim H, Yun U, Baek Y, Kim H, Nam H, Lin JC-W, Fournier-Viger P (2021) Damped sliding based utility oriented pattern mining over stream data. Knowl-Based Syst 213:106653CrossRef
30.
go back to reference Lee C, Ryu T, Kim H, Kim H, Vo B, Lin JC-W, Yun U (2022) Efficient approach of sliding window-based high average-utility pattern mining with list structures. Knowl Based Syst 256:109702CrossRef Lee C, Ryu T, Kim H, Kim H, Vo B, Lin JC-W, Yun U (2022) Efficient approach of sliding window-based high average-utility pattern mining with list structures. Knowl Based Syst 256:109702CrossRef
31.
go back to reference Lucchese C, Orlando S, Perego R (2005) Fast and memory efficient mining of frequent closed itemsets. IEEE Trans Knowl Data Eng 18(1):21–36CrossRef Lucchese C, Orlando S, Perego R (2005) Fast and memory efficient mining of frequent closed itemsets. IEEE Trans Knowl Data Eng 18(1):21–36CrossRef
32.
go back to reference Sahoo J, Das AK, Goswami A (2016) An efficient fast algorithm for discovering closed+ high utility itemsets. Appl Intell 45(1):44–74CrossRef Sahoo J, Das AK, Goswami A (2016) An efficient fast algorithm for discovering closed+ high utility itemsets. Appl Intell 45(1):44–74CrossRef
33.
go back to reference Lin JC-W, Djenouri Y, Srivastava G, Yun U, Fournier-Viger P (2021) A predictive GA-based model for closed high-utility itemset mining. Appl Soft Comput 108:107422CrossRef Lin JC-W, Djenouri Y, Srivastava G, Yun U, Fournier-Viger P (2021) A predictive GA-based model for closed high-utility itemset mining. Appl Soft Comput 108:107422CrossRef
34.
go back to reference Pramanik S, Goswami A (2022) Discovery of closed high utility itemsets using a fast nature-inspired ant colony algorithm. Appl Intell 52(8):8839–8855CrossRef Pramanik S, Goswami A (2022) Discovery of closed high utility itemsets using a fast nature-inspired ant colony algorithm. Appl Intell 52(8):8839–8855CrossRef
35.
go back to reference Lin JC-W, Djenouri Y, Srivastava G, Fourier-Viger P (2022) Efficient evolutionary computation model of closed high-utility itemset mining. Appl Intell 14:1–13 Lin JC-W, Djenouri Y, Srivastava G, Fourier-Viger P (2022) Efficient evolutionary computation model of closed high-utility itemset mining. Appl Intell 14:1–13
36.
go back to reference Hidouri A, Jabbour S, Raddaoui B, Yaghlane BB (2021) Mining closed high utility itemsets based on propositional satisfiability. Data Knowl Eng 136:101927CrossRef Hidouri A, Jabbour S, Raddaoui B, Yaghlane BB (2021) Mining closed high utility itemsets based on propositional satisfiability. Data Knowl Eng 136:101927CrossRef
Metadata
Title
FCHM-stream: fast closed high utility itemsets mining over data streams
Authors
Muhang Li
Meng Han
Zhiqiang Chen
Hongxin Wu
Xilong Zhang
Publication date
03-02-2023
Publisher
Springer London
Published in
Knowledge and Information Systems / Issue 6/2023
Print ISSN: 0219-1377
Electronic ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-023-01831-8

Other articles of this Issue 6/2023

Knowledge and Information Systems 6/2023 Go to the issue

Premium Partner