Skip to main content
Erschienen in: The Journal of Supercomputing 6/2022

09.01.2022

OHUQI: Mining on-shelf high-utility quantitative itemsets

verfasst von: Lili Chen, Wensheng Gan, Qi Lin, Shuqiang Huang, Chien-Ming Chen

Erschienen in: The Journal of Supercomputing | Ausgabe 6/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Mobile edge computing has brought fresh opportunities and challenges to data science. Utility-driven mining, a recently emerging branch of utility-based data science, has been widely applied because it considers both the utility factor and the quantity characteristic with ranges of patterns. However, most existing utility-mining algorithms assume that patterns always appear regardless of the period. For instance, some products may sell well at certain times of the year. Considering the rich information in the database, such as quantity and time, we propose an effective and efficient approach, namely OHUQI, for discovering on-shelf high-utility quantitative itemsets. To avoid scanning the database multiple times, we adopt a data structure to maintain some necessary information, and thus OHUQI only accesses the database twice. Several pruning strategies are also designed to prune a large number of unpromising itemsets in advance to shrink the search space. Finally, the subsequent experimental results show that OHUQI performs well on several real-world datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Aggarwal CC (2015) Data mining: the textbook. Springer, BerlinMATH Aggarwal CC (2015) Data mining: the textbook. Springer, BerlinMATH
2.
Zurück zum Zitat Agrawal R, Imieliński T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp 207–216 Agrawal R, Imieliński T, Swami A (1993) Mining association rules between sets of items in large databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp 207–216
3.
Zurück zum Zitat Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of the International Conference Very Large Data Bases, vol 1215, pp 487–499 Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of the International Conference Very Large Data Bases, vol 1215, pp 487–499
4.
Zurück zum Zitat Agrawal R, Srikant R (2000) Privacy-preserving data mining. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp 439–450 Agrawal R, Srikant R (2000) Privacy-preserving data mining. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp 439–450
5.
Zurück zum Zitat Chan R, Yang Q, Shen YD (2003) Mining high utility itemsets. In: IEEE International Conference on Data Mining. IEEE Computer Society, p 19 Chan R, Yang Q, Shen YD (2003) Mining high utility itemsets. In: IEEE International Conference on Data Mining. IEEE Computer Society, p 19
6.
Zurück zum Zitat Chang CY, Chen MS, Lee CH (2002) Mining general temporal association rules for items with different exhibition periods. In: Proceedings of IEEE International Conference on Data Mining. IEEE, pp 59–66 Chang CY, Chen MS, Lee CH (2002) Mining general temporal association rules for items with different exhibition periods. In: Proceedings of IEEE International Conference on Data Mining. IEEE, pp 59–66
7.
Zurück zum Zitat Chen CM, Chen L, Gan W, Qiu L, Ding W (2021) Discovering high utility-occupancy patterns from uncertain data. Inf Sci 546:1208–1229MathSciNetCrossRef Chen CM, Chen L, Gan W, Qiu L, Ding W (2021) Discovering high utility-occupancy patterns from uncertain data. Inf Sci 546:1208–1229MathSciNetCrossRef
8.
Zurück zum Zitat Chen J, Guo X, Gan W, Chen CM, Ding W, Chen G (2020) OSUMI: on-shelf utility mining from itemset-based data. In: IEEE International Conference on Big Data. IEEE , pp 5340–5349 Chen J, Guo X, Gan W, Chen CM, Ding W, Chen G (2020) OSUMI: on-shelf utility mining from itemset-based data. In: IEEE International Conference on Big Data. IEEE , pp 5340–5349
9.
Zurück zum Zitat Chen M, Han J, Yu PS (1996) Data mining: an overview from a database perspective. IEEE Trans Knowl Data Eng 8(6):866–883CrossRef Chen M, Han J, Yu PS (1996) Data mining: an overview from a database perspective. IEEE Trans Knowl Data Eng 8(6):866–883CrossRef
11.
Zurück zum Zitat Dam TL, Li K, Fournier-Viger P, Duong QH (2017) An efficient algorithm for mining top-\(k\) on-shelf high utility itemsets. Knowl Inf Syst 52(3):621–655CrossRef Dam TL, Li K, Fournier-Viger P, Duong QH (2017) An efficient algorithm for mining top-\(k\) on-shelf high utility itemsets. Knowl Inf Syst 52(3):621–655CrossRef
12.
Zurück zum Zitat Fournier-Viger P, Lin JCW, Duong QH, Dam TL (2016) PHM: mining periodic high-utility itemsets. In: Industrial Conference on Data Mining. Springer, pp 64–79 Fournier-Viger P, Lin JCW, Duong QH, Dam TL (2016) PHM: mining periodic high-utility itemsets. In: Industrial Conference on Data Mining. Springer, pp 64–79
13.
Zurück zum Zitat Fournier-Viger P, Wu CW, Zida S, Tseng VS (2014) FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: International Symposium on Methodologies for Intelligent Systems. Springer, pp 83–92 Fournier-Viger P, Wu CW, Zida S, Tseng VS (2014) FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: International Symposium on Methodologies for Intelligent Systems. Springer, pp 83–92
14.
Zurück zum Zitat Fournier-Viger P, Zida S (2015) FOSHU: faster on-shelf high utility itemset mining–with or without negative unit profit. In: Proceedings of the 30th Annual ACM Symposium on Applied Computing, pp 857–864 Fournier-Viger P, Zida S (2015) FOSHU: faster on-shelf high utility itemset mining–with or without negative unit profit. In: Proceedings of the 30th Annual ACM Symposium on Applied Computing, pp 857–864
15.
Zurück zum Zitat Gan W, Lin CW, Fournier-Viger P, Chao HC, Tseng V, Yu PS (2021) A survey of utility-oriented pattern mining. IEEE Trans Knowl Data Eng 33(4):1306–1327CrossRef Gan W, Lin CW, Fournier-Viger P, Chao HC, Tseng V, Yu PS (2021) A survey of utility-oriented pattern mining. IEEE Trans Knowl Data Eng 33(4):1306–1327CrossRef
16.
Zurück zum Zitat Gan W, Lin JCW, Chao HC, Vasilakos AV, Yu PS (2020) Utility-driven data analytics on uncertain data. IEEE Syst J 14(3):4442–4453CrossRef Gan W, Lin JCW, Chao HC, Vasilakos AV, Yu PS (2020) Utility-driven data analytics on uncertain data. IEEE Syst J 14(3):4442–4453CrossRef
17.
Zurück zum Zitat Gan W, Lin JCW, Chao HC, Wang SL, Yu PS (2018) Privacy preserving utility mining: a survey. In: Proceedings of the IEEE International Conference on Big Data. IEEE, pp 2617–2626 Gan W, Lin JCW, Chao HC, Wang SL, Yu PS (2018) Privacy preserving utility mining: a survey. In: Proceedings of the IEEE International Conference on Big Data. IEEE, pp 2617–2626
18.
Zurück zum Zitat Gan W, Lin JCW, Chao HC, Zhan J (2017) Data mining in distributed environment: a survey. Wiley Interdiscip Rev Data Min Knowl Discov 7(6):e1216 Gan W, Lin JCW, Chao HC, Zhan J (2017) Data mining in distributed environment: a survey. Wiley Interdiscip Rev Data Min Knowl Discov 7(6):e1216
19.
Zurück zum Zitat Gan W, Lin JCW, Fournier-Viger P, Chao HC, Hong TP, Fujita H (2018) A survey of incremental high-utility itemset mining. Wiley Interdiscip Rev Data Min Knowl Discov 8(2):e1242CrossRef Gan W, Lin JCW, Fournier-Viger P, Chao HC, Hong TP, Fujita H (2018) A survey of incremental high-utility itemset mining. Wiley Interdiscip Rev Data Min Knowl Discov 8(2):e1242CrossRef
20.
Zurück zum Zitat Gan W, Lin JCW, Fournier-Viger P, Chao HC, Yu PS (2020) HUOPM: high-utility occupancy pattern mining. IEEE Trans Cybern 50(3):1195–1208CrossRef Gan W, Lin JCW, Fournier-Viger P, Chao HC, Yu PS (2020) HUOPM: high-utility occupancy pattern mining. IEEE Trans Cybern 50(3):1195–1208CrossRef
21.
Zurück zum Zitat Gan W, Lin JCW, Fournier-Viger P, Chao HC, Yu PS (2021) Beyond frequency: utility mining with varied item-specific minimum utility. ACM Trans Intern Technol 21(1):1–32CrossRef Gan W, Lin JCW, Fournier-Viger P, Chao HC, Yu PS (2021) Beyond frequency: utility mining with varied item-specific minimum utility. ACM Trans Intern Technol 21(1):1–32CrossRef
22.
Zurück zum Zitat Gan W, Wan S, Chen J, Chen CM, Qiu L (2020) TopHUI: top-\(k\) high-utility itemset mining with negative utility. In: IEEE International Conference on Big Data. IEEE, pp 5350–5359 Gan W, Wan S, Chen J, Chen CM, Qiu L (2020) TopHUI: top-\(k\) high-utility itemset mining with negative utility. In: IEEE International Conference on Big Data. IEEE, pp 5350–5359
23.
Zurück zum Zitat Gupta D, Rani S, Ahmed SH, Verma S, Ijaz MF, Shafi J (2021) Edge caching based on collaborative filtering for heterogeneous ICN-IoT applications. Sensors 21(16):5491CrossRef Gupta D, Rani S, Ahmed SH, Verma S, Ijaz MF, Shafi J (2021) Edge caching based on collaborative filtering for heterogeneous ICN-IoT applications. Sensors 21(16):5491CrossRef
24.
Zurück zum Zitat Hackman A, Huang Y, Tseng VS (2018) Mining trending high utility itemsets from temporal transaction databases. In: International Conference on Database and Expert Systems Applications. Springer, pp 461–470 Hackman A, Huang Y, Tseng VS (2018) Mining trending high utility itemsets from temporal transaction databases. In: International Conference on Database and Expert Systems Applications. Springer, pp 461–470
25.
Zurück zum Zitat Hackman A, Huang Y, Yu PS, Tseng VS (2019) Mining emerging high utility itemsets over streaming database. In: International Conference on Advanced Data Mining and Applications. Springer, pp 3–16 Hackman A, Huang Y, Yu PS, Tseng VS (2019) Mining emerging high utility itemsets over streaming database. In: International Conference on Advanced Data Mining and Applications. Springer, pp 3–16
26.
Zurück zum Zitat Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM SIGMOD Rec 29(2):1–12CrossRef Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM SIGMOD Rec 29(2):1–12CrossRef
27.
Zurück zum Zitat Hu YC, Patel M, Sabella D, Sprecher N, Young V (2015) Mobile edge computing—a key technology towards 5g. ETSI White Paper 11(11):1–16 Hu YC, Patel M, Sabella D, Sprecher N, Young V (2015) Mobile edge computing—a key technology towards 5g. ETSI White Paper 11(11):1–16
28.
Zurück zum Zitat Lan GC, Hong TP, Huang JP, Tseng VS (2014) On-shelf utility mining with negative item values. Expert Syst Appl 41:3450–3459CrossRef Lan GC, Hong TP, Huang JP, Tseng VS (2014) On-shelf utility mining with negative item values. Expert Syst Appl 41:3450–3459CrossRef
29.
Zurück zum Zitat Lan GC, Hong TP, Tseng VS (2011) Discovery of high utility itemsets from on-shelf time periods of products. Expert Syst Appl 38:5851–5857CrossRef Lan GC, Hong TP, Tseng VS (2011) Discovery of high utility itemsets from on-shelf time periods of products. Expert Syst Appl 38:5851–5857CrossRef
30.
Zurück zum Zitat Laxman S, Sastry PS (2006) A survey of temporal data mining. SADHANA Acad Proc Eng Sci 31(2):173–198MathSciNetMATH Laxman S, Sastry PS (2006) A survey of temporal data mining. SADHANA Acad Proc Eng Sci 31(2):173–198MathSciNetMATH
31.
Zurück zum Zitat Li CH, Wu CW, Huang J, Tseng VS (2019) An efficient algorithm for mining high utility quantitative itemsets. In: Proceedings of International Conference on Data Mining Workshops. IEEE, pp 1005–1012 Li CH, Wu CW, Huang J, Tseng VS (2019) An efficient algorithm for mining high utility quantitative itemsets. In: Proceedings of International Conference on Data Mining Workshops. IEEE, pp 1005–1012
32.
Zurück zum Zitat Li CH, Wu CW, Tseng VS (2014) Efficient vertical mining of high utility quantitative itemsets. In: Proceedings of International Conference on Granular Computing. IEEE, pp 155–160 Li CH, Wu CW, Tseng VS (2014) Efficient vertical mining of high utility quantitative itemsets. In: Proceedings of International Conference on Granular Computing. IEEE, pp 155–160
33.
Zurück zum Zitat Li H, Shou G, Hu Y, Guo Z (2016) Mobile edge computing: progress and challenges. In: Proceedings of the 4th IEEE International Conference on Mobile Cloud Computing, Services, and Engineering. IEEE, pp 83–84 Li H, Shou G, Hu Y, Guo Z (2016) Mobile edge computing: progress and challenges. In: Proceedings of the 4th IEEE International Conference on Mobile Cloud Computing, Services, and Engineering. IEEE, pp 83–84
34.
Zurück zum Zitat Lin B, Huang Y, Zhang J, Hu J, Chen X, Li J (2019) Cost-driven off-loading for DNN-based applications over cloud, edge, and end devices. IEEE Trans Ind Inform 16(8):5456–5466CrossRef Lin B, Huang Y, Zhang J, Hu J, Chen X, Li J (2019) Cost-driven off-loading for DNN-based applications over cloud, edge, and end devices. IEEE Trans Ind Inform 16(8):5456–5466CrossRef
35.
Zurück zum Zitat Lin JCW, Fournier-Viger P, Gan W (2016) FHN: an efficient algorithm for mining high-utility itemsets with negative unit profits. Knowl-Based Syst 111:283–298CrossRef Lin JCW, Fournier-Viger P, Gan W (2016) FHN: an efficient algorithm for mining high-utility itemsets with negative unit profits. Knowl-Based Syst 111:283–298CrossRef
36.
Zurück zum Zitat Lin JCW, Gan W, Fournier-Viger P, Hong TP, Tseng VS (2016) Efficient algorithms for mining high-utility itemsets in uncertain databases. Knowl-Based Syst 96:171–187CrossRef Lin JCW, Gan W, Fournier-Viger P, Hong TP, Tseng VS (2016) Efficient algorithms for mining high-utility itemsets in uncertain databases. Knowl-Based Syst 96:171–187CrossRef
37.
Zurück zum Zitat Lin JCW, Gan W, Hong TP, Pan JS (2014) Incrementally updating high-utility itemsets with transaction insertion. In: International Conference on Advanced Data Mining and Applications. Springer, pp 44–56 Lin JCW, Gan W, Hong TP, Pan JS (2014) Incrementally updating high-utility itemsets with transaction insertion. In: International Conference on Advanced Data Mining and Applications. Springer, pp 44–56
38.
Zurück zum Zitat Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM International Conference on Information and knowledge Management, pp 55–64 Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM International Conference on Information and knowledge Management, pp 55–64
39.
Zurück zum Zitat Liu Y, Liao WK, Choudhary A (2005) A two-phase algorithm for fast discovery of high utility itemsets. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, pp 689–695 Liu Y, Liao WK, Choudhary A (2005) A two-phase algorithm for fast discovery of high utility itemsets. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, pp 689–695
40.
41.
Zurück zum Zitat Nouioua M, Fournier-Viger P, Wu CW, Lin JCW, Gan W (2021) FHUQI-Miner: fast high utility quantitative itemset mining. Appl Intell 51(10):6785–6809CrossRef Nouioua M, Fournier-Viger P, Wu CW, Lin JCW, Gan W (2021) FHUQI-Miner: fast high utility quantitative itemset mining. Appl Intell 51(10):6785–6809CrossRef
42.
Zurück zum Zitat Rani S, Koundal D, Ijaz MF, Elhoseny M, Alghamdi MI et al (2021) An optimized framework for wsn routing in the context of industry 4.0. Sensors 21(19):6474CrossRef Rani S, Koundal D, Ijaz MF, Elhoseny M, Alghamdi MI et al (2021) An optimized framework for wsn routing in the context of industry 4.0. Sensors 21(19):6474CrossRef
43.
Zurück zum Zitat Reddy TY, Kiran RU, Toyoda M, Reddy PK, Kitsuregawa M (2019) Discovering partial periodic high utility itemsets in temporal databases. In: Proceedings of International Conference on Database and Expert Systems Applications. Springer, pp 351–361 Reddy TY, Kiran RU, Toyoda M, Reddy PK, Kitsuregawa M (2019) Discovering partial periodic high utility itemsets in temporal databases. In: Proceedings of International Conference on Database and Expert Systems Applications. Springer, pp 351–361
44.
Zurück zum Zitat Tseng VS, Shie BE, Wu CW, Yu PS (2012) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans Knowl Data Eng 25:1772–1786CrossRef Tseng VS, Shie BE, Wu CW, Yu PS (2012) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans Knowl Data Eng 25:1772–1786CrossRef
45.
Zurück zum Zitat Tseng VS, Wu CW, Shie BE, Yu PS (2010) UP-Growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 253–262 Tseng VS, Wu CW, Shie BE, Yu PS (2010) UP-Growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 253–262
46.
Zurück zum Zitat Wang CM, Chen SH, Huang YF (2009) A fuzzy approach for mining high utility quantitative itemsets. In: Proceedings of International Conference on Fuzzy Systems. IEEE, pp 1909–1913 Wang CM, Chen SH, Huang YF (2009) A fuzzy approach for mining high utility quantitative itemsets. In: Proceedings of International Conference on Fuzzy Systems. IEEE, pp 1909–1913
47.
Zurück zum Zitat Yao H, Hamilton HJ (2006) Mining itemset utilities from transaction databases. Data Knowl Eng 59:603–626CrossRef Yao H, Hamilton HJ (2006) Mining itemset utilities from transaction databases. Data Knowl Eng 59:603–626CrossRef
48.
Zurück zum Zitat Yen SJ, Lee YS (2007) Mining high utility quantitative association rules. In: International Conference on Data Warehousing and Knowledge Discovery. Springer, pp 283–292 Yen SJ, Lee YS (2007) Mining high utility quantitative association rules. In: International Conference on Data Warehousing and Knowledge Discovery. Springer, pp 283–292
49.
Zurück zum Zitat Zhang C, Du Z, Yang Y, Gan W, Yu PS (2021) On-shelf utility mining of sequence data. ACM Trans Knowl Discov Data 16(2):1–31CrossRef Zhang C, Du Z, Yang Y, Gan W, Yu PS (2021) On-shelf utility mining of sequence data. ACM Trans Knowl Discov Data 16(2):1–31CrossRef
Metadaten
Titel
OHUQI: Mining on-shelf high-utility quantitative itemsets
verfasst von
Lili Chen
Wensheng Gan
Qi Lin
Shuqiang Huang
Chien-Ming Chen
Publikationsdatum
09.01.2022
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 6/2022
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-021-04218-0

Weitere Artikel der Ausgabe 6/2022

The Journal of Supercomputing 6/2022 Zur Ausgabe

Premium Partner