Skip to main content
Erschienen in: Arabian Journal for Science and Engineering 8/2022

16.11.2021 | Research Article-Computer Engineering and Computer Science

SR-Mine: Adaptive Transaction Compression Method for Frequent Itemsets Mining

verfasst von: O. Jamsheela, G. Raju

Erschienen in: Arabian Journal for Science and Engineering | Ausgabe 8/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Extraction of frequent itemsets is a key step in association rule mining. Frequent Pattern (FP) mining from a very large dataset is still a challenging research problem. The basic frequent itemset algorithms are Apriori and FP-growth. FP-growth uses Frequent Pattern Tree (FP-tree) to store the database information in a compressed form. A large number of research papers have been proposed as an improvement of the basic frequent itemset mining algorithms. Several researchers have proposed modifications to existing data structures as well as new data structures to improve the mining process. A new method, Size Reduced Mining (SR-Mine), is proposed to speed up the FP-tree creation. The proposed work is implemented with the basic FP-growth algorithm and with the other two recent algorithms based on FP-tree. The three modified algorithms have been tested with standard datasets and compared with the original algorithms. The proposed method can be applied with the frequent itemset mining algorithms which consider each transaction one by one to construct a data structure for mining. The experimental results show that the proposed method can improve the performance of the mining.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Agrawal, R.; Imieliński, T.; Swami, A.: Mining association rules between sets of items in large databases. ACM SIGMOD Rec. 22(2), 207–216 (1993)CrossRef Agrawal, R.; Imieliński, T.; Swami, A.: Mining association rules between sets of items in large databases. ACM SIGMOD Rec. 22(2), 207–216 (1993)CrossRef
2.
Zurück zum Zitat Hu, J.; Yang-Li, X.: A new fast frequent itemsets mining algorithm based on forest. In: Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008. FSKD’08, Vol. 2. IEEE (2008) Hu, J.; Yang-Li, X.: A new fast frequent itemsets mining algorithm based on forest. In: Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008. FSKD’08, Vol. 2. IEEE (2008)
3.
Zurück zum Zitat Han, J.; Pei, J.; Yin, Y.: Mining frequent patterns without Candidate Generation. In: SIGMOD ’00 Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 1–12 Han, J.; Pei, J.; Yin, Y.: Mining frequent patterns without Candidate Generation. In: SIGMOD ’00 Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pp. 1–12
4.
Zurück zum Zitat Narvekar, M.; Shafaque, F.S.: An optimized algorithm for association rule mining using FP tree. Procedia Comput. Sci. 45, 101–110 (2015) Narvekar, M.; Shafaque, F.S.: An optimized algorithm for association rule mining using FP tree. Procedia Comput. Sci. 45, 101–110 (2015)
5.
Zurück zum Zitat Tsay, Y.-J.; Hsu, T.-J.; Jing-Rung, Yu.: FIUT: a new method for mining frequent itemsets. Inf. Sci. 179(11), 1724–1737 (2009)CrossRef Tsay, Y.-J.; Hsu, T.-J.; Jing-Rung, Yu.: FIUT: a new method for mining frequent itemsets. Inf. Sci. 179(11), 1724–1737 (2009)CrossRef
6.
Zurück zum Zitat Lin, K.-C.; Liao, I.-E.; Chen, Z.-S.: An improved frequent pattern growth method for mining association rules. Expert Syst. Appl. 38(5), 5154–5161 (2011)CrossRef Lin, K.-C.; Liao, I.-E.; Chen, Z.-S.: An improved frequent pattern growth method for mining association rules. Expert Syst. Appl. 38(5), 5154–5161 (2011)CrossRef
7.
Zurück zum Zitat Deng, Z.-H.; Lv, S.-L.: PrePost+: an efficient N-lists-based algorithm for mining frequent itemsets via Children–Parent Equivalence pruning. Expert Syst. Appl. 42(13), 5424–5432 (2015)CrossRef Deng, Z.-H.; Lv, S.-L.: PrePost+: an efficient N-lists-based algorithm for mining frequent itemsets via Children–Parent Equivalence pruning. Expert Syst. Appl. 42(13), 5424–5432 (2015)CrossRef
8.
Zurück zum Zitat Agarwal, V.; Kushal, M.; Kumar, D.: An improvised frequent pattern tree Based association rule mining Technique with mining frequent itemsets algorithm and a modified header Table. arXiv preprint arXiv:1504.07018 (2015) Agarwal, V.; Kushal, M.; Kumar, D.: An improvised frequent pattern tree Based association rule mining Technique with mining frequent itemsets algorithm and a modified header Table. arXiv preprint arXiv:​1504.​07018 (2015)
9.
Zurück zum Zitat Grahne, G.; Zhu, J.: Fast algorithms for frequent itemset mining using FP-trees. IEEE Trans. Knowl. Data Eng. 17(10), 1347–1362 (2005)CrossRef Grahne, G.; Zhu, J.: Fast algorithms for frequent itemset mining using FP-trees. IEEE Trans. Knowl. Data Eng. 17(10), 1347–1362 (2005)CrossRef
10.
Zurück zum Zitat Pyun, G.; Yun, U.; Ryu, K.H.: Efficient frequent pattern mining based on linear prefix tree. Knowl.-Based Syst. 55, 125–139 (2014)CrossRef Pyun, G.; Yun, U.; Ryu, K.H.: Efficient frequent pattern mining based on linear prefix tree. Knowl.-Based Syst. 55, 125–139 (2014)CrossRef
11.
Zurück zum Zitat Rácz, B.: nonordfp: an FP-growth variation without rebuilding the FP-tree. In: FIMI (2004) Rácz, B.: nonordfp: an FP-growth variation without rebuilding the FP-tree. In: FIMI (2004)
12.
Zurück zum Zitat Deng, Z.H.; Wang, Z.H.; Jiang, J.J.: A new algorithm for fast mining frequent itemsets using N-lists. Sci. China Inf. Sci. 55(9), 2008–2030 (2012)MathSciNetCrossRef Deng, Z.H.; Wang, Z.H.; Jiang, J.J.: A new algorithm for fast mining frequent itemsets using N-lists. Sci. China Inf. Sci. 55(9), 2008–2030 (2012)MathSciNetCrossRef
13.
Zurück zum Zitat Deng, Z.-H.; Lv, S.-L.: Fast mining frequent itemsets using Nodesets. Expert Syst. Appl. 41(10), 4505–4512 (2014)CrossRef Deng, Z.-H.; Lv, S.-L.: Fast mining frequent itemsets using Nodesets. Expert Syst. Appl. 41(10), 4505–4512 (2014)CrossRef
14.
Zurück zum Zitat Borgelt, C.: Simple Algorithms for Frequent Item Set Mining. Advances in Machine Learning II, pp. 351–369. Springer, Berlin (2010) Borgelt, C.: Simple Algorithms for Frequent Item Set Mining. Advances in Machine Learning II, pp. 351–369. Springer, Berlin (2010)
15.
Zurück zum Zitat Tseng, F.-C.: An adaptive approach to mining frequent itemsets efficiently. Expert Syst. Appl. 39(18), 13166–13172 (2012)CrossRef Tseng, F.-C.: An adaptive approach to mining frequent itemsets efficiently. Expert Syst. Appl. 39(18), 13166–13172 (2012)CrossRef
16.
Zurück zum Zitat Ashrafi, M.Z.; Taniar, D.; Smith, K.: A compress–based association mining algorithm for large dataset. In: International Conference on Computational Science. Springer, Berlin (2003) Ashrafi, M.Z.; Taniar, D.; Smith, K.: A compress–based association mining algorithm for large dataset. In: International Conference on Computational Science. Springer, Berlin (2003)
17.
Zurück zum Zitat Hung, M.-C.; et al.: Efficient mining of association rules using merged transactions approach. WSEAS Trans. Comput. 5(5), 916–923 (2006) Hung, M.-C.; et al.: Efficient mining of association rules using merged transactions approach. WSEAS Trans. Comput. 5(5), 916–923 (2006)
18.
Zurück zum Zitat Dai, J.-Y.; et al.: An efficient data mining approach on compressed transactions. Int. J. Electr. Comput. Eng. 3(2), 76–83 (2008) Dai, J.-Y.; et al.: An efficient data mining approach on compressed transactions. Int. J. Electr. Comput. Eng. 3(2), 76–83 (2008)
19.
Zurück zum Zitat Nair, B.; Tripathy, A.K.: Accelerating closed frequent itemset mining by elimination of null transactions. J. Emerg. Trends Comput. Inf. Sci. 2(7), 317–324 (2011) Nair, B.; Tripathy, A.K.: Accelerating closed frequent itemset mining by elimination of null transactions. J. Emerg. Trends Comput. Inf. Sci. 2(7), 317–324 (2011)
20.
Zurück zum Zitat Fournier-Viger, P.; Gomariz, A.; Gueniche, T.; Soltani, A.; Wu, C.; Tseng, V.S.: SPMF: a java open-source pattern mining library. J. Mach. Learn. Res. (JMLR) 15, 3389–3393 (2014)MATH Fournier-Viger, P.; Gomariz, A.; Gueniche, T.; Soltani, A.; Wu, C.; Tseng, V.S.: SPMF: a java open-source pattern mining library. J. Mach. Learn. Res. (JMLR) 15, 3389–3393 (2014)MATH
Metadaten
Titel
SR-Mine: Adaptive Transaction Compression Method for Frequent Itemsets Mining
verfasst von
O. Jamsheela
G. Raju
Publikationsdatum
16.11.2021
Verlag
Springer Berlin Heidelberg
Erschienen in
Arabian Journal for Science and Engineering / Ausgabe 8/2022
Print ISSN: 2193-567X
Elektronische ISSN: 2191-4281
DOI
https://doi.org/10.1007/s13369-021-06298-9

Weitere Artikel der Ausgabe 8/2022

Arabian Journal for Science and Engineering 8/2022 Zur Ausgabe

Research Article-Computer Engineering and Computer Science

Transformer-Based Word Embedding With CNN Model to Detect Sarcasm and Irony

Research Article-Computer Engineering and Computer Science

A Heuristic Local-sensitive Program-Wide Diffing Method for IoT Binary Files

Research Article-Computer Engineering and Computer Science

A Multi-level Correlation-Based Feature Selection for Intrusion Detection

Research Article-Computer Engineering and Computer Science

Application of Mathematical Modeling in Prediction of COVID-19 Transmission Dynamics

Research Article-Computer Engineering and Computer Science

Anisotropic Diffusion Filter Based on Spiking Neural Network Model

    Marktübersichten

    Die im Laufe eines Jahres in der „adhäsion“ veröffentlichten Marktübersichten helfen Anwendern verschiedenster Branchen, sich einen gezielten Überblick über Lieferantenangebote zu verschaffen.