Skip to main content
Erschienen in: Computing 3/2016

01.03.2016

A fast and distributed algorithm for mining frequent patterns in congested networks

verfasst von: Kawuu W. Lin, Sheng-Hao Chung, Chun-Cheng Lin

Erschienen in: Computing | Ausgabe 3/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

With advances in technology, frequent pattern mining has been used widely in our daily lives. By using this technology, one can obtain interesting or useful information that would help one make decisions and apply judgment. For example, marketplace managers mine transaction data to obtain information that can help improve services, understand customer buying habits, determine a suitable scheme for placement of goods to increase profits, or for medical and biotechnology applications. However, the rate at which data is generated is very rapid, leading to problems caused by Big Data. Therefore, many researchers have studied distributed, parallel and cloud computing technology to select the best among them. However, data mining uses multiple computing nodes, which requires the transmission of a considerable amount of data in a network environment. The available network bandwidth is limited when many different tasks are being transmitted at the same time and many servers are working in the same network segment. This results in poor transmission, causing severe transfer delay, either internal or external to the network. Thus, we propose the fast and distributed mining algorithm for discovering frequent patterns in congested networks (FDMCN) algorithm, which is based on CARM. The main purpose is to reduce FP-tree transmission such that only a portion of the information is required for mining using computing nodes. The results of empirical evaluation under various simulation conditions show that the proposed method FDMCN delivers excellent performance in terms of execution efficiency and scalability when compared with the PSWS algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Adnan M, Alhajj R (2009) DRFP-tree: disk-resident frequent pattern tree. Appl Intell 30(2):84–97CrossRef Adnan M, Alhajj R (2009) DRFP-tree: disk-resident frequent pattern tree. Appl Intell 30(2):84–97CrossRef
3.
Zurück zum Zitat Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th international conference on very large data bases, VLDB, Santiago, pp 487–499 Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th international conference on very large data bases, VLDB, Santiago, pp 487–499
4.
Zurück zum Zitat Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969CrossRef Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969CrossRef
5.
Zurück zum Zitat Baralis E, Cerquitelli T, Chiusano S, Grand A (2013) P-mine: parallel itemset mining on large datasets. ICDE Baralis E, Cerquitelli T, Chiusano S, Grand A (2013) P-mine: parallel itemset mining on large datasets. ICDE
6.
Zurück zum Zitat Ezeife CI, Zhang D (2009) TidFP: mining frequent patterns in different databases with transaction ID. Data Warehousing Knowl Discov, Lecture Notes Comput Sci 5691:125–137 Ezeife CI, Zhang D (2009) TidFP: mining frequent patterns in different databases with transaction ID. Data Warehousing Knowl Discov, Lecture Notes Comput Sci 5691:125–137
7.
Zurück zum Zitat Grahne G, Zhu J (2003) Efficiently using prefix-trees in mining frequent itemsets. In: Proceedings of the IEEE ICDM workshop on frequent itemset mining implementations Grahne G, Zhu J (2003) Efficiently using prefix-trees in mining frequent itemsets. In: Proceedings of the IEEE ICDM workshop on frequent itemset mining implementations
8.
Zurück zum Zitat Grahne G, Zhu J (2004) Mining frequent itemsets from secondary memory. International conference on data mining, pp 91–98 Grahne G, Zhu J (2004) Mining frequent itemsets from secondary memory. International conference on data mining, pp 91–98
9.
Zurück zum Zitat Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In: Proceedings of the ACM SIGMOD international conference on management of data, pp 1–12 Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. In: Proceedings of the ACM SIGMOD international conference on management of data, pp 1–12
10.
Zurück zum Zitat Han J, Pei J, Yin Y, Mao R (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. J Data Min Knowl Discov 8(1):53–87CrossRefMathSciNet Han J, Pei J, Yin Y, Mao R (2004) Mining frequent patterns without candidate generation: a frequent-pattern tree approach. J Data Min Knowl Discov 8(1):53–87CrossRefMathSciNet
11.
Zurück zum Zitat Javed A, Khokhar A (2004) Frequent pattern mining on message passing multiprocessor systems. Distrib Parallel Databases 16:321–334CrossRef Javed A, Khokhar A (2004) Frequent pattern mining on message passing multiprocessor systems. Distrib Parallel Databases 16:321–334CrossRef
12.
Zurück zum Zitat Schlegel B, Gemulla R, Lehner W (2011) Memory-efficient frequent-itemset mining. In: EDBT/ICDT11 proceedings of the 14th international conference on extending database technology, pp 461–472 Schlegel B, Gemulla R, Lehner W (2011) Memory-efficient frequent-itemset mining. In: EDBT/ICDT11 proceedings of the 14th international conference on extending database technology, pp 461–472
13.
Zurück zum Zitat Lai Y, Zhongzhi S (2010) An efficient data mining framework on Hadoop using java persistence API. International conference on computer and information technology, pp 203–209 Lai Y, Zhongzhi S (2010) An efficient data mining framework on Hadoop using java persistence API. International conference on computer and information technology, pp 203–209
14.
Zurück zum Zitat Lai Y, Zhongzhi S, Xu LD, Fan L, Kirsh I (2011) DH-TRIE frequent pattern mining on hadoop using JPA. International conference on granular computing, pp 875–878 Lai Y, Zhongzhi S, Xu LD, Fan L, Kirsh I (2011) DH-TRIE frequent pattern mining on hadoop using JPA. International conference on granular computing, pp 875–878
15.
Zurück zum Zitat Lin KW, Luo YC (2009) A fast parallel algorithm for discovering frequent patterns. GRC ’09. IEEE international conference on granular computing, pp 398–403 Lin KW, Luo YC (2009) A fast parallel algorithm for discovering frequent patterns. GRC ’09. IEEE international conference on granular computing, pp 398–403
16.
Zurück zum Zitat Lin KW, Lo YC (2013) Efficient algorithms for frequent pattern mining in many-task computing environments. Knowl Based Syst 49 Lin KW, Lo YC (2013) Efficient algorithms for frequent pattern mining in many-task computing environments. Knowl Based Syst 49
17.
Zurück zum Zitat Qiu Y, Lan YJ, Xie QS (2004) An improved algorithm of mining from FP- tree. In: Proceedings of the third international conference on machine learning and cybernetics, pp 26–29 Qiu Y, Lan YJ, Xie QS (2004) An improved algorithm of mining from FP- tree. In: Proceedings of the third international conference on machine learning and cybernetics, pp 26–29
18.
Zurück zum Zitat Vu L, Alaghband G (2013) Novel parallel method for mining frequent patterns on multi-core shared memory systems. In: DISCS-2013 proceedings of the 2013 international workshop on data-intensive scalable computing systems, pp 49–54 Vu L, Alaghband G (2013) Novel parallel method for mining frequent patterns on multi-core shared memory systems. In: DISCS-2013 proceedings of the 2013 international workshop on data-intensive scalable computing systems, pp 49–54
19.
Zurück zum Zitat Wu X, Zhu X, Gong-Qing W, Ding W (2014) Data mining with big data, TKDE Wu X, Zhu X, Gong-Qing W, Ding W (2014) Data mining with big data, TKDE
20.
Zurück zum Zitat Yang XY, Liu Z, Fu Y (2010) MapReduce as a programming model for association rules algorithm on Hadoop. International conference on information sciences and interaction sciences, pp 99–102 Yang XY, Liu Z, Fu Y (2010) MapReduce as a programming model for association rules algorithm on Hadoop. International conference on information sciences and interaction sciences, pp 99–102
21.
Zurück zum Zitat Yen SJ, Lee YS, Wang CK, Wu JW, Ouyang LY (2009) The studies of mining frequent patterns based on frequent pattern tree. Adv Knowl Discov Data Min, Lecture Notes Comput Sci 5476:232–241 Yen SJ, Lee YS, Wang CK, Wu JW, Ouyang LY (2009) The studies of mining frequent patterns based on frequent pattern tree. Adv Knowl Discov Data Min, Lecture Notes Comput Sci 5476:232–241
22.
Zurück zum Zitat Zaki MJ (2000) Scalable algorithms for association mining. IEEE Trans Knowl Data Eng 12(3) Zaki MJ (2000) Scalable algorithms for association mining. IEEE Trans Knowl Data Eng 12(3)
23.
Zurück zum Zitat Zhou J, Yu KM (2008) Tidset-based Parallel FP-tree algorithm for the frequent pattern mining problem on PC clusters. Adv Grid Pervas Comput, Lecture Notes Comput Sci 5036:18–28CrossRef Zhou J, Yu KM (2008) Tidset-based Parallel FP-tree algorithm for the frequent pattern mining problem on PC clusters. Adv Grid Pervas Comput, Lecture Notes Comput Sci 5036:18–28CrossRef
24.
Zurück zum Zitat Zhou J, Yu KM (2008) Balanced tidset-based parallel FP-tree algorithm for the frequent pattern mining on grid system. Fourth international conference on semantics, knowledge and grid, pp 103–108 Zhou J, Yu KM (2008) Balanced tidset-based parallel FP-tree algorithm for the frequent pattern mining on grid system. Fourth international conference on semantics, knowledge and grid, pp 103–108
Metadaten
Titel
A fast and distributed algorithm for mining frequent patterns in congested networks
verfasst von
Kawuu W. Lin
Sheng-Hao Chung
Chun-Cheng Lin
Publikationsdatum
01.03.2016
Verlag
Springer Vienna
Erschienen in
Computing / Ausgabe 3/2016
Print ISSN: 0010-485X
Elektronische ISSN: 1436-5057
DOI
https://doi.org/10.1007/s00607-015-0457-6

Weitere Artikel der Ausgabe 3/2016

Computing 3/2016 Zur Ausgabe