Skip to main content
Top
Published in: The Journal of Supercomputing 4/2015

01-04-2015

GPU-based bees swarm optimization for association rules mining

Authors: Youcef Djenouri, Ahcene Bendjoudi, Malika Mehdi, Nadia Nouali-Taboudjemat, Zineb Habbas

Published in: The Journal of Supercomputing | Issue 4/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Association rules mining (ARM) is a well-known combinatorial optimization problem aiming at extracting relevant rules from given large-scale datasets. According to the state of the art, the bio-inspired methods proved their efficiency by generating acceptable solutions in a reasonable time when dealing with small and medium size instances. Unfortunately, to cope with large instances such as the webdocs benchmark, these methods require more and more powerful processors and are time expensive. Nowadays, computing power is no longer a real issue. It can be provided by the power of emerging technologies such as graphics processing units (GPUs) that are massively multi-threaded processors. In this paper, we investigate the use of GPUs to speed up the computation. We propose two GPU-based bees swarm algorithms for association rules mining (single evaluation in GPU, SE-GPU and multiple evaluation in GPU, ME-GPU). SE-GPU aims at evaluating one rule at a time where each thread is associated with one transaction, whereas ME-GPU evaluates multiple rules in parallel on GPU where each thread is associated with several transactions. To validate our approaches, the two algorithms have been executed to solve well-known large ARM instances. Real experiments have been carried out on an Intel Xeon 64 bit quad-core processor E5520 coupled to an Nvidia Tesla C2075 GPU device. The results show that our approaches improve the execution time up to 100\(\times \) over the sequential mono-core bees swarm optimization-ARM algorithm. Moreover, the proposed approaches have been compared with CPU multi-core ones (1–8 cores). The results show that they are faster than the multi-core versions whatever is the number of used cores.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Adil SH, Qamar S (2009) Implementation of association rule mining using CUDA. In: International conference onemerging technologies (ICET 2009). IEEE, New York Adil SH, Qamar S (2009) Implementation of association rule mining using CUDA. In: International conference onemerging technologies (ICET 2009). IEEE, New York
2.
go back to reference Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969CrossRef Agrawal R, Shafer JC (1996) Parallel mining of association rules. IEEE Trans Knowl Data Eng 8(6):962–969CrossRef
3.
go back to reference Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. ACM SIGMOD Rec 22(2) (ACM) Agrawal R, Imielinski T, Swami A (1993) Mining association rules between sets of items in large databases. ACM SIGMOD Rec 22(2) (ACM)
4.
go back to reference Arabnia HR, Oliver MA (1989) A transputer network for fast operations on digitised images. Int J Eurograph Assoc (Comput Graph Forum) 8(1):3–12CrossRef Arabnia HR, Oliver MA (1989) A transputer network for fast operations on digitised images. Int J Eurograph Assoc (Comput Graph Forum) 8(1):3–12CrossRef
5.
go back to reference Bendjoudi A, Chekini M, Gharbi M, Mehdi M, Benatchba K, Sitayeb-Benbouzid F, Melab N (2013) Parallel B&B algorithm on hybrid multicore/GPU architecture. In: IEEE international conference on high performance computing and communications, vol 15. IEEE, New York Bendjoudi A, Chekini M, Gharbi M, Mehdi M, Benatchba K, Sitayeb-Benbouzid F, Melab N (2013) Parallel B&B algorithm on hybrid multicore/GPU architecture. In: IEEE international conference on high performance computing and communications, vol 15. IEEE, New York
6.
go back to reference Boley D et al (1999) Partitioning-based clustering for web document categorization. Decis Support Syst 27(3):329–341CrossRef Boley D et al (1999) Partitioning-based clustering for web document categorization. Decis Support Syst 27(3):329–341CrossRef
7.
go back to reference Brin S et al (1997) Dynamic itemset counting and implication rules for market basket data. ACM SIGMOD Rec 26(2) (ACM) Brin S et al (1997) Dynamic itemset counting and implication rules for market basket data. ACM SIGMOD Rec 26(2) (ACM)
8.
go back to reference Cano A, Luna JM, Ventura S (2013) High performance evaluation of evolutionary-mined association rules on GPUs. J Supercomput 1–24 Cano A, Luna JM, Ventura S (2013) High performance evaluation of evolutionary-mined association rules on GPUs. J Supercomput 1–24
9.
go back to reference Cecilia JM, García JM, Nisbet A, Amos M, Ujaldón M (2013) Enhancing data parallelism for ant colony optimization on gpus. J Parallel Distrib Comput 73(1):42–51CrossRef Cecilia JM, García JM, Nisbet A, Amos M, Ujaldón M (2013) Enhancing data parallelism for ant colony optimization on gpus. J Parallel Distrib Comput 73(1):42–51CrossRef
10.
go back to reference Cui Q, Xiaobo G (2013) Research on parallel association rules mining on GPU. In: Proceedings of the 2nd international conference on green communications and networks 2012 (GCN 2012), vol 2. Springer, Berlin Cui Q, Xiaobo G (2013) Research on parallel association rules mining on GPU. In: Proceedings of the 2nd international conference on green communications and networks 2012 (GCN 2012), vol 2. Springer, Berlin
11.
go back to reference Djenouri Y, Drias H, Habbas Z (2014) Bees swarm optimisation using multiple strategies for association rule mining. Int J Bio-Inspir Comput 6(4):239–249CrossRef Djenouri Y, Drias H, Habbas Z (2014) Bees swarm optimisation using multiple strategies for association rule mining. Int J Bio-Inspir Comput 6(4):239–249CrossRef
12.
go back to reference Fang W et al (2009) Frequent itemset mining on graphics processors. In: Proceedings of the fifth international workshop on data management on new hardware. ACM, New York Fang W et al (2009) Frequent itemset mining on graphics processors. In: Proceedings of the fifth international workshop on data management on new hardware. ACM, New York
13.
go back to reference Gada FV, Yadav KR, Shah RD (2013) Data mining in medical application for better detection of disease. IJCER 2(2):173–176 Gada FV, Yadav KR, Shah RD (2013) Data mining in medical application for better detection of disease. IJCER 2(2):173–176
14.
go back to reference Garg R, Mishra PK (2011) Exploiting parallelism in association rule mining algorithms. Int J Adv Technol 2.2:222–232 Garg R, Mishra PK (2011) Exploiting parallelism in association rule mining algorithms. Int J Adv Technol 2.2:222–232
15.
go back to reference Goethals B, Zaki MJ (2003) Frequent itemset mining implementations repository. This site contains a wide-variety of algorithms for mining frequent, closed, and maximal itemsets. http://fimi.cs.helsinki.fi Goethals B, Zaki MJ (2003) Frequent itemset mining implementations repository. This site contains a wide-variety of algorithms for mining frequent, closed, and maximal itemsets. http://​fimi.​cs.​helsinki.​fi
17.
go back to reference Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM SIGMOD Rec 29(2) (ACM) Han J, Pei J, Yin Y (2000) Mining frequent patterns without candidate generation. ACM SIGMOD Rec 29(2) (ACM)
18.
go back to reference Han J, Kamber M, Pei J (2006) Data mining: concepts and techniques. Morgan Kaufmann, Burlington Han J, Kamber M, Pei J (2006) Data mining: concepts and techniques. Morgan Kaufmann, Burlington
19.
go back to reference Kacprzyk J, Zadrozny S (2013) Derivation of linguistic summaries is inherently difficult: can association rule mining help? In: Towards advanced data analysis by combining soft computing and statistics, pp 291–303. Springer, Berlin Kacprzyk J, Zadrozny S (2013) Derivation of linguistic summaries is inherently difficult: can association rule mining help? In: Towards advanced data analysis by combining soft computing and statistics, pp 291–303. Springer, Berlin
20.
go back to reference Khademolghorani F, Baraani A, Zamanifar K (2014) Efficient mining of association rules based on gravitational search algorithm. Int J Comput Sci 8 Khademolghorani F, Baraani A, Zamanifar K (2014) Efficient mining of association rules based on gravitational search algorithm. Int J Comput Sci 8
21.
go back to reference Khabzaoui M, Dhaenens C, Talbi E-G (2005) Parallel genetic algorithms for multi-objective rule mining. In: The 6th MIC 2005 Khabzaoui M, Dhaenens C, Talbi E-G (2005) Parallel genetic algorithms for multi-objective rule mining. In: The 6th MIC 2005
22.
go back to reference Kozawa Y, Amagasa T, Kitagawa H (2014) Probabilistic frequent itemset mining on a GPU cluster. IEICE Trans Inf Syst 97(4):779–789CrossRef Kozawa Y, Amagasa T, Kitagawa H (2014) Probabilistic frequent itemset mining on a GPU cluster. IEICE Trans Inf Syst 97(4):779–789CrossRef
23.
go back to reference Kuo RJ, Chao CM, Chiu YT (2011) Application of particle swarm optimization to association rule mining. Appl Soft Comput 11(1):326–336CrossRef Kuo RJ, Chao CM, Chiu YT (2011) Application of particle swarm optimization to association rule mining. Appl Soft Comput 11(1):326–336CrossRef
24.
go back to reference Li H, Zhang N (2010) Mining maximal frequent itemsets on graphics processors. In: 2010 seventh international conference on fuzzy systems and knowledge discovery (FSKD), vol 3. IEEE, New York Li H, Zhang N (2010) Mining maximal frequent itemsets on graphics processors. In: 2010 seventh international conference on fuzzy systems and knowledge discovery (FSKD), vol 3. IEEE, New York
25.
go back to reference Lin KW, Deng DJ (2010) A novel parallel algorithm for frequent pattern mining with privacy preserved in cloud computing environments. Int J Ad Hoc Ubiquitous Comput 6(4):205–215CrossRef Lin KW, Deng DJ (2010) A novel parallel algorithm for frequent pattern mining with privacy preserved in cloud computing environments. Int J Ad Hoc Ubiquitous Comput 6(4):205–215CrossRef
26.
go back to reference Liu K, Hogan WR, Crowley RS (2011) Natural language processing methods and systems for biomedical ontology learning. J Biomed Inform 44(1):163–179CrossRef Liu K, Hogan WR, Crowley RS (2011) Natural language processing methods and systems for biomedical ontology learning. J Biomed Inform 44(1):163–179CrossRef
27.
go back to reference Liu Z, Chen S, Cai J, Zhang E, Lan L, Zheng J, Du J (2013) Traditional Chinese medicine syndrome-related herbal prescriptions in treatment of malignant tumors. J Tradit Chin Med 33(1):19–26CrossRef Liu Z, Chen S, Cai J, Zhang E, Lan L, Zheng J, Du J (2013) Traditional Chinese medicine syndrome-related herbal prescriptions in treatment of malignant tumors. J Tradit Chin Med 33(1):19–26CrossRef
28.
go back to reference Luke S (2011) Essentials of meta-heuristics, pp 11–52 Luke S (2011) Essentials of meta-heuristics, pp 11–52
29.
go back to reference Luper D, Cameron D, Miller JA, Arabnia HR (2007) Spatial and temporal target association through semantic analysis and GPS data mining. In: International conference on information and knowledge engineering (IKE’07), USA, pp 251–257 (ISBN 1-60132-050-7) Luper D, Cameron D, Miller JA, Arabnia HR (2007) Spatial and temporal target association through semantic analysis and GPS data mining. In: International conference on information and knowledge engineering (IKE’07), USA, pp 251–257 (ISBN 1-60132-050-7)
30.
go back to reference Martínez-Ballesteros M, Nepomuceno-Chamorro IA, Riquelme JC (2013) Discovering gene association networks by multi-objective evolutionary quantitative association rules. J Comput Syst Sci Martínez-Ballesteros M, Nepomuceno-Chamorro IA, Riquelme JC (2013) Discovering gene association networks by multi-objective evolutionary quantitative association rules. J Comput Syst Sci
31.
go back to reference Mata J, José-Luis A, Riquelme J-C (2002) An evolutionary algorithm to discover numeric association rules. In: Proceedings of the 2002 ACM symposium on applied computing. ACM, New York Mata J, José-Luis A, Riquelme J-C (2002) An evolutionary algorithm to discover numeric association rules. In: Proceedings of the 2002 ACM symposium on applied computing. ACM, New York
32.
go back to reference Melab N, Talbi E-G (2001) A parallel genetic algorithm for rule mining. In: Proceedings of the 15th international parallel and distributed processing symposium. IEEE Computer Society, London Melab N, Talbi E-G (2001) A parallel genetic algorithm for rule mining. In: Proceedings of the 15th international parallel and distributed processing symposium. IEEE Computer Society, London
33.
go back to reference Melab N, Chakroun I, Bendjoudi A (2013) Graphics processing unit-accelerated bounding for branch-and-bound applied to a permutation problem using data access optimization. Concurr Comput Pract Exp Melab N, Chakroun I, Bendjoudi A (2013) Graphics processing unit-accelerated bounding for branch-and-bound applied to a permutation problem using data access optimization. Concurr Comput Pract Exp
34.
go back to reference Moslehi P et al (2014) Multi-objective numeric association rules mining via ant colony optimization for continuous domains without specifying minimum support and minimum confidence. Int J Comput Sci 8 Moslehi P et al (2014) Multi-objective numeric association rules mining via ant colony optimization for continuous domains without specifying minimum support and minimum confidence. Int J Comput Sci 8
35.
go back to reference Mishra BSP et al (2011) Parallel multi-objective genetic algorithms for associative classification rule mining. In: Proceedings of the 2011 international conference on communication, computing and security. ACM, New York Mishra BSP et al (2011) Parallel multi-objective genetic algorithms for associative classification rule mining. In: Proceedings of the 2011 international conference on communication, computing and security. ACM, New York
37.
go back to reference Orlando S et al (2002) Adaptive and resource-aware mining of frequent sets. In: Proceedings of the 2002 IEEE international conference on data mining, ICDM 2003. IEEE, New York Orlando S et al (2002) Adaptive and resource-aware mining of frequent sets. In: Proceedings of the 2002 IEEE international conference on data mining, ICDM 2003. IEEE, New York
38.
go back to reference Park JS, Chen M-S, Yu PS (1995) An effective hash-based algorithm for mining association rules, vol 24(2). ACM, New York Park JS, Chen M-S, Yu PS (1995) An effective hash-based algorithm for mining association rules, vol 24(2). ACM, New York
39.
go back to reference Park JS, Chen M-S, Yu PS (1995) Efficient parallel data mining for association rules. In: Proceedings of the fourth international conference on information and knowledge management. ACM, New York Park JS, Chen M-S, Yu PS (1995) Efficient parallel data mining for association rules. In: Proceedings of the fourth international conference on information and knowledge management. ACM, New York
40.
go back to reference Parthasarathy S et al (2001) Parallel data mining for association rules on shared-memory systems. Knowl Inf Syst 3.1:1–29 Parthasarathy S et al (2001) Parallel data mining for association rules on shared-memory systems. Knowl Inf Syst 3.1:1–29
41.
go back to reference Rahbarinia B, Pedram MM, Arabnia HR, Alavi Z (2010) A multi-objective scheme to hide sequential patterns. In: The proceedings of the 2010 international conference on computer and automation engineering, ICCAE, Singapore, 26–28 February 2010, pp 153–158 (E-ISBN 978-1-4244-5586-7, doi:10.1109/ICCAE.2010.5451977) Rahbarinia B, Pedram MM, Arabnia HR, Alavi Z (2010) A multi-objective scheme to hide sequential patterns. In: The proceedings of the 2010 international conference on computer and automation engineering, ICCAE, Singapore, 26–28 February 2010, pp 153–158 (E-ISBN 978-1-4244-5586-7, doi:10.​1109/​ICCAE.​2010.​5451977)
42.
go back to reference Romero C et al (2012) Association rule mining using genetic programming to provide feedback to instructors from multiple-choice quiz data. Expert Syst Romero C et al (2012) Association rule mining using genetic programming to provide feedback to instructors from multiple-choice quiz data. Expert Syst
43.
go back to reference Silvestri C, Orlando S (2012) gpuDCI: exploiting GPUs in frequent itemset mining. In: 20th euromicro international conference on parallel, distributed and network-based processing (PDP). IEEE, New York Silvestri C, Orlando S (2012) gpuDCI: exploiting GPUs in frequent itemset mining. In: 20th euromicro international conference on parallel, distributed and network-based processing (PDP). IEEE, New York
44.
go back to reference Sun J, Pan W (2009) Parallel association rule mining for image data. In: International conference on photonics and image in agriculture engineering (PIAGENG 2009). International Society for Optics and Photonics, London Sun J, Pan W (2009) Parallel association rule mining for image data. In: International conference on photonics and image in agriculture engineering (PIAGENG 2009). International Society for Optics and Photonics, London
45.
go back to reference Talia D, Trunfio P, Verta O (2005) Weka4ws: a wsrf-enabled weka toolkit for distributed data mining on grids. In: Knowledge discovery in databases: PKDD 2005, pp 309–320. Springer, Berlin Talia D, Trunfio P, Verta O (2005) Weka4ws: a wsrf-enabled weka toolkit for distributed data mining on grids. In: Knowledge discovery in databases: PKDD 2005, pp 309–320. Springer, Berlin
46.
go back to reference Wang ZG, Wang CS (2012) A parallel association-rule mining algorithm. In: Web information systems and mining, pp 125–129. Springer, Berlin Wang ZG, Wang CS (2012) A parallel association-rule mining algorithm. In: Web information systems and mining, pp 125–129. Springer, Berlin
48.
go back to reference Yan X, Zhang C, Zhang S (2009) Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support. Expert Syst Appl 36(2):3066–3076CrossRef Yan X, Zhang C, Zhang S (2009) Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support. Expert Syst Appl 36(2):3066–3076CrossRef
49.
go back to reference Yang J, Yang Y (2010) A parallel algorithm for mining association rules. In: 2nd international conference on networking and digital society (ICNDS), vol 1. IEEE, New York Yang J, Yang Y (2010) A parallel algorithm for mining association rules. In: 2nd international conference on networking and digital society (ICNDS), vol 1. IEEE, New York
50.
go back to reference Zaïane OR, El-Hajj M, Lu P (2001) Fast parallel association rule mining without candidacy generation. In: Proceedings of the IEEE international conference on data mining, ICDM 2001. IEEE, New York Zaïane OR, El-Hajj M, Lu P (2001) Fast parallel association rule mining without candidacy generation. In: Proceedings of the IEEE international conference on data mining, ICDM 2001. IEEE, New York
51.
go back to reference Zhang F, Zhang Y, Bakos J (2011) GPApriori: GPU-accelerated frequent itemset mining. In: IEEE international conference on cluster computing (CLUSTER). IEEE, New York Zhang F, Zhang Y, Bakos J (2011) GPApriori: GPU-accelerated frequent itemset mining. In: IEEE international conference on cluster computing (CLUSTER). IEEE, New York
52.
go back to reference Zhou J, Yu K-M, Wu B-C (2010) Parallel frequent patterns mining algorithm on GPU. In: IEEE international conference on systems man and cybernetics (SMC). IEEE, New York Zhou J, Yu K-M, Wu B-C (2010) Parallel frequent patterns mining algorithm on GPU. In: IEEE international conference on systems man and cybernetics (SMC). IEEE, New York
53.
go back to reference Zhou Y, Tan Y (2009) GPU-based parallel particle swarm optimization. In: IEEE congress on evolutionary computation, 2009, CEC’09, pp 1493–1500. IEEE, New York Zhou Y, Tan Y (2009) GPU-based parallel particle swarm optimization. In: IEEE congress on evolutionary computation, 2009, CEC’09, pp 1493–1500. IEEE, New York
Metadata
Title
GPU-based bees swarm optimization for association rules mining
Authors
Youcef Djenouri
Ahcene Bendjoudi
Malika Mehdi
Nadia Nouali-Taboudjemat
Zineb Habbas
Publication date
01-04-2015
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 4/2015
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-014-1366-8

Other articles of this Issue 4/2015

The Journal of Supercomputing 4/2015 Go to the issue

Premium Partner