Skip to main content
Top
Published in: The Journal of Supercomputing 11/2017

03-05-2017

Reconstructing permutation table to improve the Tabu Search for the PFSP on GPU

Published in: The Journal of Supercomputing | Issue 11/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

General-purpose computing on graphics processing unit (GPGPU) has been adopted to accelerate the running of applications which require long execution time in various problem domains. Tabu Search belonging to meta-heuristics optimization has been used to find a suboptimal solution for NP-hard problems within a more reasonable time interval. In this paper, we have investigated in how to improve the performance of Tabu Search algorithm on GPGPU and took the permutation flow shop scheduling problem (PFSP) as the example for our study. In previous approach proposed recently for solving PFSP by Tabu Search on GPU, all the job permutations are stored in global memory to successfully eliminate the occurrences of branch divergence. Nevertheless, the previous algorithm requires a large amount of global memory space, because of a lot of global memory access resulting in system performance degradation. We propose a new approach to address the problem. The main contribution of this paper is an efficient multiple-loop struct to generate most part of the permutation on the fly, which can decrease the size of permutation table and significantly reduce the amount of global memory access. Computational experiments on problems according with benchmark suite for PFSP reveal that the best performance improvement of our approach is about 100%, comparing with the previous work.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Fung J, Tang F, Mann S (2002) Mediated reality using computer graphics hardware for computer vision. In: Proceedings of the International Symposium on Wearable Computing 2002, 83–89 Fung J, Tang F, Mann S (2002) Mediated reality using computer graphics hardware for computer vision. In: Proceedings of the International Symposium on Wearable Computing 2002, 83–89
2.
go back to reference Fung J, Mann S (2004) Computer vision signal processing on graphics processing units. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp V-93–V-96 Fung J, Mann S (2004) Computer vision signal processing on graphics processing units. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp V-93–V-96
3.
go back to reference Abi-Chahla F (2015) Nvidia’s CUDA: The End of the CPU?. Tom’s Hardware Abi-Chahla F (2015) Nvidia’s CUDA: The End of the CPU?. Tom’s Hardware
4.
go back to reference Zouaneb I, Belarbi M, Chouarfia A (2016) Multi approach for real-time systems specification: case study of GPU parallel systems. Int J Big Data Intell 3(2):122–141CrossRef Zouaneb I, Belarbi M, Chouarfia A (2016) Multi approach for real-time systems specification: case study of GPU parallel systems. Int J Big Data Intell 3(2):122–141CrossRef
5.
go back to reference Playne DP, Hawick KA (2015) Benchmarking multi-GPU communication using the shallow water equations. Int J Big Data Intell 2(3):157–167CrossRef Playne DP, Hawick KA (2015) Benchmarking multi-GPU communication using the shallow water equations. Int J Big Data Intell 2(3):157–167CrossRef
6.
go back to reference Wu CC, Ke JY, Lin H, Jhan SS (2014) Adjusting thread parallelism dynamically to accelerate dynamic programming with irregular workload distribution on GPGPUs. Int J Grid High Perform Comput (IJGHPC) 6(1):1–20CrossRef Wu CC, Ke JY, Lin H, Jhan SS (2014) Adjusting thread parallelism dynamically to accelerate dynamic programming with irregular workload distribution on GPGPUs. Int J Grid High Perform Comput (IJGHPC) 6(1):1–20CrossRef
7.
go back to reference Novoa C, Qasem A, Chaparala A (2015) A SIMD tabu search implementation for solving the quadratic assignment problem with GPU acceleration. In: Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure, pp 13 Novoa C, Qasem A, Chaparala A (2015) A SIMD tabu search implementation for solving the quadratic assignment problem with GPU acceleration. In: Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure, pp 13
8.
go back to reference Czapiński M, Barnes S (2011) Tabu search with two approaches to parallel flowshop evaluation on CUDA platform. J Parallel Distrib Comput 71:802–811CrossRef Czapiński M, Barnes S (2011) Tabu search with two approaches to parallel flowshop evaluation on CUDA platform. J Parallel Distrib Comput 71:802–811CrossRef
9.
go back to reference Johnson SM (1954) Optimal two- and three-stage production schedules with setup times included. Naval Res Logist Q 1(1):61–68CrossRefMATH Johnson SM (1954) Optimal two- and three-stage production schedules with setup times included. Naval Res Logist Q 1(1):61–68CrossRefMATH
11.
go back to reference Chung C-S, Flynn J, Kirca O (2002) A branch and bound algorithm to minimize the total flow time for m-machine permutation flowshop problems. Int J Prod Econ 79(3):185–196CrossRefMATH Chung C-S, Flynn J, Kirca O (2002) A branch and bound algorithm to minimize the total flow time for m-machine permutation flowshop problems. Int J Prod Econ 79(3):185–196CrossRefMATH
12.
go back to reference Bautista J, Canoa A, Companys R, Ribasb I (2012) Solving the Fm\(\mid \)block\(\mid \)C\(_{max}\) problem using bounded dynamic programming. Eng Appl Artif Intell 25(6):1235–1245CrossRef Bautista J, Canoa A, Companys R, Ribasb I (2012) Solving the Fm\(\mid \)block\(\mid \)C\(_{max}\) problem using bounded dynamic programming. Eng Appl Artif Intell 25(6):1235–1245CrossRef
14.
go back to reference Gangadharan R, Rajendran C (1993) Heuristic algorithms for scheduling in the no-wait flowshop. Int J Prod Econ 32(3):285–290CrossRef Gangadharan R, Rajendran C (1993) Heuristic algorithms for scheduling in the no-wait flowshop. Int J Prod Econ 32(3):285–290CrossRef
15.
go back to reference Santos N, Rebelo R, Pedroso J (2014) A tabu search for the permutation flow shop problem with sequence dependent setup times. Int J Data Anal Tech Strateg 6(3):275–285CrossRef Santos N, Rebelo R, Pedroso J (2014) A tabu search for the permutation flow shop problem with sequence dependent setup times. Int J Data Anal Tech Strateg 6(3):275–285CrossRef
16.
go back to reference Gao J, Chen R, Dong W (2013) An efficient tabu search algorithm for the distributed permutation flowshop scheduling problem. Int J Prod Res 51(3):641–651CrossRef Gao J, Chen R, Dong W (2013) An efficient tabu search algorithm for the distributed permutation flowshop scheduling problem. Int J Prod Res 51(3):641–651CrossRef
17.
go back to reference Rajkumar R, Shahabudeen P (2009) An improved genetic algorithm for the flowshop scheduling problem. Int J Prod Res 47(1):233–249CrossRefMATH Rajkumar R, Shahabudeen P (2009) An improved genetic algorithm for the flowshop scheduling problem. Int J Prod Res 47(1):233–249CrossRefMATH
18.
go back to reference Jarosław P, Czesław S, Dominik Ż (2013) Optimizing bicriteria flow shop scheduling problem by simulated annealing algorithm. Proc Comput Sci 18:936–945CrossRef Jarosław P, Czesław S, Dominik Ż (2013) Optimizing bicriteria flow shop scheduling problem by simulated annealing algorithm. Proc Comput Sci 18:936–945CrossRef
19.
go back to reference Xu X, Xu Z, Gu X (2011) An asynchronous genetic local search algorithm for the permutation flowshop scheduling problem with total flowtime minimization. Expert Syst Appl 38(7):7970–7979CrossRef Xu X, Xu Z, Gu X (2011) An asynchronous genetic local search algorithm for the permutation flowshop scheduling problem with total flowtime minimization. Expert Syst Appl 38(7):7970–7979CrossRef
20.
go back to reference Banka M, Ghomia SMTF, Jolai F, Behnamian J (2012) Application of particle swarm optimization and simulated annealing algorithms in flow shop scheduling problem under linear deterioration. Adv Eng Softw 47(1):1–6CrossRef Banka M, Ghomia SMTF, Jolai F, Behnamian J (2012) Application of particle swarm optimization and simulated annealing algorithms in flow shop scheduling problem under linear deterioration. Adv Eng Softw 47(1):1–6CrossRef
21.
go back to reference Ahmadiza F (2012) A new ant colony algorithm for makespan minimization in permutation flow shops. Comput Ind Eng 63(2):355–361CrossRef Ahmadiza F (2012) A new ant colony algorithm for makespan minimization in permutation flow shops. Comput Ind Eng 63(2):355–361CrossRef
22.
go back to reference Bożejko W, Uchroński M, Wodeck M (2016) Parallel metaheuristics for the cyclic flow shop scheduling problem. Comput Ind Eng 95:156–163CrossRef Bożejko W, Uchroński M, Wodeck M (2016) Parallel metaheuristics for the cyclic flow shop scheduling problem. Comput Ind Eng 95:156–163CrossRef
23.
go back to reference Czapiński M (2010) Parallel simulated annealing with genetic enhancement for flowshop problem with C\(_{sum}\). Comput Ind Eng 59(4):778–785CrossRef Czapiński M (2010) Parallel simulated annealing with genetic enhancement for flowshop problem with C\(_{sum}\). Comput Ind Eng 59(4):778–785CrossRef
24.
go back to reference Bożejko W (2009) Solving the flow shop problem by parallel programming. J Parallel Distrib Comput 69(5):470–481CrossRef Bożejko W (2009) Solving the flow shop problem by parallel programming. J Parallel Distrib Comput 69(5):470–481CrossRef
25.
go back to reference Nowicki E, Smutnicki C (1998) The flow shop with parallel machines: a tabu search approach. Eur J Oper Res 106(2–3):226–253CrossRefMATH Nowicki E, Smutnicki C (1998) The flow shop with parallel machines: a tabu search approach. Eur J Oper Res 106(2–3):226–253CrossRefMATH
26.
go back to reference Janiak A, Janiak WA, Lichtenstein M (2008) Tabu Search on GPU. J UCS 14(14):2416–2426 Janiak A, Janiak WA, Lichtenstein M (2008) Tabu Search on GPU. J UCS 14(14):2416–2426
27.
go back to reference Kaviani M, Abbasi M, Rahpeyma B, Yusefi M (2014) A hybrid tabu search-simulated annealing method to solve quadratic assignment problem. Decis Sci Lett 3(3):391–396CrossRef Kaviani M, Abbasi M, Rahpeyma B, Yusefi M (2014) A hybrid tabu search-simulated annealing method to solve quadratic assignment problem. Decis Sci Lett 3(3):391–396CrossRef
28.
go back to reference Pattnaik A, Tang X, Jog A, Kayiran O, Mishra AK, Kandemir MT, Mutlu O, Das CR (2016) Scheduling techniques for GPU architectures with processing-in-memory capabilities. In: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, pp 31–44 Pattnaik A, Tang X, Jog A, Kayiran O, Mishra AK, Kandemir MT, Mutlu O, Das CR (2016) Scheduling techniques for GPU architectures with processing-in-memory capabilities. In: Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, pp 31–44
29.
go back to reference Han TD, Abdelrahman TS (2011) Reducing branch divergence in GPU programs. In: Proceedings of 4th Workshop on General Purpose Processing on Graphics Processing Units, pp 1–8 Han TD, Abdelrahman TS (2011) Reducing branch divergence in GPU programs. In: Proceedings of 4th Workshop on General Purpose Processing on Graphics Processing Units, pp 1–8
30.
go back to reference Lindholm E, Nickolls J, Oberman S, Montrym J (2008) NVIDIA Tesla: a unified graphics and computing architecture. IEEE Micro 28(2):39–55CrossRef Lindholm E, Nickolls J, Oberman S, Montrym J (2008) NVIDIA Tesla: a unified graphics and computing architecture. IEEE Micro 28(2):39–55CrossRef
31.
go back to reference Lu F, Song J, Cao X, Zhu X (2012) CPU/GPU computing for long-wave radiation physics on large GPU clusters. Comput Geosci 41:47–55CrossRef Lu F, Song J, Cao X, Zhu X (2012) CPU/GPU computing for long-wave radiation physics on large GPU clusters. Comput Geosci 41:47–55CrossRef
34.
go back to reference Liu Y-F, Liu S-Y (2011) A hybrid discrete artificial bee colony algorithm for permutation flowshop scheduling problem. Appl Soft Comput 13(3):1459–1463CrossRef Liu Y-F, Liu S-Y (2011) A hybrid discrete artificial bee colony algorithm for permutation flowshop scheduling problem. Appl Soft Comput 13(3):1459–1463CrossRef
35.
go back to reference Lin Q, Gao L, Li X, Zhang C (2015) A hybrid backtracking search algorithm for permutation flow-shop. Comput Ind Eng 85:437–446CrossRef Lin Q, Gao L, Li X, Zhang C (2015) A hybrid backtracking search algorithm for permutation flow-shop. Comput Ind Eng 85:437–446CrossRef
38.
go back to reference Huang L-T, Jhan S-S, Li Y-J, Wu C.C (2014) Solving the permutation problem efficiently for tabu search on CUDA GPUs. In: Proceedings of 6th International Conference on Computational Collective Intelligence Technologies and Applications, pp 342–352 Huang L-T, Jhan S-S, Li Y-J, Wu C.C (2014) Solving the permutation problem efficiently for tabu search on CUDA GPUs. In: Proceedings of 6th International Conference on Computational Collective Intelligence Technologies and Applications, pp 342–352
39.
go back to reference Wu C-C, Wei K-C, Lai W-S, Li Y-J (2016) Avoiding duplicated computation to improve the performance of PFSP on CUDA GPUs. Comput Sci Inform Technol 6:13–23 Wu C-C, Wei K-C, Lai W-S, Li Y-J (2016) Avoiding duplicated computation to improve the performance of PFSP on CUDA GPUs. Comput Sci Inform Technol 6:13–23
40.
go back to reference Fung WWL, Sham I, Yuan G, Aamodt TM (2007) Dynamic warp formation and scheduling for efficient GPU control flow. In: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, pp 407–420 Fung WWL, Sham I, Yuan G, Aamodt TM (2007) Dynamic warp formation and scheduling for efficient GPU control flow. In: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, pp 407–420
41.
Metadata
Title
Reconstructing permutation table to improve the Tabu Search for the PFSP on GPU
Publication date
03-05-2017
Published in
The Journal of Supercomputing / Issue 11/2017
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-017-2041-7

Other articles of this Issue 11/2017

The Journal of Supercomputing 11/2017 Go to the issue

Premium Partner