Skip to main content

2020 | OriginalPaper | Buchkapitel

FANG: Fast and Efficient Successor-State Generation for Heuristic Optimization on GPUs

verfasst von : Marcel Köster, Julian Groß, Antonio Krüger

Erschienen in: Algorithms and Architectures for Parallel Processing

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Many optimization problems (especially nonsmooth ones) are typically solved by genetic, evolutionary, or metaheuristic-based algorithms. However, these genetic approaches and other related papers typically assume the existence of a neighborhood or successor-state function N(x), where x is a candidate state. The implementation of such a function can become arbitrarily complex in the field of combinatorial optimization. Many N(x) functions for a huge variety of different domain-specific problems have been developed in the past to solve this general problem. However, it has always been a great challenge to port or realize these functions on a massively-parallel architecture like a Graphics Processing Unit (GPU). We present a GPU-based method called FANG that implements a generic and reusable N(x) for arbitrary domains in the field of combinatorial optimization. It can be customized to satisfy domain-specific requirements and leverages the underlying hardware in a fast and efficient way by construction. Moreover, our method has a high scalability with respect to the number of input states and the complexity of a single state. Measurements show significant performance improvements compared to traditional exploration approaches leveraging the CPU on our evaluation scenarios.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
We will refer to a single SIMT unit as warp in the scope of this paper.
 
2
The number of possibilities can be seen as a subset of all possible successor states from Fig. 1-2.
 
3
Without limiting the parallel execution of multiple groups per multiprocessor.
 
Literatur
1.
Zurück zum Zitat Abdelkafi, O., Chebil, K., Khemakhem, M.: Parallel local search on GPU and CPU with OpenCL language. In: Proceedings of the First International Conference on Reasoning and Optimization in Information Systems, September 2013 Abdelkafi, O., Chebil, K., Khemakhem, M.: Parallel local search on GPU and CPU with OpenCL language. In: Proceedings of the First International Conference on Reasoning and Optimization in Information Systems, September 2013
2.
Zurück zum Zitat Campeotto, F., Dovier, A., Fioretto, F., Pontelli, E.: A GPU implementation of large neighborhood search for solving constraint optimization problems. In: Proceedings of the Twenty-First European Conference on Artificial Intelligence (2014) Campeotto, F., Dovier, A., Fioretto, F., Pontelli, E.: A GPU implementation of large neighborhood search for solving constraint optimization problems. In: Proceedings of the Twenty-First European Conference on Artificial Intelligence (2014)
5.
Zurück zum Zitat Ghorpade, S., Kamalapur, S.: Solution level parallelization of local search metaheuristic algorithm on GPU. Int. J. Comput. Sci. Mob. Comput. (2014) Ghorpade, S., Kamalapur, S.: Solution level parallelization of local search metaheuristic algorithm on GPU. Int. J. Comput. Sci. Mob. Comput. (2014)
6.
Zurück zum Zitat Köster, M., Leißa, R., Hack, S., Membarth, R., Slusallek, P.: Code refinement of stencil codes. Parallel Process. Lett. (PPL) 24, 1–16 (2014)MathSciNet Köster, M., Leißa, R., Hack, S., Membarth, R., Slusallek, P.: Code refinement of stencil codes. Parallel Process. Lett. (PPL) 24, 1–16 (2014)MathSciNet
7.
Zurück zum Zitat Luong, T.V., Loukil, L., Melab, N., Talbi, E.: A GPU-based iterated tabu search for solving the quadratic 3-dimensional assignment problem. In: ACS/IEEE International Conference on Computer Systems and Applications (AICCSA) (2010) Luong, T.V., Loukil, L., Melab, N., Talbi, E.: A GPU-based iterated tabu search for solving the quadratic 3-dimensional assignment problem. In: ACS/IEEE International Conference on Computer Systems and Applications (AICCSA) (2010)
8.
Zurück zum Zitat Luong, T.V., Melab, N., Talbi, E.G.: Large neighborhood local search optimization on graphics processing units. In: Workshop on Large-Scale Parallel Processing (LSPP) in Conjunction with the International Parallel & Distributed Processing Symposium (IPDPS) (2010) Luong, T.V., Melab, N., Talbi, E.G.: Large neighborhood local search optimization on graphics processing units. In: Workshop on Large-Scale Parallel Processing (LSPP) in Conjunction with the International Parallel & Distributed Processing Symposium (IPDPS) (2010)
9.
Zurück zum Zitat Luong, T.V., Melab, N., Talbi, E.G.: Neighborhood structures for GPU-based local search algorithms. Parallel Process. Lett. 20, 307–324 (2010)MathSciNetCrossRef Luong, T.V., Melab, N., Talbi, E.G.: Neighborhood structures for GPU-based local search algorithms. Parallel Process. Lett. 20, 307–324 (2010)MathSciNetCrossRef
10.
Zurück zum Zitat Marsaglia, G.: Xorshift RNGs. J. Stat. Softw. 8, 1–6 (2003) Marsaglia, G.: Xorshift RNGs. J. Stat. Softw. 8, 1–6 (2003)
11.
Zurück zum Zitat Melab, N., Luong, T.V., Boufaras, K., Talbi, E.G.: ParadisEO-MO-GPU: a framework for parallel GPU-based local search metaheuristics. In: 11th International Work-Conference on Artificial Neural Networks (2011) Melab, N., Luong, T.V., Boufaras, K., Talbi, E.G.: ParadisEO-MO-GPU: a framework for parallel GPU-based local search metaheuristics. In: 11th International Work-Conference on Artificial Neural Networks (2011)
12.
Zurück zum Zitat Ming Lam, Y., Hung Tsoi, K., Luk, W.: Parallel neighbourhood search on many-core platforms. Int. J. Comput. Sci. Eng. 8, 281–293 (2013) Ming Lam, Y., Hung Tsoi, K., Luk, W.: Parallel neighbourhood search on many-core platforms. Int. J. Comput. Sci. Eng. 8, 281–293 (2013)
13.
Zurück zum Zitat Mohammad Harun Rashid, L.T.: Parallel combinatorial optimization heuristics with GPUs. Adv. Sci. Technol. Eng. Syst. J. 3 (2018) Mohammad Harun Rashid, L.T.: Parallel combinatorial optimization heuristics with GPUs. Adv. Sci. Technol. Eng. Syst. J. 3 (2018)
14.
Zurück zum Zitat Munawar, A., Wahib, M., Munetomo, M., Akama, K.: Hybrid of genetic algorithm and local search to solve MAX-SAT problem using nVidia CUDA framework. Genet. Program. Evolvable Mach. 10, 391 (2009)CrossRef Munawar, A., Wahib, M., Munetomo, M., Akama, K.: Hybrid of genetic algorithm and local search to solve MAX-SAT problem using nVidia CUDA framework. Genet. Program. Evolvable Mach. 10, 391 (2009)CrossRef
15.
Zurück zum Zitat Novoa, C., Qasem, A., Chaparala, A.: A SIMD tabu search implementation for solving the quadratic assignment problem with GPU acceleration. In: Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure (2015) Novoa, C., Qasem, A., Chaparala, A.: A SIMD tabu search implementation for solving the quadratic assignment problem with GPU acceleration. In: Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure (2015)
16.
Zurück zum Zitat NVIDIA: Faster Parallel Reductions on Kepler (2014) NVIDIA: Faster Parallel Reductions on Kepler (2014)
17.
Zurück zum Zitat NVIDIA: CUDA C Programming Guide v10 (2019) NVIDIA: CUDA C Programming Guide v10 (2019)
18.
Zurück zum Zitat Schulz, C., Hasle, G., Brodtkorb, A.R., Hagen, T.R.: GPU computing in discrete optimization. Part II: survey focused on routing problems. EURO J. Transp. Logistics 2, 159–186 (2013)CrossRef Schulz, C., Hasle, G., Brodtkorb, A.R., Hagen, T.R.: GPU computing in discrete optimization. Part II: survey focused on routing problems. EURO J. Transp. Logistics 2, 159–186 (2013)CrossRef
19.
Zurück zum Zitat Talbi, E.G.: Metaheuristics: From Design to Implementation. Wiley, Hoboken (2009)CrossRef Talbi, E.G.: Metaheuristics: From Design to Implementation. Wiley, Hoboken (2009)CrossRef
Metadaten
Titel
FANG: Fast and Efficient Successor-State Generation for Heuristic Optimization on GPUs
verfasst von
Marcel Köster
Julian Groß
Antonio Krüger
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-38991-8_15