Skip to main content

2022 | OriginalPaper | Buchkapitel

Unsupervised Training for Neural TSP Solver

verfasst von : Elīza Gaile, Andis Draguns, Emīls Ozoliņš, Kārlis Freivalds

Erschienen in: Learning and Intelligent Optimization

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

There has been a growing number of machine learning methods for approximately solving the travelling salesman problem. However, these methods often require solved instances for training or use complex reinforcement learning approaches that need a large amount of tuning. To avoid these problems, we introduce a novel unsupervised learning approach. We use a relaxation of an integer linear program for TSP to construct a loss function that does not require correct instance labels. With variable discretization, its minimum coincides with the optimal or near-optimal solution. Furthermore, this loss function is differentiable and thus can be used to train neural networks directly. We use our loss function with a Graph Neural Network and design controlled experiments on both Euclidean and asymmetric TSP. Our approach has the advantage over supervised learning of not requiring large labelled datasets. In addition, the performance of our approach surpasses reinforcement learning for asymmetric TSP and is comparable to reinforcement learning for Euclidean instances. Our approach is also more stable and easier to train than reinforcement learning.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Applegate, D., Bixby, R., Chvátal, V., Cook, W.: Implementing the dantzig-fulkerson-johnson algorithm for large traveling salesman problems. Math. Program. 97, 91–153 (2003)MathSciNetCrossRefMATH Applegate, D., Bixby, R., Chvátal, V., Cook, W.: Implementing the dantzig-fulkerson-johnson algorithm for large traveling salesman problems. Math. Program. 97, 91–153 (2003)MathSciNetCrossRefMATH
2.
Zurück zum Zitat Bello, I., Pham, H., Le, Q.V., Norouzi, M., Bengio, S.: Neural combinatorial optimization with reinforcement learning. ArXiv abs/1611.09940 (2017) Bello, I., Pham, H., Le, Q.V., Norouzi, M., Bengio, S.: Neural combinatorial optimization with reinforcement learning. ArXiv abs/1611.09940 (2017)
4.
Zurück zum Zitat Deudon, M., Cournut, P., Lacoste, A., Adulyasak, Y., Rousseau, L.M.: Learning heuristics for the tsp by policy gradient. In: CPAIOR (2018) Deudon, M., Cournut, P., Lacoste, A., Adulyasak, Y., Rousseau, L.M.: Learning heuristics for the tsp by policy gradient. In: CPAIOR (2018)
5.
Zurück zum Zitat Frey, B.: Continuous sigmoidal belief networks trained using slice sampling. In: NIPS (1996) Frey, B.: Continuous sigmoidal belief networks trained using slice sampling. In: NIPS (1996)
7.
8.
Zurück zum Zitat Jang, E., Gu, S., Poole, B.: Categorical reparameterization with gumbel-softmax. In: 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, 24–26 April, 2017, Conference Track Proceedings. OpenReview.net (2017). https://openreview.net/forum?id=rkE3y85ee Jang, E., Gu, S., Poole, B.: Categorical reparameterization with gumbel-softmax. In: 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, 24–26 April, 2017, Conference Track Proceedings. OpenReview.net (2017). https://​openreview.​net/​forum?​id=​rkE3y85ee
9.
Zurück zum Zitat Joshi, C.K., Cappart, Q., Rousseau, L.M., Laurent, T., Bresson, X.: Learning tsp requires rethinking generalization. arXiv preprint arXiv:2006.07054 (2020) Joshi, C.K., Cappart, Q., Rousseau, L.M., Laurent, T., Bresson, X.: Learning tsp requires rethinking generalization. arXiv preprint arXiv:​2006.​07054 (2020)
10.
Zurück zum Zitat Joshi, C.K., Laurent, T., Bresson, X.: On learning paradigms for the travelling salesman problem. ArXiv abs/1910.07210 (2019) Joshi, C.K., Laurent, T., Bresson, X.: On learning paradigms for the travelling salesman problem. ArXiv abs/1910.07210 (2019)
11.
Zurück zum Zitat Joshi, C.K., Laurent, T., Bresson, X.: An efficient graph convolutional network technique for the travelling salesman problem. arXiv preprint arXiv:1906.01227 (2019) Joshi, C.K., Laurent, T., Bresson, X.: An efficient graph convolutional network technique for the travelling salesman problem. arXiv preprint arXiv:​1906.​01227 (2019)
12.
Zurück zum Zitat Khalil, E.B., Dai, H., Zhang, Y., Dilkina, B., Song, L.: Learning combinatorial optimization algorithms over graphs. In: NIPS (2017) Khalil, E.B., Dai, H., Zhang, Y., Dilkina, B., Song, L.: Learning combinatorial optimization algorithms over graphs. In: NIPS (2017)
13.
Zurück zum Zitat Kool, W., Hoof, H.V., Welling, M.: Attention, learn to solve routing problems! In: ICLR (2019) Kool, W., Hoof, H.V., Welling, M.: Attention, learn to solve routing problems! In: ICLR (2019)
14.
Zurück zum Zitat Kwon, Y.D., Choo, J., Yoon, I., Park, M., Park, D., Gwon, Y.: Matrix encoding networks for neural combinatorial optimization. ArXiv abs/2106.11113 (2021) Kwon, Y.D., Choo, J., Yoon, I., Park, M., Park, D., Gwon, Y.: Matrix encoding networks for neural combinatorial optimization. ArXiv abs/2106.11113 (2021)
15.
Zurück zum Zitat Maddison, C.J., Mnih, A., Teh, Y.W.: The concrete distribution: a continuous relaxation of discrete random variables. In: 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24–26, 2017, Conference Track Proceedings. OpenReview.net (2017). https://openreview.net/forum?id=S1jE5L5gl Maddison, C.J., Mnih, A., Teh, Y.W.: The concrete distribution: a continuous relaxation of discrete random variables. In: 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24–26, 2017, Conference Track Proceedings. OpenReview.net (2017). https://​openreview.​net/​forum?​id=​S1jE5L5gl
16.
Zurück zum Zitat Matai, R., Singh, S., Mittal, M.L.: Traveling salesman problem: an overview of applications, formulations, and solution approaches (2010) Matai, R., Singh, S., Mittal, M.L.: Traveling salesman problem: an overview of applications, formulations, and solution approaches (2010)
17.
Zurück zum Zitat Nazari, M., Oroojlooy, A., Snyder, L., Takác, M.: Reinforcement learning for solving the vehicle routing problem. In: NeurIPS (2018) Nazari, M., Oroojlooy, A., Snyder, L., Takác, M.: Reinforcement learning for solving the vehicle routing problem. In: NeurIPS (2018)
18.
Zurück zum Zitat Vinyals, O., Fortunato, M., Jaitly, N.: Pointer networks. In: NIPS (2015) Vinyals, O., Fortunato, M., Jaitly, N.: Pointer networks. In: NIPS (2015)
Metadaten
Titel
Unsupervised Training for Neural TSP Solver
verfasst von
Elīza Gaile
Andis Draguns
Emīls Ozoliņš
Kārlis Freivalds
Copyright-Jahr
2022
DOI
https://doi.org/10.1007/978-3-031-24866-5_25