Skip to main content
Top

2023 | OriginalPaper | Chapter

A Goal-Oriented Specification Language for Reinforcement Learning

Authors : Simon Schwan, Verena Klös, Sabine Glesner

Published in: Modeling Decisions for Artificial Intelligence

Publisher: Springer Nature Switzerland

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The design of reinforcement learning (RL) agents is difficult, especially in domains with complex and possibly conflicting objectives such as autonomous driving. In addition to the formal nature of RL with high technical barriers, the fragility of the reward signal results in the common trial-and-error practice in the design of RL agents. We propose a novel goal-oriented specification language that is tailored to reinforcement learning but abstracts from technical details. To overcome the problematic trial-and-error practice, our specification language provides the foundation for an easy and systematic design process in RL.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Altman, E.: Constrained Markov decision processes, vol. 7. CRC Press (1999) Altman, E.: Constrained Markov decision processes, vol. 7. CRC Press (1999)
2.
go back to reference Andrychowicz, M., et al.: Hindsight experience replay. In: Advances in Neural Information Processing Systems, pp. 5048–5058 (2017) Andrychowicz, M., et al.: Hindsight experience replay. In: Advances in Neural Information Processing Systems, pp. 5048–5058 (2017)
3.
4.
go back to reference Camacho, A., Icarte, R.T., Klassen, T.Q., Valenzano, R.A., McIlraith, S.A.: LTL and beyond: formal languages for reward function specification in reinforcement learning. In: Proceedings of the International Joint Conference on Artificial Intelligence, pp. 6065–6073 (2019) Camacho, A., Icarte, R.T., Klassen, T.Q., Valenzano, R.A., McIlraith, S.A.: LTL and beyond: formal languages for reward function specification in reinforcement learning. In: Proceedings of the International Joint Conference on Artificial Intelligence, pp. 6065–6073 (2019)
5.
go back to reference Fujimoto, S., Van Hoof, H., Meger, D.: Addressing function approximation error in actor-critic methods. In: 35th International Conference on Machine Learning (2018) Fujimoto, S., Van Hoof, H., Meger, D.: Addressing function approximation error in actor-critic methods. In: 35th International Conference on Machine Learning (2018)
6.
go back to reference Jothimurugan, K., Alur, R., Bastani, O.: A composable specification language for reinforcement learning tasks. In: Advances in Neural Information Processing Systems, pp. 13021–13030 (2019) Jothimurugan, K., Alur, R., Bastani, O.: A composable specification language for reinforcement learning tasks. In: Advances in Neural Information Processing Systems, pp. 13021–13030 (2019)
7.
go back to reference Klös, V., Göthel, T., Glesner, S.: Runtime management and quantitative evaluation of changing system goals in complex autonomous systems. J. Syst. Softw. 144, 314–327 (2018)CrossRef Klös, V., Göthel, T., Glesner, S.: Runtime management and quantitative evaluation of changing system goals in complex autonomous systems. J. Syst. Softw. 144, 314–327 (2018)CrossRef
8.
go back to reference Li, X., Vasile, C.I., Belta, C.: Reinforcement learning with temporal logic rewards. In: IEEE International Conference on Intelligent Robots and Systems, pp. 3834–3839 (2017) Li, X., Vasile, C.I., Belta, C.: Reinforcement learning with temporal logic rewards. In: IEEE International Conference on Intelligent Robots and Systems, pp. 3834–3839 (2017)
9.
go back to reference Ng, A.Y., Harada, D., Russell, S.J.: Policy invariance under reward transformations: theory and application to reward shaping. In: Proceedings of the Sixteenth International Conference on Machine Learning, pp. 278–287 (1999) Ng, A.Y., Harada, D., Russell, S.J.: Policy invariance under reward transformations: theory and application to reward shaping. In: Proceedings of the Sixteenth International Conference on Machine Learning, pp. 278–287 (1999)
10.
go back to reference Schaul, T., Horgan, D., Gregor, K., Silver, D.: Universal value function approximators. In: Proceedings of the 32nd International Conference on Machine Learning, pp. 1312–1320 (2015) Schaul, T., Horgan, D., Gregor, K., Silver, D.: Universal value function approximators. In: Proceedings of the 32nd International Conference on Machine Learning, pp. 1312–1320 (2015)
11.
go back to reference Schulman, J., Levine, S., Abbeel, P., Jordan, M.I., Moritz, P.: Trust region policy optimization. In: Proceedings of the 32nd International Conference on Machine Learning, pp. 1889–1897 (2015) Schulman, J., Levine, S., Abbeel, P., Jordan, M.I., Moritz, P.: Trust region policy optimization. In: Proceedings of the 32nd International Conference on Machine Learning, pp. 1889–1897 (2015)
12.
go back to reference Sutton, R.S., Barto, A.G.: Reinforcement learning: an introduction. MIT press, 2 edn. (2018) Sutton, R.S., Barto, A.G.: Reinforcement learning: an introduction. MIT press, 2 edn. (2018)
13.
go back to reference Van Lamsweerde, A.: Goal-oriented requirements engineering: a guided tour. In: Proceedings of the IEEE International Conference on Requirements Engineering, pp. 249–261 (2001) Van Lamsweerde, A.: Goal-oriented requirements engineering: a guided tour. In: Proceedings of the IEEE International Conference on Requirements Engineering, pp. 249–261 (2001)
Metadata
Title
A Goal-Oriented Specification Language for Reinforcement Learning
Authors
Simon Schwan
Verena Klös
Sabine Glesner
Copyright Year
2023
DOI
https://doi.org/10.1007/978-3-031-33498-6_12

Premium Partner