Skip to main content

2020 | OriginalPaper | Buchkapitel

Applying External Guidance Commands to Deep Reinforcement Learning for Autonomous Driving

verfasst von : Fenjiro Youssef, Benbrahim Houda

Erschienen in: Innovations in Smart Cities Applications Edition 3

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

End-to-end deep reinforcement learning [1] algorithms used in autonomous car field and trained on lane-keeping task achieve good results in roads that don’t require decision making but cannot deal with situations where getting driving direction is mandatory like choosing to turn left or right in an upcoming crossroads, deciding when to leave a traffic circle or toward which path/destination to go. In this paper we introduce a new Deep Reinforcement Learning model that enable to integrate guidance commands at test time as a complementary input that indicate the right direction, that we call Deep Reinforcement Learning with guidance (DRLG), we apply the DRLG architecture on two algorithms, the asynchronous advantage actor-critic A3C and the Deep Deterministic Policy Gradient algorithm DDPG. For the training and experimentations of the new model, we adopt the CARLA virtual environment, a High-fidelity realistic driving simulator as a testbed since leading driving tests in the real world turns out to be neither safe nor affordable in term of materials and requirements. The results of testing show that DDPG and A3C with Guidance (DDPGG and A3CG) models succeed on their driving task through roads/roundabouts, by being appropriately responsive to the external commands, which allow to the autonomous car to follow the indicated route and take the right turns.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Bansal, M., Krizhevsky, A., Ogale, A.: ChauffeurNet: learning to drive by imitating the best and synthesizing the worst (2018). arXiv:181203079 [cs] Bansal, M., Krizhevsky, A., Ogale, A.: ChauffeurNet: learning to drive by imitating the best and synthesizing the worst (2018). arXiv:​181203079 [cs]
3.
Zurück zum Zitat Bojarski, M., Del Testa, D., Dworakowski, D., Firner, B., Flepp, B., Goyal, P., Jackel, L.D., Monfort, M., Muller, U., Zhang, J., Zhang, X., Zhao, J., Zieba, K.: End to end learning for self-driving cars (2016). arXiv:160407316 [cs] Bojarski, M., Del Testa, D., Dworakowski, D., Firner, B., Flepp, B., Goyal, P., Jackel, L.D., Monfort, M., Muller, U., Zhang, J., Zhang, X., Zhao, J., Zieba, K.: End to end learning for self-driving cars (2016). arXiv:​160407316 [cs]
6.
Zurück zum Zitat Chen, C., Seff, A., Kornhauser, A., Xiao, J.: DeepDriving: learning affordance for direct perception in autonomous driving. In: 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, pp. 2722–2730. IEEE (2015) Chen, C., Seff, A., Kornhauser, A., Xiao, J.: DeepDriving: learning affordance for direct perception in autonomous driving. In: 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, pp. 2722–2730. IEEE (2015)
7.
Zurück zum Zitat Codevilla, F., Müller, M., López, A., Koltun, V., Dosovitskiy, A.: End-to-end driving via conditional imitation learning (2017). arXiv:171002410 [cs] Codevilla, F., Müller, M., López, A., Koltun, V., Dosovitskiy, A.: End-to-end driving via conditional imitation learning (2017). arXiv:​171002410 [cs]
9.
Zurück zum Zitat Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., Koltun, V.: CARLA: an open urban driving simulator (2017). arXiv:171103938 [cs] Dosovitskiy, A., Ros, G., Codevilla, F., Lopez, A., Koltun, V.: CARLA: an open urban driving simulator (2017). arXiv:​171103938 [cs]
11.
Zurück zum Zitat Fridman, L., Terwilliger, J., Jenik, B.: DeepTraffic: crowdsourced hyperparameter tuning of deep reinforcement learning systems for multi-agent dense traffic navigation (2018). arXiv:180102805 [cs] Fridman, L., Terwilliger, J., Jenik, B.: DeepTraffic: crowdsourced hyperparameter tuning of deep reinforcement learning systems for multi-agent dense traffic navigation (2018). arXiv:​180102805 [cs]
12.
Zurück zum Zitat Aghdam, H.H., Heravi, E.J.: Guide to convolutional neural networks a practical application to traffic-sign detection and classification. Springer Aghdam, H.H., Heravi, E.J.: Guide to convolutional neural networks a practical application to traffic-sign detection and classification. Springer
13.
Zurück zum Zitat Henein, M., Kennedy, G., Ila, V., Mahony, R.: Simultaneous localization and mapping with dynamic rigid objects (2018). arXiv:180503800 [cs] Henein, M., Kennedy, G., Ila, V., Mahony, R.: Simultaneous localization and mapping with dynamic rigid objects (2018). arXiv:​180503800 [cs]
15.
Zurück zum Zitat Kılıç, İ., Yazıcı, A., Yıldız, Ö., Özçelikors, M., Ondoğan, A.: Intelligent adaptive cruise control system design and implementation. In: 2015 10th SoSE Conference, pp 232–237 (2015) Kılıç, İ., Yazıcı, A., Yıldız, Ö., Özçelikors, M., Ondoğan, A.: Intelligent adaptive cruise control system design and implementation. In: 2015 10th SoSE Conference, pp 232–237 (2015)
16.
Zurück zum Zitat Kocic, J., Jovicic, N., Drndarevic, V.: Driver behavioral cloning using deep learning (2018) Kocic, J., Jovicic, N., Drndarevic, V.: Driver behavioral cloning using deep learning (2018)
17.
Zurück zum Zitat Pananurak, W., Thanok, S., Parnichkun, M.: pp. 1794–1799 (2009) Pananurak, W., Thanok, S., Parnichkun, M.: pp. 1794–1799 (2009)
19.
Zurück zum Zitat Pomerleau, D.A.: ALVINN: an autonomous land vehicle in a neural network. In: Touretzky, D.S. (ed.) Advances in Neural Information Processing Systems 1 (1989) Pomerleau, D.A.: ALVINN: an autonomous land vehicle in a neural network. In: Touretzky, D.S. (ed.) Advances in Neural Information Processing Systems 1 (1989)
20.
21.
Zurück zum Zitat Rosencrantz, M., Gordon, G., Thrun, S.: Decentralized sensor fusion with distributed particle filters (2012). arXiv:12122493 [cs] Rosencrantz, M., Gordon, G., Thrun, S.: Decentralized sensor fusion with distributed particle filters (2012). arXiv:​12122493 [cs]
22.
Zurück zum Zitat Ross, S., Gordon, G.J., Bagnell, J.A.: A reduction of imitation learning and structured prediction to no-regret online learning (2010). arXiv:10110686 [cs, stat] Ross, S., Gordon, G.J., Bagnell, J.A.: A reduction of imitation learning and structured prediction to no-regret online learning (2010). arXiv:​10110686 [cs, stat]
23.
Zurück zum Zitat Sanders, P., Schultes, D.: Engineering fast route planning algorithms. In: Proceedings of the 6th International Conference on Experimental Algorithms, pp. 23–36. Springer, Berlin (2007) Sanders, P., Schultes, D.: Engineering fast route planning algorithms. In: Proceedings of the 6th International Conference on Experimental Algorithms, pp. 23–36. Springer, Berlin (2007)
24.
Zurück zum Zitat Sasiadek, J., Hartana, P.: Sensor data fusion using Kalman filter (2000) Sasiadek, J., Hartana, P.: Sensor data fusion using Kalman filter (2000)
25.
Zurück zum Zitat Schratter, M., Bouton, M., Kochenderfer, M.J., Watzenig, D.: Pedestrian collision avoidance system for scenarios with occlusions (2019). arXiv:190411566 [cs] Schratter, M., Bouton, M., Kochenderfer, M.J., Watzenig, D.: Pedestrian collision avoidance system for scenarios with occlusions (2019). arXiv:​190411566 [cs]
26.
Zurück zum Zitat Wymann, B., Dimitrakakis, C., Sumner, A., Espie, E., Guionneau, C.: TORCS: open racing car Wymann, B., Dimitrakakis, C., Sumner, A., Espie, E., Guionneau, C.: TORCS: open racing car
27.
Zurück zum Zitat Zou, Q., Jiang, H., Dai, Q., Yue, Y., Chen, L., Wang, Q.: Robust lane detection from continuous driving scenes using deep neural networks (2019). arXiv:190302193 [cs] Zou, Q., Jiang, H., Dai, Q., Yue, Y., Chen, L., Wang, Q.: Robust lane detection from continuous driving scenes using deep neural networks (2019). arXiv:​190302193 [cs]
28.
Zurück zum Zitat Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)MATH Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)MATH
Metadaten
Titel
Applying External Guidance Commands to Deep Reinforcement Learning for Autonomous Driving
verfasst von
Fenjiro Youssef
Benbrahim Houda
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-37629-1_60

    Premium Partner