Published in: International Journal of Intelligent Transportation Systems Research 1/2023

02.02.2023

Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial Detection

Authors: Romain Ducrocq, Nadir Farhi


Abstract

Intelligent traffic signal controllers, which apply DQN algorithms to traffic light policy optimization, efficiently reduce traffic congestion by adjusting signal timing to real-time traffic. Most proposals in the literature, however, assume that all vehicles at the intersection are detected, an unrealistic scenario. Recently, new wireless communication technologies have enabled cost-efficient detection of connected vehicles by roadside infrastructure. Since only a small fraction of the total fleet is currently equipped, methods that perform well under low detection rates are desirable. In this paper, we propose a deep reinforcement Q-learning model to optimize traffic signal control at an isolated intersection, in a partially observable environment with connected vehicles. First, we present the novel DQN model within the RL framework: we introduce a new state representation for partially observable environments and a new reward function for traffic signal control, and provide a network architecture and tuned hyper-parameters. Second, we evaluate the model's performance in numerical simulations on multiple scenarios, in two steps: first under full detection against existing actuated controllers, then under partial detection with loss estimates for varying proportions of connected vehicles. Finally, from the obtained results, we define detection-rate thresholds for acceptable and optimal performance levels. The source code implementation of the model is available at: https://github.com/romainducrocq/DQN-ITSCwPD
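The partial-detection setting described in the abstract can be illustrated with a small sketch: only a Bernoulli-sampled subset of vehicles (the connected ones) contributes to a discretized occupancy state over the approach lane. This is an illustrative toy, not the paper's actual state encoding; all names, cell counts, and positions below are hypothetical.

```python
import random

def partial_state(positions, lane_length, n_cells, detection_rate, rng):
    """Build a discretized occupancy state from connected vehicles only.

    positions: vehicle distances from the stop line (meters).
    Each vehicle is 'connected' (hence observable) with probability
    detection_rate, mimicking a partially observable intersection.
    Illustrative sketch, not the paper's state representation.
    """
    cell_len = lane_length / n_cells
    state = [0] * n_cells
    for pos in positions:
        if rng.random() < detection_rate:  # vehicle is connected/detected
            cell = min(int(pos / cell_len), n_cells - 1)
            state[cell] += 1
    return state

rng = random.Random(0)
vehicles = [3.0, 12.5, 27.0, 44.0, 88.0]  # hypothetical positions on a 100 m approach

full = partial_state(vehicles, 100.0, 10, 1.0, rng)    # 100% detection: all 5 seen
sparse = partial_state(vehicles, 100.0, 10, 0.2, rng)  # 20% detection: a random subset
```

At a 100% detection rate the state counts every vehicle; at 20% it typically counts only one or two, which is exactly the information loss the controller must cope with.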

Footnotes
1
Fairness is to be understood here as follows: a distribution d1 of delays over vehicles is fairer than another distribution d2 if d1 is closer to the uniform distribution than d2 is.
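As a concrete illustration of this fairness notion, one possible metric (chosen here purely for illustration; the footnote does not fix a particular distance) is the L1 distance between a delay distribution and the uniform distribution with the same total delay:

```python
def uniformity_gap(delays):
    """L1 distance between a delay distribution and the uniform one
    with the same total delay; a smaller gap means a fairer distribution.
    Illustrative metric: the footnote only requires 'closer to uniform'."""
    n = len(delays)
    mean = sum(delays) / n
    return sum(abs(d - mean) for d in delays)

d1 = [10, 11, 9, 10]  # same total delay spread almost evenly
d2 = [0, 2, 3, 35]    # same total delay concentrated on one vehicle
# under this metric, d1 is fairer than d2
```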
 
Metadata
Title
Deep Reinforcement Q-Learning for Intelligent Traffic Signal Control with Partial Detection
Authors
Romain Ducrocq
Nadir Farhi
Publication date
02.02.2023
Publisher
Springer US
Published in
International Journal of Intelligent Transportation Systems Research / Issue 1/2023
Print ISSN: 1348-8503
Electronic ISSN: 1868-8659
DOI
https://doi.org/10.1007/s13177-023-00346-4
