Skip to main content
Erschienen in: Information Systems Frontiers 1/2024

06.11.2021

Deep Reinforcement Learning-Based Robot Exploration for Constructing Map of Unknown Environment

verfasst von: Shih-Yeh Chen, Qi-Fong He, Chin-Feng Lai

Erschienen in: Information Systems Frontiers | Ausgabe 1/2024

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In traditional environment exploration algorithms, two problems are still waiting to be solved. One is that as the exploration time increases, the robot will repeatedly explore the areas that have been explored. The other is that in order to explore the environment more accurately, the robot will cause slight collisions during the exploration process. In order to solve the two problems, a DQN-based exploration model is proposed, which enables the robot to quickly find the unexplored area in an unknown environment, and designs a DQN-based navigation model to solve the local minima problem generated by the robot during the exploration. Through the switching mechanism of exploration model and navigation model, the robot can quickly complete the exploration task through selecting the modes according to the environment exploration situation. In the experiment results, the difference between the proposed unknown environment exploration method and the previous known-environment exploration methods research is less than 5% under the same exploration time. And in the proposed method, the robot can achieve zero collision and almost zero repeated exploration of the area when it has been trained for 30w rounds. Therefore, it can be seen that the proposed method is more practical than the previous methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Chaisittiporn R. (2018). Service boy robot path planner by deep q network. International Journal of Applied Computer Technology and Information Systems, 8(1), 50–55. Chaisittiporn R. (2018). Service boy robot path planner by deep q network. International Journal of Applied Computer Technology and Information Systems, 8(1), 50–55.
Zurück zum Zitat Civera, J., Davison, A. J., & Montiel, J. M. (2008). Inverse depth parametrization for monocular slam. IEEE Transactions on Robotics, 24(5), 932–945.CrossRef Civera, J., Davison, A. J., & Montiel, J. M. (2008). Inverse depth parametrization for monocular slam. IEEE Transactions on Robotics, 24(5), 932–945.CrossRef
Zurück zum Zitat Cummins, M., & Newman, P. (2011). Appearance-only slam at large scale with fab-map 2.0. The International Journal of Robotics Research, 30(9), 1100–1123.CrossRef Cummins, M., & Newman, P. (2011). Appearance-only slam at large scale with fab-map 2.0. The International Journal of Robotics Research, 30(9), 1100–1123.CrossRef
Zurück zum Zitat Dissanayake, M. G., et al. (2001). A solution to the simultaneous localization and map building (slam) problem. IEEE Transactions on Robotics and Automation, 17(3), 229–241.CrossRef Dissanayake, M. G., et al. (2001). A solution to the simultaneous localization and map building (slam) problem. IEEE Transactions on Robotics and Automation, 17(3), 229–241.CrossRef
Zurück zum Zitat Engel J., Schöps T., & Cremers D. (2014). Lsd-slam: Large-scale direct monocular slam. European conference on computer vision, 834–849. Engel J., Schöps T., & Cremers D. (2014). Lsd-slam: Large-scale direct monocular slam. European conference on computer vision, 834–849.
Zurück zum Zitat Gálvez-López, D., & Tardos, J. D. (2012). Bags of binary words for fast place recognition in image sequences. IEEE Transactions on Robotics, 28(5), 1188–1197.CrossRef Gálvez-López, D., & Tardos, J. D. (2012). Bags of binary words for fast place recognition in image sequences. IEEE Transactions on Robotics, 28(5), 1188–1197.CrossRef
Zurück zum Zitat Khatib, O. (1986). Real-time obstacle avoidance for manipulators and mobile robots. Autonomous robot vehicles, 396–404. Khatib, O. (1986). Real-time obstacle avoidance for manipulators and mobile robots. Autonomous robot vehicles, 396–404.
Zurück zum Zitat Klein G. & Murray D. (2007). Parallel tracking and mapping for small ar workspaces. 6th IEEE and ACM international symposium on mixed and augmented reality, 225-234. Klein G. & Murray D. (2007). Parallel tracking and mapping for small ar workspaces. 6th IEEE and ACM international symposium on mixed and augmented reality, 225-234.
Zurück zum Zitat Levine, S., Pastor, P., Krizhevsky, A., Ibarz, J., & Quillen, D. (2018). Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. The International Journal of Robotics Research, 37(4–5), 421–436.CrossRef Levine, S., Pastor, P., Krizhevsky, A., Ibarz, J., & Quillen, D. (2018). Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection. The International Journal of Robotics Research, 37(4–5), 421–436.CrossRef
Zurück zum Zitat Leofante, F., Ábrahám, E., Niemueller, T., et al. (2019). Integrated synthesis and execution of optimal plans for multi-robot systems in logistics. Information Systems Frontiers, 21, 87–107.CrossRef Leofante, F., Ábrahám, E., Niemueller, T., et al. (2019). Integrated synthesis and execution of optimal plans for multi-robot systems in logistics. Information Systems Frontiers, 21, 87–107.CrossRef
Zurück zum Zitat Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91–110.CrossRef Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91–110.CrossRef
Zurück zum Zitat Marchesini E. & Farinelli A. (2020). Genetic DRL for mapless navigation. Proceedings of the 19th international conference on autonomous agents and MultiAgent systems, 1919-1921. Marchesini E. & Farinelli A. (2020). Genetic DRL for mapless navigation. Proceedings of the 19th international conference on autonomous agents and MultiAgent systems, 1919-1921.
Zurück zum Zitat Mnih, V., et al. (2015). Human-level control through DRL. Nature, 518(7540), 529–533.CrossRef Mnih, V., et al. (2015). Human-level control through DRL. Nature, 518(7540), 529–533.CrossRef
Zurück zum Zitat Nister, D., & Stewenius, H. (2006). Scalable recognition with a vocabulary tree. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2, 2161–2168. Nister, D., & Stewenius, H. (2006). Scalable recognition with a vocabulary tree. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2, 2161–2168.
Zurück zum Zitat Rawin, C. (2018). Service boy robot path planner by deep Q network. International Journal of Applied Computer Technology and Information Systems, 8(1), 50–55. Rawin, C. (2018). Service boy robot path planner by deep Q network. International Journal of Applied Computer Technology and Information Systems, 8(1), 50–55.
Zurück zum Zitat Rosten E. & Drummond T. (2006). Machine learning for high-speed corner detection. European conference on computer vision, 430-443. Rosten E. & Drummond T. (2006). Machine learning for high-speed corner detection. European conference on computer vision, 430-443.
Zurück zum Zitat Rublee E., Rabaud V., Konolige K., & Bradski G. (2011). Orb: An efficient alternative to sift or surf. International conference on computer vision, 2564-2571. Rublee E., Rabaud V., Konolige K., & Bradski G. (2011). Orb: An efficient alternative to sift or surf. International conference on computer vision, 2564-2571.
Zurück zum Zitat Strasdat, H., Montiel, J. M., & Davison, A. J. (2012). Visual slam: Why filter? Image and Vision Computing, 30(2), 65–77.CrossRef Strasdat, H., Montiel, J. M., & Davison, A. J. (2012). Visual slam: Why filter? Image and Vision Computing, 30(2), 65–77.CrossRef
Zurück zum Zitat Tai L., Paolo G., & Liu M. (2017). Virtual-to-real DRL: Continuous control of mobile robots for mapless navigation. IEEE/RSJ international conference on intelligent robots and systems, 31–36. Tai L., Paolo G., & Liu M. (2017). Virtual-to-real DRL: Continuous control of mobile robots for mapless navigation. IEEE/RSJ international conference on intelligent robots and systems, 31–36.
Zurück zum Zitat Volos, C. K., Kyprianidis, I. M., & Stouboulos, I. N. (2013). Experimental investigation on coverage performance of a chaotic autonomous mobile robot. Robotics and Autonomous Systems, 61(12), 1314–1322.CrossRef Volos, C. K., Kyprianidis, I. M., & Stouboulos, I. N. (2013). Experimental investigation on coverage performance of a chaotic autonomous mobile robot. Robotics and Autonomous Systems, 61(12), 1314–1322.CrossRef
Zurück zum Zitat Xue X., Li Z., Zhang D., & Yan Y. (2019). A DRL method for mobile robot collision avoidance based on double dqn. IEEE 28th international symposium on industrial electronics, 2131-2136. Xue X., Li Z., Zhang D., & Yan Y. (2019). A DRL method for mobile robot collision avoidance based on double dqn. IEEE 28th international symposium on industrial electronics, 2131-2136.
Zurück zum Zitat Zhang W. & Zhang Y. (2019). Behavior switch for drl-based robot navigation. IEEE 15th international conference on control and automation, 284-288. Zhang W. & Zhang Y. (2019). Behavior switch for drl-based robot navigation. IEEE 15th international conference on control and automation, 284-288.
Metadaten
Titel
Deep Reinforcement Learning-Based Robot Exploration for Constructing Map of Unknown Environment
verfasst von
Shih-Yeh Chen
Qi-Fong He
Chin-Feng Lai
Publikationsdatum
06.11.2021
Verlag
Springer US
Erschienen in
Information Systems Frontiers / Ausgabe 1/2024
Print ISSN: 1387-3326
Elektronische ISSN: 1572-9419
DOI
https://doi.org/10.1007/s10796-021-10218-5

Weitere Artikel der Ausgabe 1/2024

Information Systems Frontiers 1/2024 Zur Ausgabe

Premium Partner