nach oben

Erschienen in:

2017 | OriginalPaper | Buchkapitel

8. Intelligent Human–Robot Interaction Systems Using Reinforcement Learning and Neural Networks

verfasst von : Hamidreza Modares, Isura Ranatunga, Bakur AlQaudi, Frank L. Lewis, Dan O. Popa

Erschienen in: Trends in Control and Decision-Making for Human–Robot Collaboration Systems

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

In this chapter, an intelligent human–robot system with adjustable robot autonomy is presented to assist the human operator to perform a given task with minimum workload demands and optimal performance. The proposed control methodology consists of two feedback loops: an inner loop that makes the robot with unknown dynamics behave like a prescribed impedance model as perceived by the operator, and an outer loop that finds the optimal parameters of this model to adjust the robot’s dynamics to the operator skills and minimize the tracking error. A nonlinear robust controller using neural networks is used in the inner loop to make the nonlinear unknown robot dynamics behave like a prescribed impedance model. The problem of finding the optimal parameters of the prescribed impedance model is formulated as an optimal control problem in the outer loop. The objective is to minimize the human effort and optimize the closed-loop behavior of the human–machine system for a given task. This design must take into account the unknown human dynamics as well as the desired overall performance of the human–robot system, which depends on the task. To obviate the requirement of the knowledge of the human model, reinforcement learning is used to learn the solution to the given optimal control problem online in real time.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Assistive Optimal Control-on-Request with Application in Standing Balance Therapy and Reinforcement

Nächstes Kapitel Regret-Based Allocation of Autonomy in Shared Visual DetectionShared visual detection for Human–Robot Collaborative Assembly in Manufacturing

Baron S, Kleinman DL, Levison WH (1970) An optimal control model of human response. part ii: prediction of human performance in a complex task. Automaica 6:371–383CrossRef

Bertsekas DP (2012) Dynamic programming and optimal control: approximate dynamic programming, 4th edn. Athena Scientific, MassachusettsMATH

Duchaine V, Gosselin C (2009) Safe, stable and intuitive control for physical human-robot interaction. In: Proceedings of the IEEE international conference on robotics and automation. Kobe, Japan, pp 3383–3388

Frankin S, Wolpert DM, Franklin DM (2012) Visuomotor feedback gains upregulate during the learning of novel dynamics. J Neurophysiol 108:467–478CrossRef

Furuta K, Kado Y, Shiratori S (2006) Assisting control in human adaptive mechatronics-single ball juggling. In: Proceedings of the IEEE international conference on control applications. Munich, Germany, pp 545–550

Ge SS, Hang CC, Woon LC, Chen XQ (1998) Impedance control of robot manipulators using adaptive neural networks. Int J Intell Control Syst 2:433–452MathSciNet

Ge SS, Harris CJ (1998) Adaptive neural network control of robotic manipulators. World Scientific, SingaporeCrossRef

Ge SS, Lee TH, Wang ZP (2001) Adaptive neural network control for smart materials robots using singular perturbation technique. Asian J Control 3:143–155CrossRef

Gribovskaya E, Kheddar A, Billard A (2011) Motion learning and adaptive impedance for robot control during physical interaction with humans. In: IEEE international conference on robotics and automation. Shanghai, China, pp 4326–4333

10.

Hogan N (1985) Impedance control: an approach to manipulation. i: theory. ii: implementation. iii: applications. ASME Trans J Dyn Syst Meas Control 107:1–24CrossRefMATH

11.

Huang L, Ge SS, Lee TH (2002) Neural network based adaptive impedance control of constrained robots. In: Proceedings of the IEEE international symposium on intelligent control. Vancouver, Canada, pp 615–619

12.

Hussain S, Xie SQ, Jamwal PK (2013) Adaptive impedance control of a robotic orthosis for gait rehabilitation. IEEE Trans Cybern 43:1025–1034CrossRef

13.

Ikeura R, Moriguchi T, Mizutani K (2002) Optimal variable impedance control for a robot and its application to lifting an object with a human. In: Proceedings of the 11th IEEE international workshop robot-human interactive communication, Berlin, Germany, pp 500–505

14.

Jung S, Hsia TC (1998) Neural network impedance force control of robot manipulator. IEEE Trans Ind Electron 45:451–461CrossRef

15.

Kosuge K, Furuta K, Yokoyama T (1987) Virtual internal model following control of robot arms. In: Proceedings of the IEEE international conference on robotics and automation, pp 1549–1554

16.

Kurihara K, Suzuki S, Harashima F, Furuta K (2004) Human adaptive mechatronics (HAM) for haptic system. In: Proceedings of the 30th IEEE Annual Conference of the Industrial Electronics, Busan, Korea, pp 647–652

17.

Lewis FL, Dawson DM, Abdallah CT (2003) Robot manipulator control: theory and practice, 2nd edn. CRC Press, Florida

18.

Lewis FL, Vraibe D, Syrmos V (2012) Optimal control, 3rd edn. Wiley, New Jersey

19.

Lewis FL, Vraibe D, Vamvoudakis KG (2014) Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers. IEEE Control Syst Mag 32:76–105MathSciNetCrossRef

20.

Lewis FL, Yesildirek A (1995) Neural net robot controller with guaranteed tracking performance. IEEE Trans Neural Netw 6:703–715CrossRef

21.

Lewis FL, Yesildirek A, Liu K (1996) Multilayer neural net robot controller with guaranteed tracking performance. IEEE Trans Neural Netw 7:388–399CrossRef

22.

Li Y, Ge SS (2014) Human? robot collaboration based on motion intention estimation. IEEE/ASME Trans Mechatron 19:1007–1014CrossRef

23.

Li Y, Ge SS, Yang C (2011) Impedance control for multi-point human-robot interaction. In: Proceedings of the 8th Asian Control Conference, Kaohsiung, Taiwan, pp 1187–1192

24.

Mitsantisuk C, Ohishi K, Katsura S (2011) Variable mechanical stiffness control based on human stiffness estimation. In: Proceedings of the IEEE international conference on mechatronics. Istanbul, Turkey, pp 731–736

25.

Modares H et al (2015) Optimized assistive human? robot interaction using reinforcement learning. IEEE Trans Cybern. doi:10.1109/TCYB.2015.2412554

26.

Oh S, Woo H, Kong K (2014) Frequency-shaped impedance control for safe human-robot interaction in reference tracking application. IEEE/ASME Trans Mechatron 19:1907–1916CrossRef

27.

Powell WB (2007) Approximate dynamic programming: solving the curses of dimensionality. Wiley-Interscience, New YorkCrossRefMATH

28.

Stulp F et al (2012) Model-free reinforcement learning of impedance control in stochastic environments. IEEE Trans Auton Mental Develop 4:330–341CrossRef

29.

Sutton RS, Barto AG (1998) Reinforcement learning: an introduction. Cambridge University Press, Cambridge

30.

Suzuki S, Furuta K (2012) Adaptive impedance control to enhance human skill on a haptic interface system. J Control Sci Eng 2012:1–10MathSciNetCrossRefMATH

31.

Tsuji T, Tanaka Y (2005) Tracking control properties of human-robotic systems based on impedance control. IEEE Trans Syst Man Cybern Part A 35:523–535CrossRef

32.

Tsumugiwa T, Yokogawa R, Hara K (2002) Variable impedance control based on estimation of human arm stiffness for human-robot cooperative calligraphic task. In: Proceedings of the IEEE international conference on robotics and automation, pp 644–650

33.

Tustin A (1947) The nature of the operator’s response in manual control and its implications for controller design. J Inst Electric Eng 94:190–202

34.

Vrabie D et al (2009) Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica 45:447–484MathSciNetCrossRef

35.

Vrabie D, Lewis FL (2009) Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems. Neural Netw 22:237–246

36.

Vraibe D, Vamvoudakis KG, Lewis FL (2012) Optimal adaptive control and differential games by reinforcement learning principles, control engineering series. IET Press

37.

Wang C et al (2013) Continuous critic learning for robot control in physical human-robot interaction. In: Proceedings of the 13th international conference on control, automation and system. Gwangju, Korea, pp 833–838

38.

Xu G, Song A (2009) Adaptive impedance control based on dynamic recurrent fuzzy neural network for upper-limb rehabilitation robot. In: IEEE international conference on control, automation. Christchurch, New Zealand, pp 1376–1381

Titel: Intelligent Human–Robot Interaction Systems Using Reinforcement Learning and Neural Networks
verfasst von: Hamidreza Modares
Isura Ranatunga
Bakur AlQaudi
Frank L. Lewis
Dan O. Popa
Verlag: Springer International Publishing
Buch: Trends in Control and Decision-Making for Human–Robot Collaboration Systems
Print ISBN: 978-3-319-40532-2

Electronic ISBN: 978-3-319-40533-9

Copyright-Jahr: 2017
DOI: https://doi.org/10.1007/978-3-319-40533-9_8

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Jonas Klose/© Pine Valley Capital GmbH, Carina Kießling von der Strategieberatung Roland Berger/© Monika Walther Fotografie | ATZ, Beijing Auto Show 2024: Deutsche Hersteller wollen angreifen./© EKH-Pictures / Generated with AI / Stock.adobe.com, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.