Skip to main content
Top

2018 | OriginalPaper | Chapter

Applying Adaptive Actor-Critic Learning to Human Upper Lime Lifting Motion

Authors : Ting Wang, Ryad Chellali

Published in: Artificial Intelligence and Robotics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

An adaptive reinforcement learning method designed to facilitate the on-line lifting motion of the human forearm is here proposed. Its purpose is to use the control based on the proposed learning method to perform the lifting motion. The learning algorithm is an actor-critic learning based on the neural network that used the normalized radial basis function. The paper shows a simulation of the motion of the forearm lifting process. As shown in the results, the forearm continues to lift from a horizontal position to a vertical position. During this process, both the state space and action space are continuous.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Schaal, S., Atkeson, C.: Learning control in robotics. IEEE Robot. Autom. Mag. 17(2), 20–29 (2010)CrossRef Schaal, S., Atkeson, C.: Learning control in robotics. IEEE Robot. Autom. Mag. 17(2), 20–29 (2010)CrossRef
2.
go back to reference Schaal, S.: Is imitation learning the route to humanoid robots? Trends Cogn. Sci. 3(6), 233–242 (1999)CrossRef Schaal, S.: Is imitation learning the route to humanoid robots? Trends Cogn. Sci. 3(6), 233–242 (1999)CrossRef
3.
go back to reference Argall, B.D., Chernova, S., Veloso, M.: A survey of robot learning from demonstration. Robot. Auton. Syst. 57(5), 469–483 (2009)CrossRef Argall, B.D., Chernova, S., Veloso, M.: A survey of robot learning from demonstration. Robot. Auton. Syst. 57(5), 469–483 (2009)CrossRef
4.
go back to reference Billard, B.D., Calinon, S.: Robotic programming by demonstration. Springer Handbook of Robotics, pp. 1371–1394. Springer, Berlin (2008) Billard, B.D., Calinon, S.: Robotic programming by demonstration. Springer Handbook of Robotics, pp. 1371–1394. Springer, Berlin (2008)
5.
go back to reference Mulling, K., Kober, J., Kroemer, O.: Learning to select and generalize striking movements in robot table tennis. Int. J. Robot. Res. 32(3), 263–279 (2013)CrossRef Mulling, K., Kober, J., Kroemer, O.: Learning to select and generalize striking movements in robot table tennis. Int. J. Robot. Res. 32(3), 263–279 (2013)CrossRef
6.
go back to reference Pastor, P., Kalakrishnan, M., Chitta, S.: Skill learning and task outcome prediction for manipulation. In: IEEE International Conference on Robotics and Automation, pp. 3828–3824. IEEE, Shanghai (2011) Pastor, P., Kalakrishnan, M., Chitta, S.: Skill learning and task outcome prediction for manipulation. In: IEEE International Conference on Robotics and Automation, pp. 3828–3824. IEEE, Shanghai (2011)
7.
go back to reference Kormushev, P., Calinon, S., Caldwell, D.G.: Robot motor skill coordination with EM-based reinforcement learning. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 3232–3237. IEEE, Taipei (2010) Kormushev, P., Calinon, S., Caldwell, D.G.: Robot motor skill coordination with EM-based reinforcement learning. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 3232–3237. IEEE, Taipei (2010)
8.
go back to reference Kober, J., Peters, J.: Imitation and reinforcement learning. Robot. Autom. Mag. 17(2), 55–62 (2010)CrossRefMATH Kober, J., Peters, J.: Imitation and reinforcement learning. Robot. Autom. Mag. 17(2), 55–62 (2010)CrossRefMATH
9.
go back to reference Kormushev, P., Calinon, S., Caldwell, D.G.: Imitation learning of positional and force skills demonstrated via kinesthetic teaching and haptic input. Adv. Robot. 25(5), 581–603 (2011)CrossRef Kormushev, P., Calinon, S., Caldwell, D.G.: Imitation learning of positional and force skills demonstrated via kinesthetic teaching and haptic input. Adv. Robot. 25(5), 581–603 (2011)CrossRef
10.
go back to reference Abbeel, P., Coates A.: Autonomous helicopter aerobatics through apprentices ship learning. Int. J. Robot. Res. 1–31 (2010) Abbeel, P., Coates A.: Autonomous helicopter aerobatics through apprentices ship learning. Int. J. Robot. Res. 1–31 (2010)
11.
go back to reference Lee, K., Su, Y., Kim, T.K.: A syntactic approach to robot imitation learning using probabilistic activity grammars. Robot. Autom. Syst. 61(12), 1323–1334 (2013)CrossRef Lee, K., Su, Y., Kim, T.K.: A syntactic approach to robot imitation learning using probabilistic activity grammars. Robot. Autom. Syst. 61(12), 1323–1334 (2013)CrossRef
12.
go back to reference Blana, D., Kirsch, R.F., Chadwick, E.K.: Combined feed forward and feedback control of a redundant, nonlinear, dynamic musculoskeletal system. Med. Biol. Eng. Comput. 47, 533–542 (2009)CrossRef Blana, D., Kirsch, R.F., Chadwick, E.K.: Combined feed forward and feedback control of a redundant, nonlinear, dynamic musculoskeletal system. Med. Biol. Eng. Comput. 47, 533–542 (2009)CrossRef
13.
go back to reference Jagodnik, K., van den Bogert, A.: A proportional derivative FES controller for planar arm movement. In: 12th Annual Conference International FES Society, Philadelphia, PA (2007) Jagodnik, K., van den Bogert, A.: A proportional derivative FES controller for planar arm movement. In: 12th Annual Conference International FES Society, Philadelphia, PA (2007)
14.
go back to reference Thomas, P., Branicky, M., van den Bogert, A., Jagodnik, K.: Application of the actor-critic architecture to functional electrical stimulation control of a human arm. In: Proceeding Innovations Applied Artificial Intelligence Conference, pp. 165–172 (2009) Thomas, P., Branicky, M., van den Bogert, A., Jagodnik, K.: Application of the actor-critic architecture to functional electrical stimulation control of a human arm. In: Proceeding Innovations Applied Artificial Intelligence Conference, pp. 165–172 (2009)
15.
go back to reference Philip, S.T., Andrew, G.B.: Motor primitive discovery. In: Proceedings of the IEEE Conference on Development and Learning and Epigenetic Robotics (2012) Philip, S.T., Andrew, G.B.: Motor primitive discovery. In: Proceedings of the IEEE Conference on Development and Learning and Epigenetic Robotics (2012)
16.
go back to reference Lu, H., Li, Y., Uemura, T., Ge, Z., Xu, X.: FDCNet: Filtering deep convolution network for marine organism classification. In: Multimedia Tools and Applications, pp. 1–14 (2017) Lu, H., Li, Y., Uemura, T., Ge, Z., Xu, X.: FDCNet: Filtering deep convolution network for marine organism classification. In: Multimedia Tools and Applications, pp. 1–14 (2017)
17.
go back to reference He, L., Xing, X., Humin, L., Yang, Y., Fumin, S., Yuxuan, Z., Xing, X., Tao, H.: Unsupervised cross-model retrieval through adversarial learning. In: Proceeding of 2017 IEEE International Conference on Multimedia and Expo (2017) He, L., Xing, X., Humin, L., Yang, Y., Fumin, S., Yuxuan, Z., Xing, X., Tao, H.: Unsupervised cross-model retrieval through adversarial learning. In: Proceeding of 2017 IEEE International Conference on Multimedia and Expo (2017)
18.
go back to reference Xu, X., He, L., Shimada, A., Taniguchi, R.I., Lu, H.: Learning unified binary codes for cross-modal retrieval via latent semantic hashing. In: Neurocomputing, 213, 191–203 (2016) Xu, X., He, L., Shimada, A., Taniguchi, R.I., Lu, H.: Learning unified binary codes for cross-modal retrieval via latent semantic hashing. In: Neurocomputing, 213, 191–203 (2016)
19.
go back to reference Humin, L., Yujie, L., Min, Chen., Hyoungseop, Kim., Serikawa, S.: Brain intelligence: go beyond artificial intelligence. In: Mobile Networks Application, pp. 1–10 (2017) Humin, L., Yujie, L., Min, Chen., Hyoungseop, Kim., Serikawa, S.: Brain intelligence: go beyond artificial intelligence. In: Mobile Networks Application, pp. 1–10 (2017)
20.
go back to reference Serikawa, S., Lu, H.: Underwater image dehazing using joint trilateral filter. Comput. Electr. Eng. 40(1), 41–50 (2014)CrossRef Serikawa, S., Lu, H.: Underwater image dehazing using joint trilateral filter. Comput. Electr. Eng. 40(1), 41–50 (2014)CrossRef
21.
go back to reference McLean, S.G., Su, A., van den Bogert, A.J.: Development and validation of a 3-D model to predict knee joint loading during dynamic movement. J. Biomech. Eng. 125(6), 864–874 (2003)CrossRef McLean, S.G., Su, A., van den Bogert, A.J.: Development and validation of a 3-D model to predict knee joint loading during dynamic movement. J. Biomech. Eng. 125(6), 864–874 (2003)CrossRef
Metadata
Title
Applying Adaptive Actor-Critic Learning to Human Upper Lime Lifting Motion
Authors
Ting Wang
Ryad Chellali
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-69877-9_6

Premium Partner