Published in: Neural Computing and Applications 1/2014

01.01.2014 | ICONIP 2012

Robots learn to dance through interaction with humans

Authors: Qinggang Meng, Ibrahim Tholley, Paul W. H. Chung

Abstract

In this paper, we investigated an approach by which robots learn to adapt dance actions to human preferences through interaction and feedback. Preferences were extracted by analysing the action patterns that commonly coincided with positive or negative feedback from the human during robot dancing. By using a buffering technique to store the dance actions performed before each feedback signal, each individual's preferences can be extracted even when a reward is received late. The preferred dance actions extracted from different people were then combined to generate improved dance sequences, i.e. performing more of what was preferred and less of what was not preferred. The Sarsa reinforcement learning algorithm was used as the underlying learning algorithm, together with the Softmax action-selection method, to effectively control the trade-off between exploitation of the learnt dance skills and exploration of new dance actions. The results showed that the robot learnt, using interactive reinforcement learning, the preferences of its human partners, and that the dance improved as preferences were extracted from more human partners.
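
The abstract names three ingredients: Softmax action selection, the Sarsa update, and a buffer of recent dance actions that receives credit when feedback arrives late. The following is a minimal sketch of how these could fit together, not the authors' implementation. The function names, the table layout of `Q`, the default `alpha` and `gamma`, and the rule that every buffered action shares the feedback equally are all assumptions made for illustration; the abstract does not specify them.

```python
import math
import random
from collections import deque


def softmax_probs(q_values, temperature):
    """Boltzmann (Softmax) probabilities over a list of action values.

    A high temperature gives near-uniform exploration; a low temperature
    concentrates probability on the highest-valued action.
    """
    m = max(q_values)  # subtract the max for numerical stability
    exps = [math.exp((q - m) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def select_action(q_values, temperature, rng):
    """Sample an action index according to the Softmax probabilities."""
    probs = softmax_probs(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy Sarsa update: Q(s,a) += alpha * (r + gamma*Q(s',a') - Q(s,a)).

    Q is assumed to be a list of lists indexed as Q[state][action].
    """
    Q[s][a] += alpha * (r + gamma * Q[s_next][a_next] - Q[s][a])


def credit_buffer(scores, buffer, feedback):
    """Assumed credit rule: every buffered action receives the late feedback.

    scores[action] accumulates positive and negative feedback per action,
    and the buffer is emptied once the feedback has been distributed.
    """
    for action in buffer:
        scores[action] += feedback
    buffer.clear()
```

In use, one might keep `recent = deque(maxlen=5)` (the buffer length is hypothetical), append each action returned by `select_action(Q[state], temperature, random.Random())`, and call `credit_buffer` whenever the human partner gives a positive (+1) or negative (-1) signal.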

Metadata
Title
Robots learn to dance through interaction with humans
Authors
Qinggang Meng
Ibrahim Tholley
Paul W. H. Chung
Publication date
01.01.2014
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 1/2014
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-013-1504-x
