skip to main content
research-article

Machine Learning for Social Multiparty Human--Robot Interaction

Published:14 October 2014Publication History
Skip Abstract Section

Abstract

We describe a variety of machine-learning techniques that are being applied to social multiuser human--robot interaction using a robot bartender in our scenario. We first present a data-driven approach to social state recognition based on supervised learning. We then describe an approach to social skills execution—that is, action selection for generating socially appropriate robot behavior—which is based on reinforcement learning, using a data-driven simulation of multiple users to train execution policies for social skills. Next, we describe how these components for social state recognition and skills execution have been integrated into an end-to-end robot bartender system, and we discuss the results of a user evaluation. Finally, we present an alternative unsupervised learning framework that combines social state recognition and social skills execution based on hierarchical Dirichlet processes and an infinite POMDP interaction manager. The models make use of data from both human--human interactions collected in a number of German bars and human--robot interactions recorded in the evaluation of an initial version of the system.

References

  1. David W. Aha, Dennis Kibler, and Marc K. Albert. 1991. Instance-based learning algorithms. Machine Learning 6 (1991), 37--66. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Haris Baltzakis, Maria Pateraki, and Panos Trahanias. 2012. Visual tracking of hands, faces and facial features of multiple persons. Machine Vision and Applications 23, 6 (2012), 1141--1157. DOI: http://dx.doi.org/10.1007/s00138-012-0409-5Google ScholarGoogle ScholarCross RefCross Ref
  3. Matthew J. Beal, Zoubin Ghahramani, and Carl Edward Rasmussen. 2002. The infinite hidden Markov model. In Advances in Neural Information Processing Systems 14.Google ScholarGoogle Scholar
  4. Dan Bohus and Eric Horvitz. 2009a. Dialog in the open world: Platform and applications. In Proceedings of the 11th International Conference on Multimodal Interfaces and the 6th Workshop on Machine Learning for Multimodal Interaction (ICMI-MLMI’09). 31--38. DOI: http://dx.doi.org/10.1145/1647314.1647323 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dan Bohus and Eric Horvitz. 2009b. Learning to predict engagement with a spoken dialog system in open-world settings. In Proceedings of the 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL’09). 244--252. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Frank Broz. 2008. Planning for Human-Robot Interaction: Representing Time and Human Intention. Ph.D. Dissertation. Carnegie Mellon University. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Chengyu Fang, Koiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria, and David Traum. 2010. Towards an ISO standard for dialogue act annotation. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC’10).Google ScholarGoogle Scholar
  8. Ginevra Castellano, Iolanda Leite, André Pereira, Carlos Martinho, Ana Paiva, and Peter W. McOwan. 2012. Detecting engagement in HRI: An exploration of social and task-based context. In Proceedings of SocialCom’12. 421--428. DOI: http://dx.doi.org/10.1109/SocialCom-PASSAT.2012.51 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Chih-Chung Chang and Chih-Jen Lin. 2011. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2, 3, Article 27 (May 2011), 27 pages. DOI: http://dx.doi.org/10.1145/1961189.1961199 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Lonnie Chrisman. 1992. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In Proceedings of the 10th National Conference on Artificial Intelligence. 183--188. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. William W. Cohen. 1995. Fast effective rule induction. In 12th International Conference on Machine Learning. Morgan Kaufmann, 115--123.Google ScholarGoogle ScholarCross RefCross Ref
  12. Heriberto Cuayáhuitl and Nina Dethlefs. 2011. Spatially-aware dialogue control using hierarchical reinforcement learning. ACM Transactions on Speech and Language Processing 7, 3 (May 2011). Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Heriberto Cuayáhuitl, Steve Renals, Oliver Lemon, and Hiroshi Shimodaira. 2010. Evaluation of a hierarchical reinforcement learning spoken dialogue system. Computer Speech and Language 24 (2010), 395--429. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Finale Doshi-Velez. 2009. The infinite partially observable Markov decision process. In Advances in Neural Information Processing Systems 22.Google ScholarGoogle Scholar
  15. Mary Ellen Foster. 2002. State of the Art Review: Multimodal Fission. Deliverable 6.1. COMIC project.Google ScholarGoogle Scholar
  16. Mary Ellen Foster, Andre Gaschler, and Manuel Giuliani. 2013. How can I help you? Comparing engagement classification strategies for a robot bartender. In Proceedings of the 15th ACM International Conference on Multimodal Interaction (ICMI’13). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Mary Ellen Foster, Andre Gaschler, Manuel Giuliani, Amy Isard, Maria Pateraki, and Ronald P. A. Petrick. 2012. Two people walk into a bar: Dynamic multi-party social interaction with a robot agent. In Proceedings of the 14th ACM International Conference on Multimodal Interaction (ICMI’12). Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, and Alan S. Willsky. 2011. A sticky HDP-HMM with application to speaker diarization. Annals of Applied Statistics 5, 2A (2011), 1020--1056.Google ScholarGoogle ScholarCross RefCross Ref
  19. Eibe Frank, Yong Wang, Stuart Inglis, Geoffrey Holmes, and Ian H. Witten. 1998. Using model trees for classification. Machine Learning 32, 1 (1998), 63--76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Mark Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H. Witten. 2009. The WEKA data mining software: An update. Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD) Explorations Newsletter 11, 1 (Nov. 2009), 10--18. DOI: http://dx.doi.org/10.1145/1656274.1656278 Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Mark A. Hall. 2000. Correlation-based feature selection for discrete and numeric class machine learning. In Proceedings of the 17th International Conference on Machine Learning (ICML’00). 359--366. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Mark A. Hall and Geoffrey Holmes. 2003. Benchmarking attribute selection techniques for discrete class data mining. IEEE Transactions on Knowledge and Data Engineering 15, 6 (2003), 1437--1447. DOI: http://dx.doi.org/10.1109/TKDE.2003.1245283 Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Michael S. Hamada, Alyson Wilson, C. Shane Reese, and Harry Martz. 2008. Bayesian Reliability. Springer.Google ScholarGoogle Scholar
  24. W. Keith Hastings. 1970. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57, 1 (1970), 97--109.Google ScholarGoogle ScholarCross RefCross Ref
  25. Chih-Wei Hsu, Chih-Chung Chang, and Chih-Jen Lin. 2010. A Practical Guide to Support Vector Classification. Technical Report. Department of Computer Science, National Taiwan University. Available at http://www.csie.ntu.edu.tw/∼cjlin/papers/guide/guide.pdf.Google ScholarGoogle Scholar
  26. Hemant Ishwaran and Lancelot F. James. 2001. Gibbs sampling methods for stick-breaking priors. Journal of the American Statistical Association 96 (2001), 161--173.Google ScholarGoogle ScholarCross RefCross Ref
  27. Hemant Ishwaran and Mahmoud Zarepour. 2002. Exact and approximate sum representations for the Dirichlet process. Canadian Journal of Statistics 3, 2 (2002), 269--283.Google ScholarGoogle ScholarCross RefCross Ref
  28. George H. John and Pat Langley. 1995. Estimating continuous distributions in Bayesian classifiers. In Proceedings of the11th Conference on Uncertainty in Artificial Intelligence. 338--345. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Matthew J. Johnson and Alan Willsky. 2010. The hierarchical Dirichlet process hidden semi-Markov model. In Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence (UAI’10).Google ScholarGoogle Scholar
  30. Simon Keizer, Mary Ellen Foster, Oliver Lemon, Andre Gaschler, and Manuel Giuliani. 2013a. Training and evaluation of an MDP model for social multi-user human-robot interaction. In Proceedings of the 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL’13).Google ScholarGoogle Scholar
  31. Simon Keizer, Pantelis Kastoris, Mary Ellen Foster, Amol Deshmukh, and Oliver Lemon. 2013b. User evaluation of a multi-user social interaction model implemented on a Nao robot. In Proceedings of the International Conference on Social Robotics (ICSR’13) Workshop on Robots in Public Spaces.Google ScholarGoogle Scholar
  32. David Klotz, Johannes Wienke, Julia Peltason, Britta Wrede, Sebastian Wrede, Vasil Khalidov, and Jean-Marc Odobez. 2011. Engagement-based multi-party dialog with a humanoid robot. In Proceedings of the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL’11). Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Ron Kohavi and George H John. 1997. Wrappers for feature subset selection. Artificial Intelligence 97, 1 (1997), 273--324. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Saskia le Cessie and Johannes C. van Houwelingen. 1992. Ridge estimators in logistic regression. Applied Statistics 41, 1 (1992), 191--201.Google ScholarGoogle ScholarCross RefCross Ref
  35. Liyuan Li, Qianli Xu, and Yeow Kee Tan. 2012. Attention-based addressee selection for service and social robots to interact with multiple persons. In Proceedings of the 5th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia: Proceedings of the Workshop at SIGGRAPH Asia (WASA’12). 131--136. DOI: http://dx.doi.org/10.1145/2425296.2425319 Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Pierre Lison. 2011. Multi-policy dialogue management. In Proceedings of the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2011). Portland, OR. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Sebastian Loth, Kerstin Huth, and Jan P. De Ruiter. 2013. Automatic detection of service initiation signals used in bars. Frontiers in Psychology 4, 557 (2013). DOI: http://dx.doi.org/10.3389/fpsyg.2013.00557Google ScholarGoogle Scholar
  38. Zachary M. MacHardy, Kenneth Syharath, and Prasun Dewan. 2012. Engagement analysis through computer vision. In Proceedings of CollaborateCom 2012. 535--539.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Sridhar Mahadevan. 1998. Partially observable semi-Markov decision processes: Theory and applications in engineering and cognitive science. In Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI) 1998 Fall Symposium: Planning with Partially Observable Markov Decision Processes.Google ScholarGoogle Scholar
  40. Tomonari Masada, Daiji Fukagawa, Atsuhiro Takasu, Yuichiro Shibata, and Kiyoshi Oguri. 2010. Modeling topical trends over continuous time with priors. In Proceedings of the 7th International Conference on Advances in Neural Networks (ISNN 2010) -- Volume Part II. 302--311. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Derek McColl and Goldie Nejat. 2012. Affect detection from body language during social HRI. In Proceedings of 21st IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN 2012). 1013--1018. DOI: http://dx.doi.org/10.1109/ROMAN.2012.6343882Google ScholarGoogle ScholarCross RefCross Ref
  42. Nicholas Metropolis, Arianna W. Rosenbluth, Marshall N. Rosenbluth, Augusta H. Teller, and Edward Teller. 1953. Equation of state calculations by fast computing machines. Journal of Chemical Physics 21 (1953), 1087--1092.Google ScholarGoogle ScholarCross RefCross Ref
  43. Eric W. Noreen. 1989. Computer-Intensive Methods for Testing Hypotheses: An Introduction. Wiley-Interscience.Google ScholarGoogle Scholar
  44. Maria Pateraki, Markos Sigalas, Georgios Chliveros, and Panos Trahanias. 2013. Visual human-robot communication in social settings. In Proceedings of ICRA Workshop on Semantics, Identification and Control of Robot-Human-Environment Interaction.Google ScholarGoogle Scholar
  45. Ronald P. A. Petrick and Mary Ellen Foster. 2013. Planning for social interaction in a robot bartender domain. In Proceedings of the International Conference on Automated Planning and Scheduling (ICAPS 2013), Special Track on Novel Applications. Rome, Italy.Google ScholarGoogle Scholar
  46. Ronald P. A. Petrick, Mary Ellen Foster, and Amy Isard. 2012. Social state recognition and knowledge-level planning for human-robot interaction in a bartender domain. In Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI) 2012 Workshop on Grounding Language for Physical Systems.Google ScholarGoogle Scholar
  47. Joelle Pineau, Nicholas Roy, and Sebastian Thrun. 2001. A hierarchical approach to POMDP planning and execution. In Proceedings of the ICML Workshop on Hierarchy and Memory in Reinforcement Learning.Google ScholarGoogle Scholar
  48. Ross Quinlan. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Verena Rieser and Oliver Lemon. 2011. Learning and evaluation of dialogue strategies for new applications: Empirical methods for optimization from small data sets. Computational Linguistics 37, 1 (2011), 153--196. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Ethan O. Selfridge, Iker Arizmendi, Peter A. Heeman, and Jason D. Williams. 2012. Integrating incremental speech recognition and POMDP-based dialogue systems. In Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDial’12). 275--279. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Jayaram Sethuraman. 1994. A constructive definition of Dirichlet priors. Statistica Sinica 4 (1994), 639--650.Google ScholarGoogle Scholar
  52. Matthijs T. J. Spaan and Nikos Vlassis. 2005. Perseus: Randomized point-based value iteration for POMDPs. Journal of Artificial Intelligence Research 24 (2005), 195--220. Google ScholarGoogle ScholarCross RefCross Ref
  53. Richard S. Sutton and Andrew G. Barto. 1998. Reinforcement Learning: An Introduction. MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Yee Whye Teh, Michael I. Jordan, Matthew J. Beal, and David M. Blei. 2006. Hierarchical Dirichlet processes. Journal of the American Statistical Association 101, 476 (2006), 1566--1581.Google ScholarGoogle ScholarCross RefCross Ref
  55. Blaise Thomson and Steve Young. 2010. Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems. Computer Speech and Language 24, 4 (2010), 562--588. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Zhuoran Wang and Oliver Lemon. 2012. A nonparametric Bayesian approach to learning multimodal interaction management. In Proceedings of the 4th IEEE Workshop on Spoken Language Technology (SLT’12). DOI: http://dx.doi.org/10.1109/SLT.2012.6424162Google ScholarGoogle ScholarCross RefCross Ref
  57. Michael White. 2006. Efficient realization of coordinate structures in combinatory categorial grammar. Research on Language and Computation 4, 1 (2006), 39--75. DOI: http://dx.doi.org/10.1007/s11168-006-9010-2Google ScholarGoogle ScholarCross RefCross Ref
  58. Jason D. Williams and Steve Young. 2007. Partially observable Markov decision processes for spoken dialog systems. Computer Speech and Language 21, 2 (2007), 393--422. Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Peter Wittenburg, Hennie Brugman, Albert Russel, Alex Klassmann, and Han Sloetjes. 2006. ELAN: A professional framework for multimodality research. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC’06).Google ScholarGoogle Scholar
  60. Steve Young, Milica Gašić, Simon Keizer, François Mairesse, Blaise Thomson, and Kai Yu. 2010. The hidden information state model: A practical framework for POMDP based spoken dialogue management. Computer Speech and Language 24, 2 (2010), 150--174. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Machine Learning for Social Multiparty Human--Robot Interaction

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in

                Full Access

                • Published in

                  cover image ACM Transactions on Interactive Intelligent Systems
                  ACM Transactions on Interactive Intelligent Systems  Volume 4, Issue 3
                  Special Issue on Multiple Modalities in Interactive Systems and Robots
                  October 2014
                  115 pages
                  ISSN:2160-6455
                  EISSN:2160-6463
                  DOI:10.1145/2660857
                  Issue’s Table of Contents

                  Copyright © 2014 ACM

                  Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 14 October 2014
                  • Accepted: 1 March 2014
                  • Revised: 1 January 2014
                  • Received: 1 March 2013
                  Published in tiis Volume 4, Issue 3

                  Permissions

                  Request permissions about this article.

                  Request Permissions

                  Check for updates

                  Qualifiers

                  • research-article
                  • Research
                  • Refereed

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader