
16-10-2019 | Original Paper

Analysis of conversational listening skills toward agent-based social skills training

Authors: Hiroki Tanaka, Hidemi Iwasaka, Hideki Negoro, Satoshi Nakamura

Published in: Journal on Multimodal User Interfaces

Abstract

Listening skills are critical for human communication. Social skills training (SST), conducted by human trainers, is a well-established method for acquiring appropriate skills in social interaction. Previous work automated SST by developing a dialogue system that teaches speaking skills through interaction with a computer agent. That work considered only speaking skills, although the SST framework also covers other skills, such as listening, asking questions, and expressing discomfort. In this paper, we extend our automated social skills training by considering users’ listening skills during conversations with computer agents. We prepared two scenarios, Listening 1 and Listening 2, which assume small talk and job training, respectively. A female agent told the participants a recent story and explained how to make a telephone call, and the participants listened. We recorded data from 27 Japanese graduate students who interacted with the agent. Two expert external raters assessed the participants’ listening skills. We manually extracted features that might be related to the participants’ eye fixation and behavioral cues and confirmed that a simple linear regression with selected features predicted listening skills with a correlation coefficient above 0.50 in both scenarios. The numbers of nods and backchannels within the utterances contribute to the predictions: using only these two features still predicted listening skills with a correlation coefficient above 0.43. Since these two features are easy for users to understand, we plan to integrate them into the framework of automated social skills training.
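
To make the evaluation idea in the abstract concrete, the following is a minimal sketch of fitting a linear regression on two behavioral counts (nods and backchannels) and scoring it by the correlation between predicted and rated listening skill. Everything specific here is an assumption for illustration: the data are synthetic, the function names are hypothetical, and the leave-one-out protocol is one plausible choice that the abstract does not confirm.

```python
import numpy as np


def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns [bias, w1, w2, ...]."""
    A = np.column_stack([np.ones(len(X)), X])
    w, *_ = np.linalg.lstsq(A, y, rcond=None)
    return w


def predict(w, X):
    """Apply fitted weights to a feature matrix."""
    A = np.column_stack([np.ones(len(X)), X])
    return A @ w


def pearson_r(a, b):
    """Pearson correlation coefficient between two 1-D arrays."""
    return float(np.corrcoef(a, b)[0, 1])


def loo_predictions(X, y):
    """Leave-one-out predictions (assumed protocol, not taken from the paper)."""
    n = len(y)
    preds = np.empty(n)
    for i in range(n):
        mask = np.arange(n) != i          # hold out participant i
        w = fit_linear(X[mask], y[mask])  # fit on the remaining participants
        preds[i] = predict(w, X[i:i + 1])[0]
    return preds


# Synthetic stand-in data: 27 participants, two hypothetical features.
rng = np.random.default_rng(0)
n_participants = 27
nods = rng.integers(0, 30, size=n_participants)
backchannels = rng.integers(0, 20, size=n_participants)
X = np.column_stack([nods, backchannels]).astype(float)

# Hypothetical rater scores: a noisy linear function of the two counts.
ratings = 2.0 + 0.08 * nods + 0.05 * backchannels + rng.normal(0, 0.5, n_participants)

preds = loo_predictions(X, ratings)
r = pearson_r(preds, ratings)
print(f"correlation between predicted and rated skill: {r:.2f}")
```

A model this small keeps the learned weights directly interpretable, which fits the abstract's point that nods and backchannels are features users can readily understand.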

Metadata
Title
Analysis of conversational listening skills toward agent-based social skills training
Authors
Hiroki Tanaka
Hidemi Iwasaka
Hideki Negoro
Satoshi Nakamura
Publication date
16-10-2019
Publisher
Springer International Publishing
Published in
Journal on Multimodal User Interfaces
Print ISSN: 1783-7677
Electronic ISSN: 1783-8738
DOI
https://doi.org/10.1007/s12193-019-00313-y
