Skip to main content

2018 | OriginalPaper | Buchkapitel

Prosodic Plot of Dialogues: A Conceptual Framework to Trace Speakers’ Role

verfasst von : Vered Silber-Varod, Anat Lerner, Oliver Jokisch

Erschienen in: Speech and Computer

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper we present a proof-of-concept study which aims to model a conceptual framework to analyze structures of dialogues. We demonstrate our approach on a specific research question – how speaker’s role is realized along the dialogue? To this end, we use a unified set of Map Task dialogues that are unique in the sense that each speaker participated twice – once as a follower and once as a leader, with the same interlocutor playing the other role. This pairwise setting enables to compare prosodic differences in three facets: Role, Speaker, and Session. For this POC, we analyze a basic set of prosodic features: Talk proportions, pitch, and intensity. To create comparable methodological framework for dialogues, we created three plots of the three prosodic features, in ten equal sized intervals along the session. We used a simple distance measure between the resulting ten-dimensional vectors of each facet for each feature. The prosodic plots of these dialogues reveal the interactions and common behaviour across each facet, on the one hand, and allow to trace potential locations of extreme prosodic values, suggesting pivot points of each facet, on the other.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Jenkins, R.: Social Identity, 4th edn. Routledge, London and New York (2014) Jenkins, R.: Social Identity, 4th edn. Routledge, London and New York (2014)
2.
Zurück zum Zitat Davies, B., Harré, R.: Positioning: the discursive production of selves. J. Theor. Soc. Behav. 20(1), 43–63 (1990)CrossRef Davies, B., Harré, R.: Positioning: the discursive production of selves. J. Theor. Soc. Behav. 20(1), 43–63 (1990)CrossRef
3.
Zurück zum Zitat Kupferberg, I., Green, D.: Troubled Talk: Metaphorical Negotiation in Problem Discourse. Mouton de Gruyter, Berlin (2005)CrossRef Kupferberg, I., Green, D.: Troubled Talk: Metaphorical Negotiation in Problem Discourse. Mouton de Gruyter, Berlin (2005)CrossRef
5.
Zurück zum Zitat Tur, G., De Mori, R.: Spoken Language Understanding: Systems for Extracting Semantic Information from Speech. Wiley, New York (2011)CrossRef Tur, G., De Mori, R.: Spoken Language Understanding: Systems for Extracting Semantic Information from Speech. Wiley, New York (2011)CrossRef
6.
Zurück zum Zitat Hori, C., Hori, T., Watanabe, S., Hershey, J.R.: Context-sensitive and role-dependent spoken language understanding using bidirectional and attention LSTMs. In: INTERSPEECH, pp. 3236–3240 (2016) Hori, C., Hori, T., Watanabe, S., Hershey, J.R.: Context-sensitive and role-dependent spoken language understanding using bidirectional and attention LSTMs. In: INTERSPEECH, pp. 3236–3240 (2016)
7.
Zurück zum Zitat Ma, W., Zhang, M., Liu, Y., Ma, S.: Multi-grained role labeling based on multi-modality information for real customer service telephone conversation. In: Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI), New York, pp. 1816–1822 (2016) Ma, W., Zhang, M., Liu, Y., Ma, S.: Multi-grained role labeling based on multi-modality information for real customer service telephone conversation. In: Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI), New York, pp. 1816–1822 (2016)
8.
Zurück zum Zitat Chen, P.C., Chi, T.C., Su, S.Y., Chen, Y.N.: Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding. arXiv preprint arXiv:1710.00165 (2017) Chen, P.C., Chi, T.C., Su, S.Y., Chen, Y.N.: Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding. arXiv preprint arXiv:​1710.​00165 (2017)
9.
Zurück zum Zitat Chi, T.C., Chen, P.C., Su, S.Y., Chen, Y.N.: Speaker Role Contextual Modeling for Language Understanding and Dialogue Policy Learning. arXiv preprint arXiv:1710.00164 (2017) Chi, T.C., Chen, P.C., Su, S.Y., Chen, Y.N.: Speaker Role Contextual Modeling for Language Understanding and Dialogue Policy Learning. arXiv preprint arXiv:​1710.​00164 (2017)
10.
Zurück zum Zitat Li, Y., et al.: Unsupervised classification of speaker roles in multi-participant conversational speech. Comput. Speech Lang. 42, 81–99 (2017)CrossRef Li, Y., et al.: Unsupervised classification of speaker roles in multi-participant conversational speech. Comput. Speech Lang. 42, 81–99 (2017)CrossRef
11.
Zurück zum Zitat Barzilay, R., Collins, M., Hirschberg, J., Whittaker, S.: The rules behind roles: identifying speaker role in radio broadcasts. In: Proceedings of the Seventeenth National Conference on Artificial Intelligence (AAAI-2000), pp. 679–684 (2000) Barzilay, R., Collins, M., Hirschberg, J., Whittaker, S.: The rules behind roles: identifying speaker role in radio broadcasts. In: Proceedings of the Seventeenth National Conference on Artificial Intelligence (AAAI-2000), pp. 679–684 (2000)
12.
Zurück zum Zitat Liu, Y.: Initial study on automatic identification of speaker role in broadcast news speech. In: Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, pp. 81–84. Association for Computational Linguistics (2006) Liu, Y.: Initial study on automatic identification of speaker role in broadcast news speech. In: Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, pp. 81–84. Association for Computational Linguistics (2006)
13.
Zurück zum Zitat Weizman, E.: Positioning in Media Dialogue: Negotiating Roles in the News Interview. John Benjamins Publishing, Amsterdam/Philadelphia (2008)CrossRef Weizman, E.: Positioning in Media Dialogue: Negotiating Roles in the News Interview. John Benjamins Publishing, Amsterdam/Philadelphia (2008)CrossRef
14.
Zurück zum Zitat Zhang, B., Hutchinson, B., Wu, W., Ostendorf, M.: Extracting phrase patterns with minimum redundancy for unsupervised speaker role classification. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 717–720 (2010) Zhang, B., Hutchinson, B., Wu, W., Ostendorf, M.: Extracting phrase patterns with minimum redundancy for unsupervised speaker role classification. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 717–720 (2010)
16.
Zurück zum Zitat Lerner, A., Silber-Varod, V., Batista, F., Moniz, H.: In search of the role’s footprints in client-therapist dialogues. In: Proceedings of Speech Prosody 2016 (SP2016), Boston, USA (2016) Lerner, A., Silber-Varod, V., Batista, F., Moniz, H.: In search of the role’s footprints in client-therapist dialogues. In: Proceedings of Speech Prosody 2016 (SP2016), Boston, USA (2016)
17.
Zurück zum Zitat Silber-Varod, V., Lerner, A., Jokisch, O.: Automatic speaker’s role classification with a bottom-up acoustic feature selection. In: Proceeding of the GLU 2017 International Workshop on Grounding Language Understanding, pp. 52–56 (2017). https://doi.org/10.21437/glu.2017-11 Silber-Varod, V., Lerner, A., Jokisch, O.: Automatic speaker’s role classification with a bottom-up acoustic feature selection. In: Proceeding of the GLU 2017 International Workshop on Grounding Language Understanding, pp. 52–56 (2017). https://​doi.​org/​10.​21437/​glu.​2017-11
19.
Zurück zum Zitat Silber-Varod, V., Lerner, A.: Analysis of silences in unbalanced dialogues: the effect of genre and role. In: Eklund, R., Rose, R. (eds.) Proceedings of DiSS 2017, The 8th Workshop on Disfluency in Spontaneous Speech, MH-QPSR, Stockholm, Sweden, vol. 58, no. 1, pp. 53–56 (2017) Silber-Varod, V., Lerner, A.: Analysis of silences in unbalanced dialogues: the effect of genre and role. In: Eklund, R., Rose, R. (eds.) Proceedings of DiSS 2017, The 8th Workshop on Disfluency in Spontaneous Speech, MH-QPSR, Stockholm, Sweden, vol. 58, no. 1, pp. 53–56 (2017)
20.
Zurück zum Zitat Weiss, B., Schoenenberg, K.: Conversational structures affecting auditory likeability. In: Proceedings of the INTERSPEECH, pp. 1791–1795 (2014) Weiss, B., Schoenenberg, K.: Conversational structures affecting auditory likeability. In: Proceedings of the INTERSPEECH, pp. 1791–1795 (2014)
21.
Zurück zum Zitat Biadsy, F., Rosenberg, A., Carlson, R., Hirschberg, J., Strangert, E.: A cross-cultural comparison of American, Palestinian, and Swedish perception of charismatic speech. In: Proceedings of the Speech Prosody, Campinas, Brazil, pp. 579–582 (2008) Biadsy, F., Rosenberg, A., Carlson, R., Hirschberg, J., Strangert, E.: A cross-cultural comparison of American, Palestinian, and Swedish perception of charismatic speech. In: Proceedings of the Speech Prosody, Campinas, Brazil, pp. 579–582 (2008)
22.
Zurück zum Zitat Anderson, H., et al.: The HCRC map task corpus. Lang. Speech 34(4), 351–366 (1991)CrossRef Anderson, H., et al.: The HCRC map task corpus. Lang. Speech 34(4), 351–366 (1991)CrossRef
25.
Zurück zum Zitat Ochs, E.: Planned and Unplanned Discourse. In: Syntax and Semantics: Vol. 12. Discourse and Syntax. Academic Press, New York (1979) Ochs, E.: Planned and Unplanned Discourse. In: Syntax and Semantics: Vol. 12. Discourse and Syntax. Academic Press, New York (1979)
27.
Zurück zum Zitat Walther, M., Neuber, B., Jokisch, O., Mellouli, T.: Towards a conversational expert system for rhetorical and vocal quality assessment in call center talks. In: Proceedings of the 6th ISCA Workshop on Speech and Language Technology in Education (SLaTE 2015), Leipzig, pp. 29–34, September 2015 Walther, M., Neuber, B., Jokisch, O., Mellouli, T.: Towards a conversational expert system for rhetorical and vocal quality assessment in call center talks. In: Proceedings of the 6th ISCA Workshop on Speech and Language Technology in Education (SLaTE 2015), Leipzig, pp. 29–34, September 2015
28.
Zurück zum Zitat Pardo, J.S.: On phonetic convergence during conversational interaction. J. Acoust. Soc. Am. 119(4), 2382–2393 (2006)CrossRef Pardo, J.S.: On phonetic convergence during conversational interaction. J. Acoust. Soc. Am. 119(4), 2382–2393 (2006)CrossRef
29.
Zurück zum Zitat Salamin, H., Vinciarelli, A., Truong, K., Mohammadi, G.: Automatic role recognition based on conversational and prosodic behaviour. In: Proceedings of the 18th ACM International Conference on Multimedia, pp. 847–850 (2010) Salamin, H., Vinciarelli, A., Truong, K., Mohammadi, G.: Automatic role recognition based on conversational and prosodic behaviour. In: Proceedings of the 18th ACM International Conference on Multimedia, pp. 847–850 (2010)
30.
Zurück zum Zitat Dufour, R., Estève, Y., Deléglise, P.: Characterizing and detecting spontaneous speech: application to speaker role recognition. Speech Commun. 56, 1–18 (2014)CrossRef Dufour, R., Estève, Y., Deléglise, P.: Characterizing and detecting spontaneous speech: application to speaker role recognition. Speech Commun. 56, 1–18 (2014)CrossRef
31.
Zurück zum Zitat Park, S.J., Yeung, G., Kreiman, J., Keating P.A., Alwan, A.: Using voice quality features to improve short-utterance, text-independent speaker verification systems. In: Proceedings of INTERSPEECH 2017, pp. 1522–1526 (2017) Park, S.J., Yeung, G., Kreiman, J., Keating P.A., Alwan, A.: Using voice quality features to improve short-utterance, text-independent speaker verification systems. In: Proceedings of INTERSPEECH 2017, pp. 1522–1526 (2017)
Metadaten
Titel
Prosodic Plot of Dialogues: A Conceptual Framework to Trace Speakers’ Role
verfasst von
Vered Silber-Varod
Anat Lerner
Oliver Jokisch
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-99579-3_65