nach oben

Erschienen in:

2018 | OriginalPaper | Buchkapitel

Prosodic Plot of Dialogues: A Conceptual Framework to Trace Speakers’ Role

verfasst von : Vered Silber-Varod, Anat Lerner, Oliver Jokisch

Erschienen in: Speech and Computer

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

In this paper we present a proof-of-concept study which aims to model a conceptual framework to analyze structures of dialogues. We demonstrate our approach on a specific research question – how speaker’s role is realized along the dialogue? To this end, we use a unified set of Map Task dialogues that are unique in the sense that each speaker participated twice – once as a follower and once as a leader, with the same interlocutor playing the other role. This pairwise setting enables to compare prosodic differences in three facets: Role, Speaker, and Session. For this POC, we analyze a basic set of prosodic features: Talk proportions, pitch, and intensity. To create comparable methodological framework for dialogues, we created three plots of the three prosodic features, in ten equal sized intervals along the session. We used a simple distance measure between the resulting ten-dimensional vectors of each facet for each feature. The prosodic plots of these dialogues reveal the interactions and common behaviour across each facet, on the one hand, and allow to trace potential locations of extreme prosodic values, suggesting pivot points of each facet, on the other.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Utilizing Psychoacoustic Modeling to Improve Speech-Based Emotion Recognition

Nächstes Kapitel Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition

Jenkins, R.: Social Identity, 4th edn. Routledge, London and New York (2014)

Davies, B., Harré, R.: Positioning: the discursive production of selves. J. Theor. Soc. Behav. 20(1), 43–63 (1990)CrossRef

Kupferberg, I., Green, D.: Troubled Talk: Metaphorical Negotiation in Problem Discourse. Mouton de Gruyter, Berlin (2005)CrossRef

Heritage, J., Clayman, S.: Talk in Action: Interactions Identities and Institutions. Wiley Online Library, Oxford (2010). https://doi.org/10.1002/9781444318135CrossRef

Tur, G., De Mori, R.: Spoken Language Understanding: Systems for Extracting Semantic Information from Speech. Wiley, New York (2011)CrossRef

Hori, C., Hori, T., Watanabe, S., Hershey, J.R.: Context-sensitive and role-dependent spoken language understanding using bidirectional and attention LSTMs. In: INTERSPEECH, pp. 3236–3240 (2016)

Ma, W., Zhang, M., Liu, Y., Ma, S.: Multi-grained role labeling based on multi-modality information for real customer service telephone conversation. In: Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI), New York, pp. 1816–1822 (2016)

Chen, P.C., Chi, T.C., Su, S.Y., Chen, Y.N.: Dynamic Time-Aware Attention to Speaker Roles and Contexts for Spoken Language Understanding. arXiv preprint arXiv:1710.00165 (2017)

Chi, T.C., Chen, P.C., Su, S.Y., Chen, Y.N.: Speaker Role Contextual Modeling for Language Understanding and Dialogue Policy Learning. arXiv preprint arXiv:1710.00164 (2017)

10.

Li, Y., et al.: Unsupervised classification of speaker roles in multi-participant conversational speech. Comput. Speech Lang. 42, 81–99 (2017)CrossRef

11.

Barzilay, R., Collins, M., Hirschberg, J., Whittaker, S.: The rules behind roles: identifying speaker role in radio broadcasts. In: Proceedings of the Seventeenth National Conference on Artificial Intelligence (AAAI-2000), pp. 679–684 (2000)

12.

Liu, Y.: Initial study on automatic identification of speaker role in broadcast news speech. In: Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers, pp. 81–84. Association for Computational Linguistics (2006)

13.

Weizman, E.: Positioning in Media Dialogue: Negotiating Roles in the News Interview. John Benjamins Publishing, Amsterdam/Philadelphia (2008)CrossRef

14.

Zhang, B., Hutchinson, B., Wu, W., Ostendorf, M.: Extracting phrase patterns with minimum redundancy for unsupervised speaker role classification. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 717–720 (2010)

15.

Rienks, R., Heylen, D.: Dominance detection in meetings using easily obtainable features. In: Renals, S., Bengio, S. (eds.) MLMI 2005. LNCS, vol. 3869, pp. 76–86. Springer, Heidelberg (2006). https://doi.org/10.1007/11677482_7CrossRef

16.

Lerner, A., Silber-Varod, V., Batista, F., Moniz, H.: In search of the role’s footprints in client-therapist dialogues. In: Proceedings of Speech Prosody 2016 (SP2016), Boston, USA (2016)

17.

Silber-Varod, V., Lerner, A., Jokisch, O.: Automatic speaker’s role classification with a bottom-up acoustic feature selection. In: Proceeding of the GLU 2017 International Workshop on Grounding Language Understanding, pp. 52–56 (2017). https://doi.org/10.21437/glu.2017-11

18.

Eyben, F., Wöllmer, M., Schuller, B.: OpenSMILE: the munich versatile and fast open-source audio feature extractor. Proceedings of the 18th ACM International Conference on Multimedia, pp. 1459–1462 (2010). https://doi.org/10.1145/1873951.1874246

19.

Silber-Varod, V., Lerner, A.: Analysis of silences in unbalanced dialogues: the effect of genre and role. In: Eklund, R., Rose, R. (eds.) Proceedings of DiSS 2017, The 8th Workshop on Disfluency in Spontaneous Speech, MH-QPSR, Stockholm, Sweden, vol. 58, no. 1, pp. 53–56 (2017)

20.

Weiss, B., Schoenenberg, K.: Conversational structures affecting auditory likeability. In: Proceedings of the INTERSPEECH, pp. 1791–1795 (2014)

21.

Biadsy, F., Rosenberg, A., Carlson, R., Hirschberg, J., Strangert, E.: A cross-cultural comparison of American, Palestinian, and Swedish perception of charismatic speech. In: Proceedings of the Speech Prosody, Campinas, Brazil, pp. 579–582 (2008)

22.

Anderson, H., et al.: The HCRC map task corpus. Lang. Speech 34(4), 351–366 (1991)CrossRef

23.

Carletta, J., Isard, A., Kowtko, J., Doherty-Sneddon, G.: HCRC dialogue structure coding manual. Human Communication Research Centre (1996). http://www.lancaster.ac.uk/fass/projects/eagles/maptask.htm

24.

The Map Task Corpus of the Open University of Israel (MaTaCOp). http://www.openu.ac.il/en/academicstudies/matacop/pages/default.aspx

25.

Ochs, E.: Planned and Unplanned Discourse. In: Syntax and Semantics: Vol. 12. Discourse and Syntax. Academic Press, New York (1979)

26.

Boersma, P., Weenink, D.: Praat: doing phonetics by computer [Computer program]. Version 6.0.35. http://www.praat.org/. Accessed 16 Oct 2017

27.

Walther, M., Neuber, B., Jokisch, O., Mellouli, T.: Towards a conversational expert system for rhetorical and vocal quality assessment in call center talks. In: Proceedings of the 6th ISCA Workshop on Speech and Language Technology in Education (SLaTE 2015), Leipzig, pp. 29–34, September 2015

28.

Pardo, J.S.: On phonetic convergence during conversational interaction. J. Acoust. Soc. Am. 119(4), 2382–2393 (2006)CrossRef

29.

Salamin, H., Vinciarelli, A., Truong, K., Mohammadi, G.: Automatic role recognition based on conversational and prosodic behaviour. In: Proceedings of the 18th ACM International Conference on Multimedia, pp. 847–850 (2010)

30.

Dufour, R., Estève, Y., Deléglise, P.: Characterizing and detecting spontaneous speech: application to speaker role recognition. Speech Commun. 56, 1–18 (2014)CrossRef

31.

Park, S.J., Yeung, G., Kreiman, J., Keating P.A., Alwan, A.: Using voice quality features to improve short-utterance, text-independent speaker verification systems. In: Proceedings of INTERSPEECH 2017, pp. 1522–1526 (2017)

Titel: Prosodic Plot of Dialogues: A Conceptual Framework to Trace Speakers’ Role
verfasst von: Vered Silber-Varod
Anat Lerner
Oliver Jokisch
Verlag: Springer International Publishing
Buch: Speech and Computer
Print ISBN: 978-3-319-99578-6

Electronic ISBN: 978-3-319-99579-3

Copyright-Jahr: 2018
DOI: https://doi.org/10.1007/978-3-319-99579-3_65

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"