nach oben

Erschienen in:

2016 | OriginalPaper | Buchkapitel

5. Emotional Speech Recognition

verfasst von : Swati Johar

Erschienen in: Emotion, Affect and Personality in Speech

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Recent years have been marked by a growing need for systems that can grasp human emotions and in particular, recognize emotions. Emotions lie at the centre of any social communication and form the basis for an intelligent and meaningful interaction. The chapter further discusses the acoustic correlates of emotions and describes various techniques and developments imperative to support speech interfaces that recognize emotional expressions in real world settings. Significant advancement in the areas of knowledge representation, infrastructure requirements and algorithm implementation is a prerequisite for modeling effective future speech recognition systems that are more robust and dynamic in nature.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel Multimodality and Spoken Dialogue Systems

Nächstes Kapitel Where Speech Recognition Is Going: Conclusion and Future Scope

Reeves B, Nass C (1996) The media equation: how people treat computers, television and new media like real people and places. Cambridge University Press, Cambridge

Flanagan JL (1972) Speech analysis, synthesis, and perception, 2nd edn. Springer, New YorkCrossRef

Scherer KR (2003) Vocal communication of emotion: a review of research paradigms. Speech Commun 40:227–256CrossRefMATH

Potamianos G, Neti C, Gravier G, Garg A, Senior A (2003) Recent advances in the automatic recognition of audiovisual speech. Proc IEEE 91(9):1306–1326CrossRef

Scherer K (1996) Adding the affective dimension: a new look in speech analysis and synthesis. In: Proceeding of international conference on spoken language processing (ICSLP 1996), pp 1808–1811

Chen LS, Tao H, Huang TS, Miyasato T, Nakatsu R (1998) Emotion recognition from audiovisual information. In Proceedings of IEEE workshop on multimedia signal processing, Los Angeles, CA, pp 83–88, 7–9 Dec 1998

De Silva L, Ng P (2000) Bimodal emotion recognition. In: Proceedings of automatic face and gesture recognition, 2000, pp 332–335

Schneiderman B (1993) Human values and the future of technology: a declaration of responsibility. In: Schneiderman B (ed) Sparks of innovation in human-computer interaction, Ablex Publ, 1(1), Jan 1994, pp 67–71 (ACM Interactions )

Baker J, Deng L, Glass J, Khudanpur S, Lee C, Morgan N, O’Shaughnessy D (2009) Developments and directions in speech recognition and understanding, Part 1 [DSP Education]. IEEE Signal Process Mag 26(3):75–80CrossRef

10.

Morgan N, Zhu Q, Stolcke A, Sonmez K, Sivadas S, Shinozaki T, Ostendorf M, Jain P, Hermansky H, Ellis D, Doddington G, Chen B, Cetin O, Bourlard H, Athineos M (2005) Pushing the envelope—aside. IEEE Signal Process Mag 22(5):81–88CrossRef

11.

Olukotun K (2006) A conversation with John Hennessy and David Patterson. ACM Queue Mag 4(10):14–22CrossRef

12.

Klein D (2005) The unsupervised learning of natural language structure. PhD thesis, Stanford University

13.

Park A (2006) unsupervised pattern discovery in speech: applications to word acquisition and speaker segmentation. PhD thesis, MIT

14.

Venkataraman A (2001) A statistical model for word discovery in transcribed speech. Comput Linguist 27(3):352–372CrossRef

15.

Rosenberg AE, Lee CH, Soong FK (1994) Cepstral channel normalization techniques for HMM-based speaker verification. In: Proceedings of the IEEE international conference on acoustics, speech and signal processing, 1994, pp 1835–1838

Titel: Emotional Speech Recognition
verfasst von: Swati Johar
Verlag: Springer International Publishing
Buch: Emotion, Affect and Personality in Speech
Print ISBN: 978-3-319-28045-5

Electronic ISBN: 978-3-319-28047-9

Copyright-Jahr: 2016
DOI: https://doi.org/10.1007/978-3-319-28047-9_5

Neuer Inhalt

Bildnachweise

VDI-Icon, Profil Icon, inhalt2, Springer Professional Modul/© Springer Fachmedien Wiesbaden GmbH, Nachhaltigkeitsaward Key Visual/© Cometis AG/Global ESG Monitor | Daniel Rupp | Generiert mit KI, Search Icon, Banner Hanser, Jonas Klose/© Pine Valley Capital GmbH, Carina Kießling von der Strategieberatung Roland Berger/© Monika Walther Fotografie | ATZ, Beijing Auto Show 2024: Deutsche Hersteller wollen angreifen./© EKH-Pictures / Generated with AI / Stock.adobe.com, Zeitschrift Wissensmanagement Cover, PatentFit-Logo/© Springer Fachmedien Wiesbaden GmbH, Zukunftswerkstatt Sales Excellence 2024/© AndreyPopov / Getty Images / iStock, 2023_Antrieb/© supervisuell, ATZ-Webinar: Prototypenfreie Entwicklung durch Offline- und Driver-in-the-Loop-HiL-Tests /© (c) VI-grade

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Neuer Inhalt

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.