2016 | Original Paper | Book Chapter

10. Prosody Enhances Cognitive Infocommunication: Materials from the HuComTech Corpus

Authors: Laszlo Hunyadi, István Szekrényes, Hermina Kiss

Published in: Toward Robotic Socially Believable Behaving Systems - Volume I

Publisher: Springer International Publishing

Abstract

The multimodal HuComTech corpus aims at annotating, studying and publishing data on a wide spectrum of markers of human behavior in human-human spoken dialogues. The ultimate goal is both to understand human cognitive behavior in conversational settings and to contribute to the enhancement of human-machine interaction systems. One area that still leaves considerable room for further development is speech prosody, in particular its association with the cognitive processes that may underlie the expression of emotions and the online production of speech utterances. Since online production often results in incomplete structures, studying the relation between grammatical incompleteness and prosody can contribute both to a better understanding of human cognition and to the enhancement of cognitive infocommunication systems. The data and analyses presented in this paper are intended to serve both purposes. Two approaches to data exploration are presented: the study of static temporal alignments within the ELAN annotation tool, and the discovery of dynamic temporal patterns using the Theme framework.

Metadata
Title
Prosody Enhances Cognitive Infocommunication: Materials from the HuComTech Corpus
Authors
Laszlo Hunyadi
István Szekrényes
Hermina Kiss
Copyright year
2016
DOI
https://doi.org/10.1007/978-3-319-31056-5_10
