Skip to main content
Erschienen in:
Buchtitelbild

2020 | OriginalPaper | Buchkapitel

Towards Automatic Determination of Critical Gestures for European Portuguese Sounds

verfasst von : Samuel Silva, Conceição Cunha, António Teixeira, Arun Joseph, Jens Frahm

Erschienen in: Computational Processing of the Portuguese Language

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Technologies, such as electromagnetic midsagittal articulography (EMA) and real-time magnetic resonance (RT-MRI), can contribute to improve our understanding of the static and dynamic aspects of speech, namely by providing information regarding which articulators are essential (critical) in producing specific sounds and how (gestures). Previous work has successfully demonstrated the possibility to determine critical articulators considering vocal tract data obtained from RT-MRI. However, these works have adopted a conservative approach by considering vocal tract representations analogous to the flash points obtained with EMA data, i.e., landmarks fixed over the articulators, e.g., tongue. To move towards a data-driven method able to determine gestural scores, e.g., driving articulatory speech synthesis, one important step is to move into a representation aligned with Articulatory Phonology and Task Dynamics. This article advances towards this goal by exploring critical articulators determination considering a vocal tract representation aligned with this framework is adopted and presents first results considering 50 Hz RTMRI data for two speakers of European Portuguese.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Ananthakrishnan, G., Engwall, O.: Important regions in the articulator trajectory. In: Proceedings of the ISSP, Strasbourg, France, pp. 305–308 (2008) Ananthakrishnan, G., Engwall, O.: Important regions in the articulator trajectory. In: Proceedings of the ISSP, Strasbourg, France, pp. 305–308 (2008)
2.
Zurück zum Zitat Black, M.P., et al.: Automated evaluation of non-native English pronunciation quality: combining knowledge-and data-driven features at multiple time scales. In: Proceedings of the INTERSPEECH, pp. 493–497 (2015) Black, M.P., et al.: Automated evaluation of non-native English pronunciation quality: combining knowledge-and data-driven features at multiple time scales. In: Proceedings of the INTERSPEECH, pp. 493–497 (2015)
3.
Zurück zum Zitat Browman, C.P., Goldstein, L.: Some notes on syllable structure in articulatory phonology. Phonetica 45(2–4), 140–155 (1988)CrossRef Browman, C.P., Goldstein, L.: Some notes on syllable structure in articulatory phonology. Phonetica 45(2–4), 140–155 (1988)CrossRef
4.
Zurück zum Zitat Chao, Q.: Data-driven approaches to articulatory speech processing. Ph.D. thesis, University of California, Merced (2011) Chao, Q.: Data-driven approaches to articulatory speech processing. Ph.D. thesis, University of California, Merced (2011)
5.
Zurück zum Zitat Cunha, C.: Die Organisation von Konsonantenclustern und CVC-Sequenzen in zwei portugiesischen Varietäten. Ph.D. thesis, LMU (2012) Cunha, C.: Die Organisation von Konsonantenclustern und CVC-Sequenzen in zwei portugiesischen Varietäten. Ph.D. thesis, LMU (2012)
6.
Zurück zum Zitat Cunha, C.: Portuguese lexical clusters and CVC sequences in speech perception and production. Phonetica 72(2–3), 138–161 (2015)CrossRef Cunha, C.: Portuguese lexical clusters and CVC sequences in speech perception and production. Phonetica 72(2–3), 138–161 (2015)CrossRef
7.
Zurück zum Zitat Feng, G., Castelli, E.: Some acoustic features of nasal and nasalized vowels: a target for vowel nasalization. J. Acoust. Soc. Am. 99(6), 3694–3706 (1996)CrossRef Feng, G., Castelli, E.: Some acoustic features of nasal and nasalized vowels: a target for vowel nasalization. J. Acoust. Soc. Am. 99(6), 3694–3706 (1996)CrossRef
8.
Zurück zum Zitat Goldstein, L., Byrd, D., Saltzman, E.: The role of vocal tract gestural action units in understanding the evolution of phonology. In: Arbib, M.A. (ed.) Action to Language via the Mirror Neuron System, pp. 215–249. Cambridge University Press, Cambridge (2006)CrossRef Goldstein, L., Byrd, D., Saltzman, E.: The role of vocal tract gestural action units in understanding the evolution of phonology. In: Arbib, M.A. (ed.) Action to Language via the Mirror Neuron System, pp. 215–249. Cambridge University Press, Cambridge (2006)CrossRef
12.
Zurück zum Zitat Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis, 6th edn. Pearson Prentice Hall, Upper Saddle River (2007)MATH Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis, 6th edn. Pearson Prentice Hall, Upper Saddle River (2007)MATH
14.
Zurück zum Zitat Lammert, A.C., Proctor, M.I., Narayanan, S.S., et al.: Data-driven analysis of realtime vocal tract MRI using correlated image regions. In: Proceedings of the INTERSPEECH, pp. 1572–1575 (2010) Lammert, A.C., Proctor, M.I., Narayanan, S.S., et al.: Data-driven analysis of realtime vocal tract MRI using correlated image regions. In: Proceedings of the INTERSPEECH, pp. 1572–1575 (2010)
15.
Zurück zum Zitat Marin, S., Pouplier, M.: Temporal organization of complex onsets and codas in American English: testing the predictions of a gestural coupling model. Mot. Control 14(3), 380–407 (2010)CrossRef Marin, S., Pouplier, M.: Temporal organization of complex onsets and codas in American English: testing the predictions of a gestural coupling model. Mot. Control 14(3), 380–407 (2010)CrossRef
16.
Zurück zum Zitat Martins, P., Oliveira, C., Silva, S., Teixeira, A.: Velar movement in European Portuguese nasal vowels. In: Proceedings of the IberSPEECH, pp. 231–240 (2012) Martins, P., Oliveira, C., Silva, S., Teixeira, A.: Velar movement in European Portuguese nasal vowels. In: Proceedings of the IberSPEECH, pp. 231–240 (2012)
17.
Zurück zum Zitat Oliveira, C.: From grapheme to gesture. Linguistic contributions for an articulatory based text-to-speech system. Ph.D. thesis, University of Aveiro (2009) Oliveira, C.: From grapheme to gesture. Linguistic contributions for an articulatory based text-to-speech system. Ph.D. thesis, University of Aveiro (2009)
18.
Zurück zum Zitat Oliveira, C., Teixeira, A.: On gestures timing in European Portuguese nasals. In: Proceedings of the ICPhS, Saarbrücken, Germany (2007) Oliveira, C., Teixeira, A.: On gestures timing in European Portuguese nasals. In: Proceedings of the ICPhS, Saarbrücken, Germany (2007)
19.
Zurück zum Zitat Parkinson, S.: Portuguese nasal vowels as phonological diphthongs. Lingua 61(2–3), 157–177 (1983)CrossRef Parkinson, S.: Portuguese nasal vowels as phonological diphthongs. Lingua 61(2–3), 157–177 (1983)CrossRef
22.
Zurück zum Zitat Rao, M., Seth, S., Xu, J., Chen, Y., Tagare, H., Príncipe, J.C.: A test of independence based on a generalized correlation function. Sign. Proces. 91(1), 15–27 (2011)CrossRef Rao, M., Seth, S., Xu, J., Chen, Y., Tagare, H., Príncipe, J.C.: A test of independence based on a generalized correlation function. Sign. Proces. 91(1), 15–27 (2011)CrossRef
23.
Zurück zum Zitat Saltzman, E.L., Munhall, K.G.: A dynamical approach to gestural patterning in speech production. Ecol. Psychol. 1(4), 333–382 (1989)CrossRef Saltzman, E.L., Munhall, K.G.: A dynamical approach to gestural patterning in speech production. Ecol. Psychol. 1(4), 333–382 (1989)CrossRef
25.
Zurück zum Zitat Sepulveda, A., Castellanos-Domínguez, G., Guido, R.C.: Time-frequency relevant features for critical articulators movement inference. In: Proceedings of the 20th European Signal Processing Conference (EUSIPCO), pp. 2802–2806, August 2012 Sepulveda, A., Castellanos-Domínguez, G., Guido, R.C.: Time-frequency relevant features for critical articulators movement inference. In: Proceedings of the 20th European Signal Processing Conference (EUSIPCO), pp. 2802–2806, August 2012
28.
Zurück zum Zitat Silva, S., Teixeira, A., Orvalho, V.: Articulatory-based audiovisual speech synthesis: proof of concept for European Portuguese. In: Proceedings of the IberSPEECH, Lisbon, Portugal, pp. 119–126 (2016) Silva, S., Teixeira, A., Orvalho, V.: Articulatory-based audiovisual speech synthesis: proof of concept for European Portuguese. In: Proceedings of the IberSPEECH, Lisbon, Portugal, pp. 119–126 (2016)
29.
Zurück zum Zitat Silva, S., Teixeira, A.J.: Critical articulators identification from RT-MRI of the vocal tract. In: INTERSPEECH, pp. 626–630 (2017) Silva, S., Teixeira, A.J.: Critical articulators identification from RT-MRI of the vocal tract. In: INTERSPEECH, pp. 626–630 (2017)
31.
Zurück zum Zitat Teixeira, A., Vaz, F., Príncipe, J.C.: Nasal vowels after nasal consonants. In: 5th Seminar on Speech Production: Models and Data, Kloster Seon, Alemanha, May 2000 Teixeira, A., Vaz, F., Príncipe, J.C.: Nasal vowels after nasal consonants. In: 5th Seminar on Speech Production: Models and Data, Kloster Seon, Alemanha, May 2000
32.
Zurück zum Zitat Teixeira, A., Vaz, F.: European Portuguese nasal vowels: an EMMA study. In: Proceedings of the INTERSPEECH, Aalborg, Denmark, pp. 1483–1486 (2001) Teixeira, A., Vaz, F.: European Portuguese nasal vowels: an EMMA study. In: Proceedings of the INTERSPEECH, Aalborg, Denmark, pp. 1483–1486 (2001)
Metadaten
Titel
Towards Automatic Determination of Critical Gestures for European Portuguese Sounds
verfasst von
Samuel Silva
Conceição Cunha
António Teixeira
Arun Joseph
Jens Frahm
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-41505-1_1

Premium Partner