Skip to main content
Top

2020 | OriginalPaper | Chapter

Measuring Periodicity Perturbations in Pathological Voice: General-Purpose Software vs. Custom-Tailored Methods

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The acoustic measurement of the severity of the symptoms present in pathological voice is an active research area, for being inexpensive and non invasive. Periodicity perturbations are among the most frequently used, requiring the previous extraction of the individual glottal pulse boundaries. In this paper we explore the performances of methods for detecting glottal pulse boundaries as implemented by freely available software (Praat, intended for phonetic studies) vs. a research-grade pulse cycle detector (reported as a super-resolution method). We compare the sequences of pulse markers as obtained by two of Praat’s internal implementations and the super-resolution method against the hand-marked reference sequence in a dataset of pathological sustained vowels from a well-known database. A group of performance measures is extracted from this comparison, using a Dynamic-Time Warping alignment procedure. The measures obtained show the pros and cons of each alternative. Researchers and clinicians must be aware of the benefits of selecting either approach.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Baken, R.J., Orlikoff, R.F.: Clinical Measurement of Speech and Voice, 2nd edn. Cengage Learning (2000) Baken, R.J., Orlikoff, R.F.: Clinical Measurement of Speech and Voice, 2nd edn. Cengage Learning (2000)
2.
go back to reference Kreiman, J., Gerratt, B.R.: Measuring vocal quality. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement, pp. 73–112. Singular, San Diego (2000) Kreiman, J., Gerratt, B.R.: Measuring vocal quality. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement, pp. 73–112. Singular, San Diego (2000)
3.
go back to reference Dejonckere, P.H., et al.: A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques: Guideline elaborated by the Committee on Phoniatrics of the European Laryngolo. Eur. Arch. Oto-Rhino-Laryngology 258(2), 77–82 (2001)CrossRef Dejonckere, P.H., et al.: A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques: Guideline elaborated by the Committee on Phoniatrics of the European Laryngolo. Eur. Arch. Oto-Rhino-Laryngology 258(2), 77–82 (2001)CrossRef
4.
go back to reference Dejonckere, P.H.: Assessment of voice and respiratory function. In: Remacle, M., Eckel, E.H. (eds.) Surgery of Larynx and Trachea, pp. 11–26. Springer, Heidelberg (2010)CrossRef Dejonckere, P.H.: Assessment of voice and respiratory function. In: Remacle, M., Eckel, E.H. (eds.) Surgery of Larynx and Trachea, pp. 11–26. Springer, Heidelberg (2010)CrossRef
5.
go back to reference Buder, E.H.: Acoustic analysis of voice quality: a tabulation of algorithms 1902–1990. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement, pp. 119–244. Singular, San Diego (2000) Buder, E.H.: Acoustic analysis of voice quality: a tabulation of algorithms 1902–1990. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement, pp. 119–244. Singular, San Diego (2000)
6.
go back to reference Ng, M.L., Wong, J.: Voice onset time characteristics of esophageal, tracheoesophageal, and laryngeal speech of cantonese. J. Speech Lang. Hear. Res. 52, 780–789 (2009)CrossRef Ng, M.L., Wong, J.: Voice onset time characteristics of esophageal, tracheoesophageal, and laryngeal speech of cantonese. J. Speech Lang. Hear. Res. 52, 780–789 (2009)CrossRef
7.
go back to reference Karlsson, F., Olofsson, K., Blomstedt, P., Linder, J., Nordh, E., van Doorn, J.: Articulatory closure proficiency in patients with Parkinson’s disease following deep brain stimulation of the subthalamic nucleus and caudal zona incerta. J. Speech. Lang. Hear. Res. 57, 1178–1190 (2014)CrossRef Karlsson, F., Olofsson, K., Blomstedt, P., Linder, J., Nordh, E., van Doorn, J.: Articulatory closure proficiency in patients with Parkinson’s disease following deep brain stimulation of the subthalamic nucleus and caudal zona incerta. J. Speech. Lang. Hear. Res. 57, 1178–1190 (2014)CrossRef
8.
go back to reference Pouplier, M., Marin, S., Waltl, S.: Voice onset time in consonant cluster errors: can phonetic accommodation differentiate cognitive from motor errors? J. Speech Lang. Hear. Res. 57, 1577–1588 (2014)CrossRef Pouplier, M., Marin, S., Waltl, S.: Voice onset time in consonant cluster errors: can phonetic accommodation differentiate cognitive from motor errors? J. Speech Lang. Hear. Res. 57, 1577–1588 (2014)CrossRef
9.
go back to reference Borrie, S.A., Liss, J.M.: Rhythm as a coordinating device: entrainment with disordered speech. J. Speech Lang. Hear. Res. 57, 815–824 (2014)CrossRef Borrie, S.A., Liss, J.M.: Rhythm as a coordinating device: entrainment with disordered speech. J. Speech Lang. Hear. Res. 57, 815–824 (2014)CrossRef
10.
go back to reference Skodda, S., Grönheit, W., Schlegel, U.: Intonation and speech rate in Parkinson’s disease: general and dynamic aspects and responsiveness to levodopa admission, J. Voice 25(4) (2011)CrossRef Skodda, S., Grönheit, W., Schlegel, U.: Intonation and speech rate in Parkinson’s disease: general and dynamic aspects and responsiveness to levodopa admission, J. Voice 25(4) (2011)CrossRef
11.
go back to reference Ní Chasaide, A., Yanushevskaya, I., Gobl, C.: Voice source dynamics in intonation. In: Proceedings of 17th International Congress of Phonetic Sciences (ICPhS 2011), pp. 1470–1473 (2011) Ní Chasaide, A., Yanushevskaya, I., Gobl, C.: Voice source dynamics in intonation. In: Proceedings of 17th International Congress of Phonetic Sciences (ICPhS 2011), pp. 1470–1473 (2011)
12.
go back to reference Hess, W.J.: Pitch Determination of Speech Signals. Springer, Berlin Heidelberg (1983)CrossRef Hess, W.J.: Pitch Determination of Speech Signals. Springer, Berlin Heidelberg (1983)CrossRef
13.
go back to reference Hess, W.J.: Pitch and voicing determination of speech with an extension toward music signals. In: Benesty, J., Sondhi, M.M., Huang, Y.A. (eds.) Springer Handbook of Speech Processing. Springer. Heidelberg, pp. 181–212 (2008)CrossRef Hess, W.J.: Pitch and voicing determination of speech with an extension toward music signals. In: Benesty, J., Sondhi, M.M., Huang, Y.A. (eds.) Springer Handbook of Speech Processing. Springer. Heidelberg, pp. 181–212 (2008)CrossRef
14.
go back to reference Boersma, P.: PRAAT, a system for doing phonetics by computer. Glot Int. 5(9/10), 5 (2002) Boersma, P.: PRAAT, a system for doing phonetics by computer. Glot Int. 5(9/10), 5 (2002)
15.
go back to reference Barsties, B., Latoszek, V., Maryn, Y., Gerrits, E., De Bodt, M.S.: “The acoustic breathiness index (ABI): a multivariate acoustic model for breathiness, J. Voice (2017, in press) Barsties, B., Latoszek, V., Maryn, Y., Gerrits, E., De Bodt, M.S.: “The acoustic breathiness index (ABI): a multivariate acoustic model for breathiness, J. Voice (2017, in press)
16.
go back to reference Uloza, V., Latoszek, B.B.V., Ulozaite-Staniene, N., Petrauskas, T., Maryn, Y.: A comparison of Dysphonia severity index and acoustic voice quality index measures in differentiating normal and dysphonic voices. Eur. Arch. Oto-Rhino-Laryngology 275(4), 949–958 (2018)CrossRef Uloza, V., Latoszek, B.B.V., Ulozaite-Staniene, N., Petrauskas, T., Maryn, Y.: A comparison of Dysphonia severity index and acoustic voice quality index measures in differentiating normal and dysphonic voices. Eur. Arch. Oto-Rhino-Laryngology 275(4), 949–958 (2018)CrossRef
17.
go back to reference Latoszek, B.B.V., Ulozaitė-Stanienė, N., Petrauskas, T., Uloza, V., Maryn, Y.: Diagnostic accuracy of dysphonia classification of DSI and AVQI. Laryngoscope (2018) Latoszek, B.B.V., Ulozaitė-Stanienė, N., Petrauskas, T., Uloza, V., Maryn, Y.: Diagnostic accuracy of dysphonia classification of DSI and AVQI. Laryngoscope (2018)
18.
go back to reference Latoszek, B.B.V., De Bodt, M., Gerrits, E., Maryn, Y.: The exploration of an objective model for roughness with several acoustic markers. J. Voice 32(2), 149–161 (2018)CrossRef Latoszek, B.B.V., De Bodt, M., Gerrits, E., Maryn, Y.: The exploration of an objective model for roughness with several acoustic markers. J. Voice 32(2), 149–161 (2018)CrossRef
19.
go back to reference Boersma, P.: Should jitter be measured by peak picking or by waveform matching? Folia Phoniatr. Logop. 61(5), 305–308 (2009)CrossRef Boersma, P.: Should jitter be measured by peak picking or by waveform matching? Folia Phoniatr. Logop. 61(5), 305–308 (2009)CrossRef
20.
go back to reference Medan, Y., Yair, E., Chazan, D.: Super resolution pitch determination of speech signals. IEEE Trans. Signal Process. 39(1), 40–48 (1991)CrossRef Medan, Y., Yair, E., Chazan, D.: Super resolution pitch determination of speech signals. IEEE Trans. Signal Process. 39(1), 40–48 (1991)CrossRef
21.
go back to reference Bagshaw, P.C., Miller, S.M., Jack, M.A.: Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching. In: 3rd European Conference on Speech Communication and Technology EUROSPEECH 1993, pp. 1003–1006 (1993) Bagshaw, P.C., Miller, S.M., Jack, M.A.: Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching. In: 3rd European Conference on Speech Communication and Technology EUROSPEECH 1993, pp. 1003–1006 (1993)
22.
go back to reference Parsa, V., Jamieson, D.G.: A comparison of high precision F0 extraction algorithms for sustained vowels. J. Speech. Lang. Hear. Res. 42(1), 112–126 (1999)CrossRef Parsa, V., Jamieson, D.G.: A comparison of high precision F0 extraction algorithms for sustained vowels. J. Speech. Lang. Hear. Res. 42(1), 112–126 (1999)CrossRef
23.
go back to reference Ferrer, C., Torres, D., Hernández-Díaz, M.E.: Using dynamic time warping of T0 contours in the evaluation of cycle-to-cycle pitch detection algorithms. Pattern Recognit. Lett. 31(6), 517–522 (2010)CrossRef Ferrer, C., Torres, D., Hernández-Díaz, M.E.: Using dynamic time warping of T0 contours in the evaluation of cycle-to-cycle pitch detection algorithms. Pattern Recognit. Lett. 31(6), 517–522 (2010)CrossRef
24.
go back to reference Disordered Voice Database v1.03. Kay Elemetrics Corp (1994) Disordered Voice Database v1.03. Kay Elemetrics Corp (1994)
25.
go back to reference Hosokawa, K., Ogawa, M., Hashimoto, M., Inohara, H.: Statistical analysis of the reliability of acoustic and electroglottographic perturbation parameters for the detection of vocal roughness. J. Voice 28(2), 263.e9–263.e16 (2014)CrossRef Hosokawa, K., Ogawa, M., Hashimoto, M., Inohara, H.: Statistical analysis of the reliability of acoustic and electroglottographic perturbation parameters for the detection of vocal roughness. J. Voice 28(2), 263.e9–263.e16 (2014)CrossRef
26.
go back to reference Manfredi, C., Giordano, A., Schoentgen, J., Fraj, S., Bocchi, L., Dejonckere, P.H.: Validity of jitter measures in non-quasi-periodic voices. part II: the effect of noise. Logop. Phoniatr. Vocology 36(2), 78–89 (2011)CrossRef Manfredi, C., Giordano, A., Schoentgen, J., Fraj, S., Bocchi, L., Dejonckere, P.H.: Validity of jitter measures in non-quasi-periodic voices. part II: the effect of noise. Logop. Phoniatr. Vocology 36(2), 78–89 (2011)CrossRef
Metadata
Title
Measuring Periodicity Perturbations in Pathological Voice: General-Purpose Software vs. Custom-Tailored Methods
Authors
Reinier Rodríguez-Guillén
Carlos A. Ferrer-Riesgo
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-30648-9_8