2016 | OriginalPaper | Book chapter

Repetition Detection in Stuttered Speech

Authors: Pravin B. Ramteke, Shashidhar G. Koolagudi, Fathima Afroz

Published in: Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics

Publisher: Springer India

Abstract

This paper focuses on detecting repetitions in stuttered speech. The stuttered speech signal is divided into isolated units based on energy. Mel-frequency cepstral coefficients (MFCCs), formants and shimmer are extracted from each isolated unit and used as features for repetition recognition. Using Dynamic Time Warping (DTW), the features of each isolated unit are compared with those of the subsequent units within a one-second interval of speech. A threshold is set based on an analysis of the DTW scores; if the score for a pair of units falls below this threshold, the units are identified as a repeated event. The speech data used in this work comprises twenty-seven seconds of speech containing 50 repetition events. The results show that the combination of MFCCs, formants and shimmer can be used to recognise repetitions in stuttered speech: of the 50 repetitions, 47 (94%) are correctly identified.
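The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each isolated unit is already available as a start time plus a list of per-frame feature vectors, that units are sorted by start time, and that "within a one-second interval" means the start times differ by at most one second. The function names, the Euclidean frame distance, the length normalisation and the threshold value are all hypothetical choices made for illustration.

```python
import math


def dtw_distance(a, b):
    """Length-normalised DTW cost between two sequences of feature vectors.

    Assumption: Euclidean distance between frames; the abstract does not
    state which local distance or normalisation is used.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # stretch b
                                 cost[i][j - 1],      # stretch a
                                 cost[i - 1][j - 1])  # advance both
    return cost[n][m] / (n + m)


def find_repetitions(units, threshold, window=1.0):
    """Return index pairs (i, j) of units flagged as repetitions.

    units: list of (start_time_seconds, frames), sorted by start time
           (hypothetical layout).
    threshold: DTW score below which two units count as a repetition
               (hypothetical value; the paper derives its own from data).
    window: only units starting within this many seconds are compared.
    """
    hits = []
    for i, (t_i, frames_i) in enumerate(units):
        for j in range(i + 1, len(units)):
            t_j, frames_j = units[j]
            if t_j - t_i > window:
                break  # later units start even further away
            if dtw_distance(frames_i, frames_j) < threshold:
                hits.append((i, j))
    return hits
```

Because DTW allows one sequence to be stretched against the other, a unit and a slightly longer repeat of the same sound can still obtain a low score, which is what makes a single score threshold workable for units of unequal duration.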

Metadata
Title
Repetition Detection in Stuttered Speech
Authors
Pravin B. Ramteke
Shashidhar G. Koolagudi
Fathima Afroz
Copyright year
2016
Publisher
Springer India
DOI
https://doi.org/10.1007/978-81-322-2538-6_63