Skip to main content
Erschienen in: Neural Computing and Applications 7/2009

01.10.2009 | Original Article

Speech nonfluency detection using Kohonen networks

verfasst von: Izabela Szczurowska, Wiesława Kuniszyk-Jóźkowiak, Elżbieta Smołka

Erschienen in: Neural Computing and Applications | Ausgabe 7/2009

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This work covers the problem of application of neural networks to recognition and categorization of non-fluent and fluent utterance records. Fifty-five 4-s speech samples where the blockade on plosives (p, b, t, d, k and g) occurred and 55 recordings of speech of fluent speakers containing the same fragments were applied. Two Kohonen networks were used. The purpose of the first network was to reduce the dimension of the vector describing the input signals. A result of the analysis was the output matrix consisting of the neurons winning in a particular time frame. This matrix was taken as an input for the next self-organizing map network. Various types of Kohonen networks were examined with respect to their ability to classify utterances correctly into two, non-fluent and fluent, groups. Good examination results were accomplished and classification correctness exceeded 76%.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Chou SM, Papliński AP et al (2007) Speaker-dependent bimodal integration of Chinese phonemes and letters using multimodal self-organizing networks. Proceedings of international joint conference on neural networks, Orlando, Florida, USA Chou SM, Papliński AP et al (2007) Speaker-dependent bimodal integration of Chinese phonemes and letters using multimodal self-organizing networks. Proceedings of international joint conference on neural networks, Orlando, Florida, USA
6.
Zurück zum Zitat Farrell K, Mamione R et al (1994) Speaker recognition using neural networks and conventional classifiers. IEEE Trans Speech Audio Process 2(1, part 2):194–205. doi:10.1109/89.260362 CrossRef Farrell K, Mamione R et al (1994) Speaker recognition using neural networks and conventional classifiers. IEEE Trans Speech Audio Process 2(1, part 2):194–205. doi:10.​1109/​89.​260362 CrossRef
7.
Zurück zum Zitat Garfield S, Elshaw M et al (2001) Self-organizing networks for classification learning from normal and aphasic speech. The 23rd Conference of the Cognitive Science Society, Edinburgh Garfield S, Elshaw M et al (2001) Self-organizing networks for classification learning from normal and aphasic speech. The 23rd Conference of the Cognitive Science Society, Edinburgh
13.
Zurück zum Zitat Kestler HA, Schwenker F (2000) Classification of high-resolution ECG signals. In: Howlett R, Jain L (eds) Radial basis function neural networks: theory and applications. Physica-Verlag, Heidelberg Kestler HA, Schwenker F (2000) Classification of high-resolution ECG signals. In: Howlett R, Jain L (eds) Radial basis function neural networks: theory and applications. Physica-Verlag, Heidelberg
17.
Zurück zum Zitat Kohonen T (2001) Self-organizing maps. Springer, BerlinMATH Kohonen T (2001) Self-organizing maps. Springer, BerlinMATH
19.
Zurück zum Zitat Leinonen L, Hiltunen T et al (1997) Categorization of voice disorders with six perceptual dimensions. Folia Phoniatr Logop 49:9–20CrossRef Leinonen L, Hiltunen T et al (1997) Categorization of voice disorders with six perceptual dimensions. Folia Phoniatr Logop 49:9–20CrossRef
20.
Zurück zum Zitat Leinonen L, Kangas J et al (1992) Dysphonia detected by pattern recognition of spectral composition. J Speech Hear Res 35:287–295 Leinonen L, Kangas J et al (1992) Dysphonia detected by pattern recognition of spectral composition. J Speech Hear Res 35:287–295
30.
Zurück zum Zitat Smolka E, Kuniszyk-Jozkowiak W et al (2004) Speech nonfluency recognition in two stages of Kohonen networks. Biocybernetics and Biomedical Engineering, Zakopane Smolka E, Kuniszyk-Jozkowiak W et al (2004) Speech nonfluency recognition in two stages of Kohonen networks. Biocybernetics and Biomedical Engineering, Zakopane
31.
Zurück zum Zitat Smołka E, Kuniszyk-Jóźkowiak W et al (2002) Reflection of fluent and non-fluent words in Kohonen network (in Polish). XLIX Open Seminar on Acoustics, Warszawa—Stare Jabłonki Smołka E, Kuniszyk-Jóźkowiak W et al (2002) Reflection of fluent and non-fluent words in Kohonen network (in Polish). XLIX Open Seminar on Acoustics, Warszawa—Stare Jabłonki
34.
Zurück zum Zitat Szczurowska I, Kuniszyk-Jóźkowiak W et al (2006) The application of Kohonen and multilayer perceptron networks in the speech nonfluency analysis. Archiv Acoust 31(4 (Supplement)):205–210 Szczurowska I, Kuniszyk-Jóźkowiak W et al (2006) The application of Kohonen and multilayer perceptron networks in the speech nonfluency analysis. Archiv Acoust 31(4 (Supplement)):205–210
Metadaten
Titel
Speech nonfluency detection using Kohonen networks
verfasst von
Izabela Szczurowska
Wiesława Kuniszyk-Jóźkowiak
Elżbieta Smołka
Publikationsdatum
01.10.2009
Verlag
Springer-Verlag
Erschienen in
Neural Computing and Applications / Ausgabe 7/2009
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-009-0261-3

Weitere Artikel der Ausgabe 7/2009

Neural Computing and Applications 7/2009 Zur Ausgabe

Premium Partner