Published in: International Journal of Speech Technology 4/2016

4 October 2016

Spectral analysis of infant cries and adult speech

Abstract

This paper reports a spectrographic analysis of infant cries. Ten different cry modes are used to analyze the differences among pathological cries. Spectrograms of adult speech and of infant cry signals are also compared. Based on differences in how energy is distributed across the spectrograms, energy-based features are calculated from the short-time Fourier transform (STFT) of the adult speech and infant cry signals. The classification performance of these features is evaluated with a support vector machine (SVM) classifier. The energy distribution in the 0–1 kHz range proves to be a promising feature for separating adult speech from infant cries, giving a classification accuracy of 98.22%. In contrast, adult speech and infant cries are very difficult to classify using the energy distribution in the 1–3 kHz range.
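The band-energy feature described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the sampling rate, frame length, hop size, window, or normalization, so the values below (8 kHz, 512-sample Hann frames, 256-sample hop, energy expressed as a fraction of total energy) are assumptions, and the function name `band_energy_ratio` is hypothetical. Pure tones stand in for real recordings.

```python
import numpy as np

def band_energy_ratio(signal, fs, f_lo, f_hi, frame_len=512, hop=256):
    """Fraction of total STFT energy that falls in [f_lo, f_hi) Hz.

    Frame length, hop and the Hann window are illustrative assumptions,
    not parameters taken from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one analysis frame")
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Cut the signal into overlapping frames (one frame per row).
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] for i in range(n_frames)]
    )
    # Short-time Fourier transform: windowed one-sided FFT of every frame.
    power = np.abs(np.fft.rfft(frames * window, axis=1)) ** 2
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    in_band = (freqs >= f_lo) & (freqs < f_hi)
    total = power.sum()
    return float(power[:, in_band].sum() / total) if total > 0 else 0.0

# Synthetic stand-ins (assumed, not data from the paper): one second each
# of a 400 Hz tone and a 2 kHz tone sampled at 8 kHz.
fs = 8000
t = np.arange(fs) / fs
low_tone = np.sin(2 * np.pi * 400 * t)
high_tone = np.sin(2 * np.pi * 2000 * t)

low_ratio = band_energy_ratio(low_tone, fs, 0, 1000)
high_ratio = band_energy_ratio(high_tone, fs, 0, 1000)
print(f"0-1 kHz energy share, 400 Hz tone: {low_ratio:.3f}")
print(f"0-1 kHz energy share, 2 kHz tone:  {high_ratio:.3f}")
```

In a full pipeline, one such value per recording (for example for the 0–1 kHz band) would form the feature passed to a classifier such as an SVM. That step is omitted here because the abstract gives no details of the kernel or training setup.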

Metadata
Title
Spectral analysis of infant cries and adult speech
Publication date
4 October 2016
Published in
International Journal of Speech Technology / Issue 4/2016
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-016-9375-z
