Skip to main content
Top

2018 | OriginalPaper | Chapter

12. Scalogram and Nonlinear Analysis of Speech

Author : Mohamed Hesham Farouk

Published in: Application of Wavelets in Speech Processing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

As voice source generally interacts with the vocal tract in a nonlinear way, the interaction may take place at the glottis during the periodic vibration of vocal cords. The resulting excitation affects the lower-frequency components of produced voice at lips. Instead, turbulent sound source interacts in a way that influences the higher-frequency components. So, the wavelet decomposition can explore such nonlinear behavior through MRA. Nonlinear and chaotic components of a speech signal can be verified through scalogram analysis obtained from such MRA using CWT. A scale index obtained from CWT can confirm chaotic behavior even for highly periodic waveforms which is the case in speech vowels.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference A. Esposito, M. Marinaro, Some notes on nonlinearities of speech, lecture notes in computer science, nonlinear speech modeling and applications. Spring 3445, 1–14 (2005) A. Esposito, M. Marinaro, Some notes on nonlinearities of speech, lecture notes in computer science, nonlinear speech modeling and applications. Spring 3445, 1–14 (2005)
2.
go back to reference Z. Ali, I. Elamvazuthi, M. Alsulaiman, G. Muhammad, Detection of voice pathology using fractal dimension in a multiresolution analysis of normal and disordered speech signals. J. Med. Syst. 40(1), 20, 10 pages (2016) Z. Ali, I. Elamvazuthi, M. Alsulaiman, G. Muhammad, Detection of voice pathology using fractal dimension in a multiresolution analysis of normal and disordered speech signals. J. Med. Syst. 40(1), 20, 10 pages (2016)
3.
go back to reference S.G. Firooz, F. Almasganj, Y. Shekofteh, Improvement of automatic speech recognition systems via nonlinear dynamical features evaluated from the recurrence plot of speech signals. Comput. Electr. Eng. 58, 215–226 (2017)CrossRef S.G. Firooz, F. Almasganj, Y. Shekofteh, Improvement of automatic speech recognition systems via nonlinear dynamical features evaluated from the recurrence plot of speech signals. Comput. Electr. Eng. 58, 215–226 (2017)CrossRef
4.
go back to reference M. Faúndez-Zanuy, S. McLaughlin, A. Esposito, A. Hussain, J. Schoentgen, G. Kubin, W.B. Kleijn, P. Maragos, Non-linear speech processing: overview and applications. Control Intell. Syst. ACTA Press 30(1), 1–10 (2002) M. Faúndez-Zanuy, S. McLaughlin, A. Esposito, A. Hussain, J. Schoentgen, G. Kubin, W.B. Kleijn, P. Maragos, Non-linear speech processing: overview and applications. Control Intell. Syst. ACTA Press 30(1), 1–10 (2002)
5.
go back to reference J.C. Vásquez-Correa, J.R. Orozco-Arroyave, J.D. Arias-Londoño, J.F. Vargas-Bonilla, E. Nöth, Non-linear dynamics characterization from wavelet packet transform for automatic recognition of emotional speech. Recent Adv. Nonlin. Speech Process. 48, 199–207 (2016, Springer, Cham)CrossRef J.C. Vásquez-Correa, J.R. Orozco-Arroyave, J.D. Arias-Londoño, J.F. Vargas-Bonilla, E. Nöth, Non-linear dynamics characterization from wavelet packet transform for automatic recognition of emotional speech. Recent Adv. Nonlin. Speech Process. 48, 199–207 (2016, Springer, Cham)CrossRef
6.
go back to reference E. Sejdic, I. Djurovic, L. Stankovic, Quantitative performance analysis of scalogram as instantaneous frequency estimator. IEEE Trans. Signal Process. 56(8), 3837–3845 (2008)MathSciNetCrossRef E. Sejdic, I. Djurovic, L. Stankovic, Quantitative performance analysis of scalogram as instantaneous frequency estimator. IEEE Trans. Signal Process. 56(8), 3837–3845 (2008)MathSciNetCrossRef
7.
go back to reference I. Jemaa, K. Ouni, Y. Laprie, S. Ouni, J.-P. Haton, A new automatic formant tracking approach based on scalogram maxima detection using complex wavelets, in CEIT – International Conference on Control, Engineering & Information Technology – 2013, Jun 2013, Sousse, Tunisia, 2013 I. Jemaa, K. Ouni, Y. Laprie, S. Ouni, J.-P. Haton, A new automatic formant tracking approach based on scalogram maxima detection using complex wavelets, in CEIT – International Conference on Control, Engineering & Information Technology – 2013, Jun 2013, Sousse, Tunisia, 2013
8.
go back to reference P. Anju, P. Shanmugapriya, Speaker verification using scalogram and Gaussian mixture model, in International Conference on Engineering and Technology, Bofring, 2013, pp. 22–25 P. Anju, P. Shanmugapriya, Speaker verification using scalogram and Gaussian mixture model, in International Conference on Engineering and Technology, Bofring, 2013, pp. 22–25
9.
go back to reference J.J. Jiang, Y. Zhang, C. McGilligan, Chaos in voice, from modeling to measurement. J. Voice 20(1), 2–17 (2006)CrossRef J.J. Jiang, Y. Zhang, C. McGilligan, Chaos in voice, from modeling to measurement. J. Voice 20(1), 2–17 (2006)CrossRef
10.
go back to reference G. Vaziri, F. Almasganj, R. Behroozmand, Pathological assessment of patients’speech signals using nonlinear dynamical analysis. Comput. Biol. Med. 40, 54–63 (2010)CrossRef G. Vaziri, F. Almasganj, R. Behroozmand, Pathological assessment of patients’speech signals using nonlinear dynamical analysis. Comput. Biol. Med. 40, 54–63 (2010)CrossRef
11.
go back to reference Y. Hou, A compactly supported, symmetrical and quasi-orthogonal wavelet. Int. J. Wavelets Multiresolution Inf. Process. 8(6), 931–940 (2010)MathSciNetCrossRefMATH Y. Hou, A compactly supported, symmetrical and quasi-orthogonal wavelet. Int. J. Wavelets Multiresolution Inf. Process. 8(6), 931–940 (2010)MathSciNetCrossRefMATH
12.
go back to reference R. Benítez, V.J. Bolós, M.E. Ramírez, A wavelet-based tool for studying non-periodicity. Comput. Math. Appl. 60(3), 634–641 (2010)MathSciNetCrossRefMATH R. Benítez, V.J. Bolós, M.E. Ramírez, A wavelet-based tool for studying non-periodicity. Comput. Math. Appl. 60(3), 634–641 (2010)MathSciNetCrossRefMATH
13.
go back to reference E. Campos Cantón, J.S. Murguía, Wavelet analysis of chaotic time series. Revista Mexicana de Física 52(2), 155–162 (2006) E. Campos Cantón, J.S. Murguía, Wavelet analysis of chaotic time series. Revista Mexicana de Física 52(2), 155–162 (2006)
14.
go back to reference G. Chen, S. Hsu, Y. Huang, M. Roque-Sol, The spectrum of chaotic time series (II): wavelet analysis. Int. J. Bifurcation Chaos 21(5), 1457–1467 (2011)MathSciNetCrossRefMATH G. Chen, S. Hsu, Y. Huang, M. Roque-Sol, The spectrum of chaotic time series (II): wavelet analysis. Int. J. Bifurcation Chaos 21(5), 1457–1467 (2011)MathSciNetCrossRefMATH
15.
go back to reference M. Hesham, Wavelet-scalogram based study of non-periodicity in speech signals as a complementary measure of chaotic content. Int. J Speech Technol 16(3), 353–361 (2013)CrossRef M. Hesham, Wavelet-scalogram based study of non-periodicity in speech signals as a complementary measure of chaotic content. Int. J Speech Technol 16(3), 353–361 (2013)CrossRef
16.
go back to reference S. Mallat, A Wavelet Tour of Signal Processing (Academic Press, London, 1999)MATH S. Mallat, A Wavelet Tour of Signal Processing (Academic Press, London, 1999)MATH
17.
go back to reference M.S. Chavan, N. Mastorakis, M.N. Chavan, M.S. Gaikwad, Implementation of SYMLET wavelets to removal of Gaussian additive noise from speech signal, in Proceeding of 10th WSEAS International Conference on Signal Processing, Robotics and Automation, Wisconsin, USA, 2011, pp. 37–41 M.S. Chavan, N. Mastorakis, M.N. Chavan, M.S. Gaikwad, Implementation of SYMLET wavelets to removal of Gaussian additive noise from speech signal, in Proceeding of 10th WSEAS International Conference on Signal Processing, Robotics and Automation, Wisconsin, USA, 2011, pp. 37–41
18.
go back to reference Y. Long, L. Gang, G. Jun, Selection of the best wavelet base for speech signal, intelligent multimedia, video and speech processing, 2004, in Proceedings of 2004 International Symposium on, 20–22 Oct 2004, pp. 218–221 Y. Long, L. Gang, G. Jun, Selection of the best wavelet base for speech signal, intelligent multimedia, video and speech processing, 2004, in Proceedings of 2004 International Symposium on, 20–22 Oct 2004, pp. 218–221
19.
go back to reference M. Hesham, A predefined wavelet packet for speech quality assessment. J. Eng. Appl. Sci. 53(5), 637–652 (2006) M. Hesham, A predefined wavelet packet for speech quality assessment. J. Eng. Appl. Sci. 53(5), 637–652 (2006)
Metadata
Title
Scalogram and Nonlinear Analysis of Speech
Author
Mohamed Hesham Farouk
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-69002-5_12