Skip to main content

2016 | OriginalPaper | Buchkapitel

Endpoint Detection and De-noising Method Based on Multi-resolution Spectrogram

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The paper studied endpoint detection algorithm of noisy speech, since the visual differences of spectrogram employed by speech and noise, the paper chose spectrogram endpoint detection methods. Technical difficulties of spectrogram endpoint detection is how to describe the intuitive difference of spectrogram by mathematical amount, according to the descriptive power of autocorrelation coefficients on texture features, the paper described the difference by selecting the autocorrelation function, and proposed column autocorrelation spectrogram detection method. Through the distribution of spectrogram self-correlation function, as the threshold of endpoint detection for the noisy speech, the cut-off point between speech and noise was found out. Since the paper used broadband spectrogram, which employed poor frequency resolution, so there were still residual noise in speech column after autocorrelation spectrum detection, in order to further de-noising in different bands, combined with the multi resolution of empirical mode decomposition (EMD), the paper analyzed the noisy speech by multi-resolution, the target was broken down into different frequency scales and was further analyzed by column autocorrelation spectrogram, experiments shown that the noise reduction effect for noisy speech was ideal.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat You, K.H., Wang, H.: Robust features for noisy speech recognition based on temporal trajectory fitting of short-time autocorrelation sequences. Speech Commun. 28(99), 13–24 (1999) You, K.H., Wang, H.: Robust features for noisy speech recognition based on temporal trajectory fitting of short-time autocorrelation sequences. Speech Commun. 28(99), 13–24 (1999)
2.
Zurück zum Zitat Haweel, T.I., Haweel, M.T.: Adaptive multichannel LMS signal decoupling. In: IEEE International Conference on Communications, Signal Processing, and their Applications, pp. 1–4 (2015) Haweel, T.I., Haweel, M.T.: Adaptive multichannel LMS signal decoupling. In: IEEE International Conference on Communications, Signal Processing, and their Applications, pp. 1–4 (2015)
3.
Zurück zum Zitat Sase, T., Ramírez, J.P., Kitajo, K.: Estimating the level of dynamical noise in time series by using fractal dimensions. Phys. Lett. A 380(11–12), 1151–1163 (2016)MathSciNetCrossRef Sase, T., Ramírez, J.P., Kitajo, K.: Estimating the level of dynamical noise in time series by using fractal dimensions. Phys. Lett. A 380(11–12), 1151–1163 (2016)MathSciNetCrossRef
4.
Zurück zum Zitat Zhang, X., Mei, C., Chen, D.: Feature selection in mixed data: a method using a novel fuzzy rough set-based information entropy. Pattern Recogn. 56(1), 1–15 (2016) Zhang, X., Mei, C., Chen, D.: Feature selection in mixed data: a method using a novel fuzzy rough set-based information entropy. Pattern Recogn. 56(1), 1–15 (2016)
5.
Zurück zum Zitat Obin, N., Liu, N.M.: On the generalization of Shannon entropy for speech recognition. In: Spoken Language Technology Workshop (SLT), vol. 8537, no. 11, pp. 97–102 (2012) Obin, N., Liu, N.M.: On the generalization of Shannon entropy for speech recognition. In: Spoken Language Technology Workshop (SLT), vol. 8537, no. 11, pp. 97–102 (2012)
6.
Zurück zum Zitat Qiu-fang, A., Xiao-Jun, W.: A method for endpoint detection of speech using FBV based on harmonious analysis. Comput. Simul. 26(8), 330–333 (2009) Qiu-fang, A., Xiao-Jun, W.: A method for endpoint detection of speech using FBV based on harmonious analysis. Comput. Simul. 26(8), 330–333 (2009)
7.
Zurück zum Zitat Lareau, J., Lareau, J: Application of Shifted Delta Cepstral Features for GMM Language Identification (2006) Lareau, J., Lareau, J: Application of Shifted Delta Cepstral Features for GMM Language Identification (2006)
8.
Zurück zum Zitat Xiang-min, C., Zhang, J., Wei, G.: A speech endpoint detection algorithm based on spectrogram. Audio Eng. 4(8), 46–49 (2006) Xiang-min, C., Zhang, J., Wei, G.: A speech endpoint detection algorithm based on spectrogram. Audio Eng. 4(8), 46–49 (2006)
9.
Zurück zum Zitat Xiao, C., Sun, D., Gao, Y.: A speech enhancement algorithm based on speech spectrogram. Audio Eng. 36(9), 44–48 (2012) Xiao, C., Sun, D., Gao, Y.: A speech enhancement algorithm based on speech spectrogram. Audio Eng. 36(9), 44–48 (2012)
10.
Zurück zum Zitat Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Electronics Industry Press, Beijing (2011) Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Electronics Industry Press, Beijing (2011)
11.
Zurück zum Zitat Wang, X., Shen, H., Zhang, W.: Image mosaic by using the local autocorrelation algorithm in triangular geometric constraints. Opto-Electron. Eng. 42(4), 32–37 (2015) Wang, X., Shen, H., Zhang, W.: Image mosaic by using the local autocorrelation algorithm in triangular geometric constraints. Opto-Electron. Eng. 42(4), 32–37 (2015)
12.
Zurück zum Zitat Wang, L., Sun, Y.: Gastroscopy image retrieval based on color-texture autocorrelation algorithm. J. Circ. Syst. 16(2), 46–50 (2011) Wang, L., Sun, Y.: Gastroscopy image retrieval based on color-texture autocorrelation algorithm. J. Circ. Syst. 16(2), 46–50 (2011)
13.
Zurück zum Zitat Soon, I.Y., Koh, S.N.: Speech enhancement using 2-D Fourier transform. IEEE Trans. Speech Audio Process. 11(6), 717–724 (2003)CrossRef Soon, I.Y., Koh, S.N.: Speech enhancement using 2-D Fourier transform. IEEE Trans. Speech Audio Process. 11(6), 717–724 (2003)CrossRef
14.
Zurück zum Zitat Zhao, l: Speech Signal Processing. China Machine Press, Beijing (2009) Zhao, l: Speech Signal Processing. China Machine Press, Beijing (2009)
15.
Zurück zum Zitat Sun, Yan-kui: Wavelet Transform and Image Processing Techniques. Tsinghua University Press, Beijing (2012) Sun, Yan-kui: Wavelet Transform and Image Processing Techniques. Tsinghua University Press, Beijing (2012)
16.
Zurück zum Zitat Huang, N.E., Shen, Z., Long, S.R.: The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. R. Soc. London Proc. 454(1971), 903–993 (1998)MathSciNetCrossRefMATH Huang, N.E., Shen, Z., Long, S.R.: The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. R. Soc. London Proc. 454(1971), 903–993 (1998)MathSciNetCrossRefMATH
17.
Zurück zum Zitat Fu, J., Wang, S.W., Cao, X.L.: The research on speech endpoint detection algorithm based on spectrogram row self-correlation. In: 2nd International Conference on Computer Science and Network Technology, pp. 212–216 (2012) Fu, J., Wang, S.W., Cao, X.L.: The research on speech endpoint detection algorithm based on spectrogram row self-correlation. In: 2nd International Conference on Computer Science and Network Technology, pp. 212–216 (2012)
Metadaten
Titel
Endpoint Detection and De-noising Method Based on Multi-resolution Spectrogram
verfasst von
Jing Zhang
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-42294-7_34