Published in: Cluster Computing 2/2015

1 June 2015

Improvement of speech signal extraction method using detection filter of energy spectrum entropy

Authors: Kyungyong Chung, SangYeob Oh

Abstract

Speech recognition performance has improved significantly through sustained research and development, but environmental noise remains an active research topic because noise conditions vary so widely. Speech extraction techniques are widely applied to enhance speech signals that are mixed with noise. A least mean square (LMS) adaptive filter is commonly used so that noise estimation and detection algorithms can adapt to changing environments, but an LMS filter needs time to adapt and estimate the signal. This weakness can be overcome by combining energy spectrum entropy with an average estimate LMS (AELMS) filter to detect voice activity robustly in a noisy environment. In this paper, we propose a speech signal extraction method using a detection filter of energy spectrum entropy. The proposed method extracts speech from a noise-polluted signal, reducing noise with an AELMS filter so that voice activity can be detected robustly. The AELMS filter preserves the source features of the speech, limits degradation of speech information, and reduces noise in the polluted speech signal. To improve adaptation speed, we calculated an average estimator and controlled the LMS filter step size with a frame measure. An energy spectrum entropy method was used to detect speech in signals synthesized with low-speed and high-speed driving noise. Compared with an existing method based on frame energy, the proposed method improved the start-point error rate of the detected speech by 1.7 % and the end-point error rate by 3.7 %.
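
The two building blocks named in the abstract, a spectral entropy measure per frame and an LMS adaptive filter, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' AELMS method: the abstract gives no equations for the average estimator or the frame-based step-size control, so the code shows only the textbook spectral entropy and the plain fixed-step LMS update. The function names, the tap count, the step size `mu`, and the naive DFT are all hypothetical choices made for this example.

```python
import cmath
import math
import random


def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..N/2 (adequate for short frames)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_entropy(frame):
    """Shannon entropy (in nats) of the normalized power spectrum of a frame."""
    power = power_spectrum(frame)
    total = sum(power)
    if total == 0.0:
        return 0.0  # silent frame: entropy is undefined, report 0 by convention
    probs = [p / total for p in power]
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def lms_filter(reference, desired, taps=4, mu=0.05):
    """Plain fixed-step LMS (assumed textbook form, not the paper's AELMS).

    Adapts weights w so that the filtered reference tracks `desired`.
    Returns the final weights and the per-sample error signal.
    """
    w = [0.0] * taps
    errors = []
    for n in range(len(reference)):
        # Most recent `taps` reference samples, newest first, zero-padded.
        x = [reference[n - i] if n - i >= 0 else 0.0 for i in range(taps)]
        y = sum(wi * xi for wi, xi in zip(w, x))          # filter output
        e = desired[n] - y                                # estimation error
        w = [wi + mu * e * xi for wi, xi in zip(w, x)]    # LMS weight update
        errors.append(e)
    return w, errors
```

A frame whose energy sits in a few frequency bins (a tone, or voiced speech) yields a lower entropy than a broadband noise frame, which is the property an entropy-based detector relies on. The fixed `mu` in `lms_filter` is the quantity the abstract says the proposed method controls with a frame measure to speed up adaptation; that control rule is not reproduced here.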

Metadata
Title
Improvement of speech signal extraction method using detection filter of energy spectrum entropy
Authors
Kyungyong Chung
SangYeob Oh
Publication date
1 June 2015
Publisher
Springer US
Published in
Cluster Computing / Issue 2/2015
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-015-0429-9
