Skip to main content
Erschienen in: International Journal of Speech Technology 4/2015

29.08.2015

Hybrid speech enhancement with empirical mode decomposition and spectral subtraction for efficient speaker identification

verfasst von: Samia Abd El-Moneim, Moawad I. Dessouky, Fathi E. Abd El-Samie, M. A. Nassar, Mohammed Abd El-Naby

Erschienen in: International Journal of Speech Technology | Ausgabe 4/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Speech enhancement is a very important pre-processing step in various speech processing applications such as speech recognition, speaker identification, speech coding, and speech synthesis. In this paper, we focus on speech enhancement prior to speaker identification, because the degradations of the speech signals may cause difficulties in hearing, understanding, and speaker recognition. The paper presents a hybrid speech enhancement method based on empirical mode decomposition combined with spectral subtraction to improve the quality of speech signals prior to speaker identification. Simulation results show an improvement in speaker recognition rates with the proposed speech enhancement method as a pre-processing step.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abd El-samie, F. E. (2011). Information security for automatic speaker identification., Springer briefs in electrical and computer engineering New York: Springer.CrossRef Abd El-samie, F. E. (2011). Information security for automatic speaker identification., Springer briefs in electrical and computer engineering New York: Springer.CrossRef
Zurück zum Zitat Hossain, Md. M., Ahmed, B. & Asrafi, M. (2007). A real time speaker identification using artificial neural network. Department of Computer Science & Engineering, 1-4244-1551-9/07/$25.00 ©2007 IEEE. Hossain, Md. M., Ahmed, B. & Asrafi, M. (2007). A real time speaker identification using artificial neural network. Department of Computer Science & Engineering, 1-4244-1551-9/07/$25.00 ©2007 IEEE.
Zurück zum Zitat Alotaiby, T., Alshebeili, S. A., Alshawi, T., Ahmad, I., & Abd El-Samie, F. E. (2014). EEG seizure detection and prediction algorithms: A survey. EURASIP Journal on Advances in Signal Processing, 2014, 1–21.CrossRef Alotaiby, T., Alshebeili, S. A., Alshawi, T., Ahmad, I., & Abd El-Samie, F. E. (2014). EEG seizure detection and prediction algorithms: A survey. EURASIP Journal on Advances in Signal Processing, 2014, 1–21.CrossRef
Zurück zum Zitat Campbell, J. P. (1997). Speaker recognition: A tutorial. In Proceedings of the IEEE (Vol. 85). Campbell, J. P. (1997). Speaker recognition: A tutorial. In Proceedings of the IEEE (Vol. 85).
Zurück zum Zitat Evans, N. W. D., Mason, J. S., Liu, W. M. & Fauve, B. (2005). On the fundamental limitations of spectra subtraction: An assessment by automatic speech recognition. Swansea: University of Wales Swansea Singleton Park. http://eegalilee.swan.ac.uk. Evans, N. W. D., Mason, J. S., Liu, W. M. & Fauve, B. (2005). On the fundamental limitations of spectra subtraction: An assessment by automatic speech recognition. Swansea: University of Wales Swansea Singleton Park. http://​eegalilee.​swan.​ac.​uk.
Zurück zum Zitat Furui, S. (1981). Cepstral analysis technique for automatic speaker verification. IEEE Transactions of Acoustics, and Signal Processing, 29, 254–272.CrossRef Furui, S. (1981). Cepstral analysis technique for automatic speaker verification. IEEE Transactions of Acoustics, and Signal Processing, 29, 254–272.CrossRef
Zurück zum Zitat Goel, P., & Garg, A. (2011). Review of spectral subtraction techniques for speech enhancement. Haryana: Electronics and Communication Department, M.M. University, Mullana, Ambala. Goel, P., & Garg, A. (2011). Review of spectral subtraction techniques for speech enhancement. Haryana: Electronics and Communication Department, M.M. University, Mullana, Ambala.
Zurück zum Zitat Hu, Y., & Loizou, P. C. (2008). Evaluation of objective quality measures for speech enhancement. IEEE Transactions on Audio, Speech and Language Processing, 16(1), 229–238.CrossRef Hu, Y., & Loizou, P. C. (2008). Evaluation of objective quality measures for speech enhancement. IEEE Transactions on Audio, Speech and Language Processing, 16(1), 229–238.CrossRef
Zurück zum Zitat Karam, M., Khazaal, H. F., Aglan, H., & Cole, C. (2014). Noise removal in speech processing using spectral subtraction. Journal of Signal and Information Processing, 5, 32–41.CrossRef Karam, M., Khazaal, H. F., Aglan, H., & Cole, C. (2014). Noise removal in speech processing using spectral subtraction. Journal of Signal and Information Processing, 5, 32–41.CrossRef
Zurück zum Zitat Kim, D., & Oh, H.-S. (2009). EMD: A package for empirical mode decomposition and hilbert spectrum. The R Journal, 1, 40–46. Kim, D., & Oh, H.-S. (2009). EMD: A package for empirical mode decomposition and hilbert spectrum. The R Journal, 1, 40–46.
Zurück zum Zitat Love, B. J., Vining, J. & Sun, X. (2004). Automatic speaker recognition using neural networks. EE371D intro. To neural networks. Austin: The University of Texas. Love, B. J., Vining, J. & Sun, X. (2004). Automatic speaker recognition using neural networks. EE371D intro. To neural networks. Austin: The University of Texas.
Zurück zum Zitat Pawar, A. P.,Choudhari, K. B. (2013). Enhancement of speech in noisy conditions. The International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, 2(7). Pawar, A. P.,Choudhari, K. B. (2013). Enhancement of speech in noisy conditions. The International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, 2(7).
Zurück zum Zitat Reynolds, D. A. (2002). An overview of automatic speaker recognition technology. In Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on (Vol. 4). Orlando, FL: IEEE. Accessed 13–17 May 2002. Reynolds, D. A. (2002). An overview of automatic speaker recognition technology. In Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on (Vol. 4). Orlando, FL: IEEE. Accessed 13–17 May 2002.
Zurück zum Zitat Rilling, G., Flandrin, P., & Goncalv`es, P. (2003). On empirical mode decomposition and its algorithms. Lyon: Ecole Normale Sup´erieure de Lyon. Rilling, G., Flandrin, P., & Goncalv`es, P. (2003). On empirical mode decomposition and its algorithms. Lyon: Ecole Normale Sup´erieure de Lyon.
Zurück zum Zitat Samudravijaya, K. (2003). Speech and speaker recognition: A tutorial. Mumbai: Tata Institute of Fundamental Research. Samudravijaya, K. (2003). Speech and speaker recognition: A tutorial. Mumbai: Tata Institute of Fundamental Research.
Zurück zum Zitat Sharma, A., Singh, S. P., Kumar, V. (2005). Text-independent speaker identification using back propagation MLP network classifier for a closed set of speaker. In 2005 IEEE International Symposium on Signal Processing and Information Technology. Allahabad: Indian Institute of Information Technology. Sharma, A., Singh, S. P., Kumar, V. (2005). Text-independent speaker identification using back propagation MLP network classifier for a closed set of speaker. In 2005 IEEE International Symposium on Signal Processing and Information Technology. Allahabad: Indian Institute of Information Technology.
Metadaten
Titel
Hybrid speech enhancement with empirical mode decomposition and spectral subtraction for efficient speaker identification
verfasst von
Samia Abd El-Moneim
Moawad I. Dessouky
Fathi E. Abd El-Samie
M. A. Nassar
Mohammed Abd El-Naby
Publikationsdatum
29.08.2015
Verlag
Springer US
Erschienen in
International Journal of Speech Technology / Ausgabe 4/2015
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-015-9293-5

Weitere Artikel der Ausgabe 4/2015

International Journal of Speech Technology 4/2015 Zur Ausgabe