Skip to main content
Erschienen in: International Journal of Speech Technology 4/2015

14.10.2015

A comparative study of BA, APSO, GSA, hybrid PSOGSA and SPSO in dual channel speech enhancement

Erschienen in: International Journal of Speech Technology | Ausgabe 4/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The present paper focuses on the suppression of background noise in speech signal by utilizing a powerful heuristic optimization algorithm called Bat algorithm (BA) for adaptive filtering in dual channel enhancement systems. Bat algorithm is a recently developed population based meta heuristic approach which is inspired by the hunting behavior of the bats. It is developed by Yang (2010). As a novel feature Bat algorithm is based on the echo location behavior of micro bats. BA uses the frequency tuning technique to increase the diversity of the solutions in the population, while at the same time it uses the automatic zooming and tries to balance the exploration and exploitation. A few studies have been carried out on the use of heuristics for ANC in speech enhancement by using standard particle swarm optimization (SPSO) and some of its variants till 2010. In order to extend these heuristic approaches to speech, accelerated particle swarm optimization (APSO) based enhancement approach has been proposed (Prajna et al. in Int J Speech Technol 17(4):341–351, 2014a; Prajna et al. in IJISA 6(4):1–10, 2014b, doi:10.​5815/​ijisa.​2014.​04.​01; Prajna et al. in IEEE international conference on communications and signal processing (ICCSP), April 2014, pp 1457–1461, 2014c, doi:10.​1109/​ICCSP.​2014.​6950090). To overcome the problem of poor exploitation ability of APSO, another approach is proposed based on gravitational search algorithm (GSA) (Prajna et al. 2014a, b, c). To combine both the abilities of PSO and GSA algorithms, Hybrid PSOGSA algorithm is also proposed to ANC (Prajna et al. 2014a, b, c). To further improve the efficiency of adaptive filtering in ANC, by providing a dynamic balance between exploration and exploitation, BA is proposed to speech enhancement. This paper intends to present the Bat algorithm as an improved approach to ANC in speech enhancement when compared with that of SPSO, APSO, GSA and Hybrid PSOGSA based speech enhancement algorithms. The performance of all the algorithms is evaluated by computing four objective measures SNRI, PESQ, FAI and WSS, in two real world noise conditions Babble and Factory, at three different input SNR levels set at −10, 0 and 5 dB. Simulation results prove that BA is the most successful algorithm of all the algorithms studied in this work, to suppress the background noise more effectively.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Asl, L. B., & Geravanchizadeh, M. (2010a). Dual-channel speech enhancement based on stochastic optimization strategies. In International conference on information science, signal processing and their applications, 2010, pp. 229–232. Asl, L. B., & Geravanchizadeh, M. (2010a). Dual-channel speech enhancement based on stochastic optimization strategies. In International conference on information science, signal processing and their applications, 2010, pp. 229–232.
Zurück zum Zitat Asl, L. B., & Geravanchizadeh, M. (2010b). Asexual reproduction based adaptive quantum particle swarm optimization algorithm for dual channel speech enhancement. In International conference on information science, signal processing and their applications (ISSPA 2010), pp 129–132. Asl, L. B., & Geravanchizadeh, M. (2010b). Asexual reproduction based adaptive quantum particle swarm optimization algorithm for dual channel speech enhancement. In International conference on information science, signal processing and their applications (ISSPA 2010), pp 129–132.
Zurück zum Zitat Asl, L. B., & Nezhad, V. M. (2010a). Speech enhancement using Particle swarm optimization Techniques. In International conference on measuring technology and mechatronics automation, 2010, pp. 441–444. Asl, L. B., & Nezhad, V. M. (2010a). Speech enhancement using Particle swarm optimization Techniques. In International conference on measuring technology and mechatronics automation, 2010, pp. 441–444.
Zurück zum Zitat Asl, L. B., & Nezhad, V. M. (2010b). Improved particle swarm optimization for dual-channel speech enhancement. In International conference on signal acquisition and processing, 2010, pp. 13–17. Asl, L. B., & Nezhad, V. M. (2010b). Improved particle swarm optimization for dual-channel speech enhancement. In International conference on signal acquisition and processing, 2010, pp. 13–17.
Zurück zum Zitat Eberhart, R. C., & Kennady, J. (1995). A new optimizer using particles swarm theory. In International symposium on micro machine and human science, Nagoya, Japan, pp. 39–43. Eberhart, R. C., & Kennady, J. (1995). A new optimizer using particles swarm theory. In International symposium on micro machine and human science, Nagoya, Japan, pp. 39–43.
Zurück zum Zitat Goodwin, G., & Sin, K. (1985). Adaptive filtering prediction and control. Englewood Cliffs, NJ: Prentice-Hall. Goodwin, G., & Sin, K. (1985). Adaptive filtering prediction and control. Englewood Cliffs, NJ: Prentice-Hall.
Zurück zum Zitat Hu, Y., & Loizou, P. (2006). Subjective comparison of speech enhancement algorithms. In ICASSP proceedings, Toulouse, France, pp 153–156. Hu, Y., & Loizou, P. (2006). Subjective comparison of speech enhancement algorithms. In ICASSP proceedings, Toulouse, France, pp 153–156.
Zurück zum Zitat Klatt, D. (1982). Prediction of perceived phonetic distance from critical band spectra. In Proceedings of IEEE international conference acoustics, speech, signal process, Vol. 7, pp. 1278–1281. Klatt, D. (1982). Prediction of perceived phonetic distance from critical band spectra. In Proceedings of IEEE international conference acoustics, speech, signal process, Vol. 7, pp. 1278–1281.
Zurück zum Zitat Krishnamoorthy, P. (2011). An overview of subjective and objective quality measures for noisy speech enhancement algorithms. IETE Technical Review, 28, 292–301.CrossRef Krishnamoorthy, P. (2011). An overview of subjective and objective quality measures for noisy speech enhancement algorithms. IETE Technical Review, 28, 292–301.CrossRef
Zurück zum Zitat Kunche, P., Rao, G. S. B., Reddy, K. V. V. S., & Maheswari, R. U. (2014). A new approach to dual channel speech enhancement based on hybrid PSOGSA. International Journal of Speech Technology, 18(1), 45–56. Kunche, P., Rao, G. S. B., Reddy, K. V. V. S., & Maheswari, R. U. (2014). A new approach to dual channel speech enhancement based on hybrid PSOGSA. International Journal of Speech Technology, 18(1), 45–56.
Zurück zum Zitat Loizou, P. C. (2007). Speech enhancement theory and practice. Boca Raton: CRC Press. Loizou, P. C. (2007). Speech enhancement theory and practice. Boca Raton: CRC Press.
Zurück zum Zitat Loizou, P. C., & Ma, J. (2011). Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms. The Journal of the Acoustical Society of America, 130(2), 986–995. Loizou, P. C., & Ma, J. (2011). Extending the articulation index to account for non-linear distortions introduced by noise-suppression algorithms. The Journal of the Acoustical Society of America, 130(2), 986–995.
Zurück zum Zitat Mirjalili, S., & Mohd Hashim, S. Z. (2010). A new hybrid PSOGSA algorithm for function optimization. In IEEE international conference on computer and information application (ICCIA 2010), 2010, pp. 374–377. Mirjalili, S., & Mohd Hashim, S. Z. (2010). A new hybrid PSOGSA algorithm for function optimization. In IEEE international conference on computer and information application (ICCIA 2010), 2010, pp. 374–377.
Zurück zum Zitat Osgouei, S. G., & Geravan Chizadeh, M. (2010). Dual-channel speech enhancement based on a hybrid particle swarm optimization algorithm. In 5th international symposium on telecommunications (IST’ 2010), pp. 873–877. Osgouei, S. G., & Geravan Chizadeh, M. (2010). Dual-channel speech enhancement based on a hybrid particle swarm optimization algorithm. In 5th international symposium on telecommunications (IST’ 2010), pp. 873–877.
Zurück zum Zitat Prajna, K., Rao, G. S. B., Reddy, K. V. V. S., & Maheswari, R. U. (2014a). A new approach to dual channel speech enhancement based on GSA. International Journal of Speech Technology, 17(4), 341–351. Prajna, K., Rao, G. S. B., Reddy, K. V. V. S., & Maheswari, R. U. (2014a). A new approach to dual channel speech enhancement based on GSA. International Journal of Speech Technology, 17(4), 341–351.
Zurück zum Zitat Prajna, K., Sasi Bhushan Rao, G., Reddy, K. V. V. S., & Maheswari, R. U. (2014b). A new dual channel speech enhancement approach based on accelerated particle swarm optimization (APSO). IJISA, 6(4), 1–10. doi:10.5815/ijisa.2014.04.01.CrossRef Prajna, K., Sasi Bhushan Rao, G., Reddy, K. V. V. S., & Maheswari, R. U. (2014b). A new dual channel speech enhancement approach based on accelerated particle swarm optimization (APSO). IJISA, 6(4), 1–10. doi:10.​5815/​ijisa.​2014.​04.​01.CrossRef
Zurück zum Zitat Prajna, K., Rao, G. S. B., Reddy, K. V. V. S, & Maheswari R. U. (2014c). Application of Bat algorithm in dual channel speech enhancement. In IEEE International conference on communications and signal processing (ICCSP), April 2014, pp. 1457–1461. doi:10.1109/ICCSP.2014.6950090. Prajna, K., Rao, G. S. B., Reddy, K. V. V. S, & Maheswari R. U. (2014c). Application of Bat algorithm in dual channel speech enhancement. In IEEE International conference on communications and signal processing (ICCSP), April 2014, pp. 1457–1461. doi:10.​1109/​ICCSP.​2014.​6950090.
Zurück zum Zitat Rashedi, E., Nezamabadi-Pour, H., & Saryazdi, S. (2009). GSA: A gravitational search algorithm. Information Science, 179(13), 2232–2248. Rashedi, E., Nezamabadi-Pour, H., & Saryazdi, S. (2009). GSA: A gravitational search algorithm. Information Science, 179(13), 2232–2248.
Zurück zum Zitat Ueda, T., & Suzuki, H. (1990). Performance of equalizers employing a re-training RLS algorithm for digital mobile radio communications. In 40th IEEE vehicular technology conference, 1990, pp. 553–558. Ueda, T., & Suzuki, H. (1990). Performance of equalizers employing a re-training RLS algorithm for digital mobile radio communications. In 40th IEEE vehicular technology conference, 1990, pp. 553–558.
Zurück zum Zitat Widrow, B., & Stearns, S. (1985). Adaptive signal processing. Englewood Cliffs, NJ: Prentice Hall.MATH Widrow, B., & Stearns, S. (1985). Adaptive signal processing. Englewood Cliffs, NJ: Prentice Hall.MATH
Zurück zum Zitat Yang, X. S. (2010). Nature-inspired metaheuristic algorithms (2nd edn.). Luniver Press. Yang, X. S. (2010). Nature-inspired metaheuristic algorithms (2nd edn.). Luniver Press.
Metadaten
Titel
A comparative study of BA, APSO, GSA, hybrid PSOGSA and SPSO in dual channel speech enhancement
Publikationsdatum
14.10.2015
Erschienen in
International Journal of Speech Technology / Ausgabe 4/2015
Print ISSN: 1381-2416
Elektronische ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-015-9308-2

Weitere Artikel der Ausgabe 4/2015

International Journal of Speech Technology 4/2015 Zur Ausgabe

Neuer Inhalt