Skip to main content
Top

2017 | OriginalPaper | Chapter

Hardware Implementation of MFCC Feature Extraction for Speech Recognition on FPGA

Authors : Van-Lan Dao, Van-Danh Nguyen, Hai-Duong Nguyen, Van-Phuc Hoang

Published in: Advances in Information and Communication Technology

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, an FPGA-based Mel Frequency Cepstral Coefficient (MFCC) IP core for speech recognition is presented. The implementation results on FPGA show that the proposed MFCC core achieves higher resource usage efficiency compared with other designs.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Boves, L., den Os, E.: Speaker recognition in telecom applications. In: Proceedings of the Interactive Voice Technology for Telecommunications Applications, Torino, pp. 203–208 (1998) Boves, L., den Os, E.: Speaker recognition in telecom applications. In: Proceedings of the Interactive Voice Technology for Telecommunications Applications, Torino, pp. 203–208 (1998)
2.
go back to reference McLoughlin, I.V., Sharifzadeh, H.R.: Speech recognition engine adaptions for smart home dialogues. In: Proceedings of the International Conference on Information, Communications and Signal Processing, Singapore, pp. 1–5 (2007) McLoughlin, I.V., Sharifzadeh, H.R.: Speech recognition engine adaptions for smart home dialogues. In: Proceedings of the International Conference on Information, Communications and Signal Processing, Singapore, pp. 1–5 (2007)
3.
go back to reference Marchetto, E., Avanzini, F., Flego, F.: An automatic speaker recognition system for intelligence applications. In: Proceedings of the European Signal Processing, Glasgow, pp. 1612–1616 (2009) Marchetto, E., Avanzini, F., Flego, F.: An automatic speaker recognition system for intelligence applications. In: Proceedings of the European Signal Processing, Glasgow, pp. 1612–1616 (2009)
4.
go back to reference Selvan, K., Joseph, A., Anish Babu, K.K.: Speaker recognition system for security applications. In: IEEE Recent Advances in Intelligent Computational Systems (RAICS), pp. 26–30 (2013) Selvan, K., Joseph, A., Anish Babu, K.K.: Speaker recognition system for security applications. In: IEEE Recent Advances in Intelligent Computational Systems (RAICS), pp. 26–30 (2013)
5.
go back to reference Ajgou, R., Sbaa, S., Ghendir, S., Chamsa, A., Taleb-Ahmed, A.: Robust remote speaker recognition system based on AR-MFCC features and efficient speech activity detection algorithm. In: International Symposium on Wireless Communications Systems (ISWCS), Barcelona, pp. 722–727 (2014) Ajgou, R., Sbaa, S., Ghendir, S., Chamsa, A., Taleb-Ahmed, A.: Robust remote speaker recognition system based on AR-MFCC features and efficient speech activity detection algorithm. In: International Symposium on Wireless Communications Systems (ISWCS), Barcelona, pp. 722–727 (2014)
6.
go back to reference Malode, A.A., Sahare, S.L.: An improved speaker recognition by using VQ and HMM. In: Proceedings of the International on Sustainable Energy and Intelligent Systems (SEISCON 2012), Tiruchengode, pp. 1–7 (2012) Malode, A.A., Sahare, S.L.: An improved speaker recognition by using VQ and HMM. In: Proceedings of the International on Sustainable Energy and Intelligent Systems (SEISCON 2012), Tiruchengode, pp. 1–7 (2012)
7.
go back to reference Lung, V.D., Truong, V.N.: Vietnamese speech recognition using dynamic time warping and coefficient of correlation. In: Proceedings of the International Conference on Control, Automation and Information Sciences (ICCAIS), Nha Trang, pp. 64–67 (2013) Lung, V.D., Truong, V.N.: Vietnamese speech recognition using dynamic time warping and coefficient of correlation. In: Proceedings of the International Conference on Control, Automation and Information Sciences (ICCAIS), Nha Trang, pp. 64–67 (2013)
8.
go back to reference Tuzun, O.B., Demirekler, M., Nakiboglu, K.B.: Comparison of parametric and non-parametric representations of speech for recognition. In: Proceedings, pp. 65–68 (1994) Tuzun, O.B., Demirekler, M., Nakiboglu, K.B.: Comparison of parametric and non-parametric representations of speech for recognition. In: Proceedings, pp. 65–68 (1994)
9.
go back to reference Openshaw, J.P., Sun, Z.P., Mason, J.S.: A comparison of composite features under degraded speech in speaker recognition. In: Proceedings on Acoustics, Speech, and Signal Processing, vol. 2, Minneapolis, USA, pp. 371–374 (1993) Openshaw, J.P., Sun, Z.P., Mason, J.S.: A comparison of composite features under degraded speech in speaker recognition. In: Proceedings on Acoustics, Speech, and Signal Processing, vol. 2, Minneapolis, USA, pp. 371–374 (1993)
10.
go back to reference Vergin, R., O’Shaughnessy, D., Gupta, V.: Compensated mel frequency cepstrum coefficients. In: Proceedings on Acoustics, Speech, and Signal Processing, Minneapolis, USA, pp. 323–326 (1996) Vergin, R., O’Shaughnessy, D., Gupta, V.: Compensated mel frequency cepstrum coefficients. In: Proceedings on Acoustics, Speech, and Signal Processing, Minneapolis, USA, pp. 323–326 (1996)
11.
go back to reference Ibrahim, N.J., et al.: Quranic verse recitation feature extraction using Mel-frequency cepstral coefficients (MFCC). In: Proceedings of the International Colloquium on Signal Processing and Its Applications (CSPA), Kuala Lumpur, Malaysia (2008) Ibrahim, N.J., et al.: Quranic verse recitation feature extraction using Mel-frequency cepstral coefficients (MFCC). In: Proceedings of the International Colloquium on Signal Processing and Its Applications (CSPA), Kuala Lumpur, Malaysia (2008)
12.
go back to reference Price, J., Sophomore Student: Design an automatic speech recognition system using maltab. University of Maryland Estern Shore Princess Anne Price, J., Sophomore Student: Design an automatic speech recognition system using maltab. University of Maryland Estern Shore Princess Anne
13.
go back to reference Wang, J.-C., Wang, J.-F., Weng, Y.-S.: Chip design of mel frequency cepstral coefficients for speech recognition. In: Proceedings of the Advanced IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, vol. 6, pp. 3658–3661 (2000) Wang, J.-C., Wang, J.-F., Weng, Y.-S.: Chip design of mel frequency cepstral coefficients for speech recognition. In: Proceedings of the Advanced IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, vol. 6, pp. 3658–3661 (2000)
14.
go back to reference Wassi, G., Iloga, S., Romain, O., Granado, B.: FPGA-based real-time MFCC extraction for automatic audio indexing on FM broadcast data. In: Proceedings on Design and Architectures for Signal and Image Processing (DASIP), Krakow, pp. 1–6 (2015) Wassi, G., Iloga, S., Romain, O., Granado, B.: FPGA-based real-time MFCC extraction for automatic audio indexing on FM broadcast data. In: Proceedings on Design and Architectures for Signal and Image Processing (DASIP), Krakow, pp. 1–6 (2015)
15.
go back to reference Bahoura, M., Ezzaidi, H.: Hardware implementation of MFCC feature extraction for respiratory sounds analysis. In: Proceedings of the International Workshop on Systems, Signal Processing and their Applications (WoSSPA), Algiers, pp. 226–229 (2013) Bahoura, M., Ezzaidi, H.: Hardware implementation of MFCC feature extraction for respiratory sounds analysis. In: Proceedings of the International Workshop on Systems, Signal Processing and their Applications (WoSSPA), Algiers, pp. 226–229 (2013)
16.
go back to reference Ehkan, P., Zakaria, F.F., Warip, M.N.M., Sauli, Z., Elshaikh, M.: Hardware implementation of MFCC-based feature extraction for speaker recognition. In: Sulaiman, H.A., Othman, M.A., Othman, M.F.I., Rahim, Y.A., Pee, N.C. (eds.) Advanced Computer and Communication Engineering Technology. LNEE, vol. 315, pp. 471–480. Springer, Heidelberg (2015). doi:10.1007/978-3-319-07674-4_46 Ehkan, P., Zakaria, F.F., Warip, M.N.M., Sauli, Z., Elshaikh, M.: Hardware implementation of MFCC-based feature extraction for speaker recognition. In: Sulaiman, H.A., Othman, M.A., Othman, M.F.I., Rahim, Y.A., Pee, N.C. (eds.) Advanced Computer and Communication Engineering Technology. LNEE, vol. 315, pp. 471–480. Springer, Heidelberg (2015). doi:10.​1007/​978-3-319-07674-4_​46
Metadata
Title
Hardware Implementation of MFCC Feature Extraction for Speech Recognition on FPGA
Authors
Van-Lan Dao
Van-Danh Nguyen
Hai-Duong Nguyen
Van-Phuc Hoang
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-49073-1_27

Premium Partner