Skip to main content
Top
Published in: International Journal of Speech Technology 4/2019

09-09-2019

Performance measurement of a novel pitch detection scheme based on weighted autocorrelation for speech signals

Author: Sandeep Kumar

Published in: International Journal of Speech Technology | Issue 4/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A novel pitch detection scheme (PDS) based on weighted autocorrelation function (WACF) is proposed. The proposed scheme has been simulated and then integrated in an analysis-synthesis system for speech signal. The simulation and real-time performance comparison of this scheme with two other existing schemes [ACF and weighted ACF (WACF)] has been carried out. The performance comparison results show that the proposed PDS outperforms (in terms of speech quality and intelligibility) for both clean as well as noisy environment as compared to the other conventional PDS schemes considered for the comparison. Moreover, simulation and real-time implementation results show that the time taken for computation and memory consumption for the proposed PDS is less as compared to the weighted ACF based PDS.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Bhattacharya, S., Singh, S. K., & Abhinav, T. (2012). Performance evaluation of LPC and Cepstral speech coder in simulation and in real-time. In Proceedings of IEEE International Conference on Recent Advances in Information Technology (RAIT) (826–831). Bhattacharya, S., Singh, S. K., & Abhinav, T. (2012). Performance evaluation of LPC and Cepstral speech coder in simulation and in real-time. In Proceedings of IEEE International Conference on Recent Advances in Information Technology (RAIT) (826–831).
go back to reference Deller, J. R., Hansen, J. H. L., & Proakis, J. G. (2000). Discrete-time processing of speech signal (pp. 570–579). Hoboken: Wiley. Deller, J. R., Hansen, J. H. L., & Proakis, J. G. (2000). Discrete-time processing of speech signal (pp. 570–579). Hoboken: Wiley.
go back to reference Fangming, W., & Yip, P. (1991). Cepstral analysis using discrete trignomatric transform. IEEE Transactions Acoustics, Speech, Signal Processing (ASSP),39(2), 538–541.CrossRef Fangming, W., & Yip, P. (1991). Cepstral analysis using discrete trignomatric transform. IEEE Transactions Acoustics, Speech, Signal Processing (ASSP),39(2), 538–541.CrossRef
go back to reference Han, Z., & Wang, X. (2019). A signal period detection algorithm based on morphological self-complementtary top-hat transform and AMDF. Information,10(1), 1–12.CrossRef Han, Z., & Wang, X. (2019). A signal period detection algorithm based on morphological self-complementtary top-hat transform and AMDF. Information,10(1), 1–12.CrossRef
go back to reference Hu, Y., & Loizou, P. (2007). Subjective evaluation and comparison of speech enhancement algorithms. Speech Communication,49, 588–601.CrossRef Hu, Y., & Loizou, P. (2007). Subjective evaluation and comparison of speech enhancement algorithms. Speech Communication,49, 588–601.CrossRef
go back to reference Huang, H., & Pan, J. (2006). Speech pitch determination based on Huang-Hilbert transform. Signal Processing,86(4), 792–803.CrossRef Huang, H., & Pan, J. (2006). Speech pitch determination based on Huang-Hilbert transform. Signal Processing,86(4), 792–803.CrossRef
go back to reference Kumar, B. (2018). Comparative performance evaluation of MMSE-based speech enhancement techniques through simulation and real-time implementation. International Journal of Speech Technology, Springer.,21(4), 1033–1044.CrossRef Kumar, B. (2018). Comparative performance evaluation of MMSE-based speech enhancement techniques through simulation and real-time implementation. International Journal of Speech Technology, Springer.,21(4), 1033–1044.CrossRef
go back to reference Kumar, S., Bhattacharya, S., Dhiman, V., & Mohapatra, S. (2013). Performance evaluation of a wavelet-based pitch detection scheme. International Journal of Speech Technology, Springer.,16(4), 431–437.CrossRef Kumar, S., Bhattacharya, S., Dhiman, V., & Mohapatra, S. (2013). Performance evaluation of a wavelet-based pitch detection scheme. International Journal of Speech Technology, Springer.,16(4), 431–437.CrossRef
go back to reference Kumar, S., Bhattacharya, S., & Patel, P. (2014). A new pitch detection scheme based on ACF and AMDF. In Proceedings of IEEE International Conference on Advanced Communications, Control and Computing Technologies, Ramnathapuram, India (pp. 1235–1240). Kumar, S., Bhattacharya, S., & Patel, P. (2014). A new pitch detection scheme based on ACF and AMDF. In Proceedings of IEEE International Conference on Advanced Communications, Control and Computing Technologies, Ramnathapuram, India (pp. 1235–1240).
go back to reference Kumar, S., Bhattacharya, S., & Singh, S. K. (2015). Performance evaluation of a ACF-AMDF based pitch detection in real-time. International Journal of Speech Technology, Springer.,18(4), 521–527.CrossRef Kumar, S., Bhattacharya, S., & Singh, S. K. (2015). Performance evaluation of a ACF-AMDF based pitch detection in real-time. International Journal of Speech Technology, Springer.,18(4), 521–527.CrossRef
go back to reference Mohapatra, S., Dhiman, V., Kumar, S., & Bhattacharya, S. (2011). A theoretical Justification for coincidence of wavelet maxima at a particular scale pair in an Event-based pitch detection method, In Proceedings of IEEE International Conference on Devices Communications (ICDeCom), BIT Mesra, Ranchi (India) (pp. 403–406). Mohapatra, S., Dhiman, V., Kumar, S., & Bhattacharya, S. (2011). A theoretical Justification for coincidence of wavelet maxima at a particular scale pair in an Event-based pitch detection method, In Proceedings of IEEE International Conference on Devices Communications (ICDeCom), BIT Mesra, Ranchi (India) (pp. 403–406).
go back to reference Perceptual Evaluation of Speech Quality (PESQ), ITU-T P. 862. Perceptual Evaluation of Speech Quality (PESQ), ITU-T P. 862.
go back to reference Pirker, G., Wohlmayr, M., Petrik, S., & Pernkopf, F. (2011). A pitch tracking corpus with evaluation on multipitch tracking scenario. Interspeech, 1509–1512. Pirker, G., Wohlmayr, M., Petrik, S., & Pernkopf, F. (2011). A pitch tracking corpus with evaluation on multipitch tracking scenario. Interspeech, 1509–1512.
go back to reference Plante, F., Ainsworth, William A., & Meyer, Georg F. (1995). A pitch extraction reference database. EUROSPEECH,95, 837–840. Plante, F., Ainsworth, William A., & Meyer, Georg F. (1995). A pitch extraction reference database. EUROSPEECH,95, 837–840.
go back to reference Rabiner, L. R. (1977). On the use of autocorrelation analysis for pitch detection. IEEE Transactions Acoustics,Speech, Signal Processing (ASSP),25(1), 24–33.CrossRef Rabiner, L. R. (1977). On the use of autocorrelation analysis for pitch detection. IEEE Transactions Acoustics,Speech, Signal Processing (ASSP),25(1), 24–33.CrossRef
go back to reference Ross, M. J., Shaffer, H. L., Cohen, A., Freudberg, R., & Manley, H. J. (1974). Average magnitude difference function pitch extractor. IEEE Transactions Acoustics, Speech, Signal Processing (ASSP),22(5), 353–362.CrossRef Ross, M. J., Shaffer, H. L., Cohen, A., Freudberg, R., & Manley, H. J. (1974). Average magnitude difference function pitch extractor. IEEE Transactions Acoustics, Speech, Signal Processing (ASSP),22(5), 353–362.CrossRef
go back to reference Shimamura, T., & Kobayashi, H. (2001). Weighted autocorrelation for pitch extraction of noisy speech. IEEE Transactions on Speech and Audio Processing,9(7), 727–730.CrossRef Shimamura, T., & Kobayashi, H. (2001). Weighted autocorrelation for pitch extraction of noisy speech. IEEE Transactions on Speech and Audio Processing,9(7), 727–730.CrossRef
Metadata
Title
Performance measurement of a novel pitch detection scheme based on weighted autocorrelation for speech signals
Author
Sandeep Kumar
Publication date
09-09-2019
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 4/2019
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-019-09634-5

Other articles of this Issue 4/2019

International Journal of Speech Technology 4/2019 Go to the issue