Skip to main content
Top
Published in: International Journal of Speech Technology 4/2015

04-08-2015

Performance evaluation of a ACF-AMDF based pitch detection scheme in real-time

Authors: Sandeep Kumar, Satish Kumar Singh, S. Bhattacharya

Published in: International Journal of Speech Technology | Issue 4/2015

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, a complete speech analysis-synthesis system for a pitch detection algorithm based on short-time autocorrelation function (ACF) and average magnitude difference function (AMDF) has been implemented in real-time using TMS320C6713 DSK. Performance of this system has been compared with the analysis-synthesis system that use autocorrelation, cepstrum and wavelet based pitch detection method in terms of synthesized speech quality, Diagnostic Rhyme Test, execution time and memory consumption. Results show that this method of pitch detection scheme is the best in terms of speech quality, execution time and memory consumption. Moreover the synthesized speech using ACF-AMDF method of pitch detection is more intelligible compared to other pitch detection algorithm.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Abdullah-AI-Mamun, K., Sarker, F., & Muhammad, G. (2009). A high resolution pitch detection algorithm based on AMDF and ACF. Journal of Scientific Reasearch, 1(3), 508–515. Abdullah-AI-Mamun, K., Sarker, F., & Muhammad, G. (2009). A high resolution pitch detection algorithm based on AMDF and ACF. Journal of Scientific Reasearch, 1(3), 508–515.
go back to reference Ahmadi, S., & Spanias, A. S. (2001). Low bit-rate speech coding based on an improved sinusoidal model. Speech Communication, 34(4), 369–390.MATHCrossRef Ahmadi, S., & Spanias, A. S. (2001). Low bit-rate speech coding based on an improved sinusoidal model. Speech Communication, 34(4), 369–390.MATHCrossRef
go back to reference Amado, R. G., & Filho, J. V. (2008). Pitch detection algorithms based on zero-cross rate and autocorrelation function for musical notes. In Proceedings of IEEE international conference on audio, language and image processing (pp. 449–454). Shanghai, China. Amado, R. G., & Filho, J. V. (2008). Pitch detection algorithms based on zero-cross rate and autocorrelation function for musical notes. In Proceedings of IEEE international conference on audio, language and image processing (pp. 449–454). Shanghai, China.
go back to reference Bhattacharya, S., Singh, S. K., & Abhinav, T. (2012). Performance evaluation of LPC and Cepstral speech coder in simulation and in real-time. In Proceedings of IEEE international conference on recent advances in information technology (RAIT) (pp. 826–831). Dhanbad, India: ISM Dhanbad. Bhattacharya, S., Singh, S. K., & Abhinav, T. (2012). Performance evaluation of LPC and Cepstral speech coder in simulation and in real-time. In Proceedings of IEEE international conference on recent advances in information technology (RAIT) (pp. 826–831). Dhanbad, India: ISM Dhanbad.
go back to reference Deller, J. R., Hansen, J. H. L., & Proakis, J. G. (2000). Discrete-time processing of speech signal (pp. 570–579). New York: Wiley. Deller, J. R., Hansen, J. H. L., & Proakis, J. G. (2000). Discrete-time processing of speech signal (pp. 570–579). New York: Wiley.
go back to reference Fangming, W., & Yip, P. (1991). Cepstral analysis using discrete trignomatric transform. IEEE Transaction on Acoustics, Speech, Signal Processing (ASSP), 39(2), 538–541. Fangming, W., & Yip, P. (1991). Cepstral analysis using discrete trignomatric transform. IEEE Transaction on Acoustics, Speech, Signal Processing (ASSP), 39(2), 538–541.
go back to reference Gu, L., & Liu, R. (1982). The government standard linear predictive coding algorithm. Speech Technology, 40–49. Gu, L., & Liu, R. (1982). The government standard linear predictive coding algorithm. Speech Technology, 40–49.
go back to reference Huang, H., & Pan, J. (2006). Speech pitch determination based on Huang-Hilbert transform. Signal Processing, 86(4), 792–803.MATHCrossRef Huang, H., & Pan, J. (2006). Speech pitch determination based on Huang-Hilbert transform. Signal Processing, 86(4), 792–803.MATHCrossRef
go back to reference Kadambe, S., & Boudreaux-Bartels, G. F. (1992). Application of the Wavelet transform for pitch detection of speech signals. IEEE Transaction on Information Theory, 38(2), 917–924.CrossRef Kadambe, S., & Boudreaux-Bartels, G. F. (1992). Application of the Wavelet transform for pitch detection of speech signals. IEEE Transaction on Information Theory, 38(2), 917–924.CrossRef
go back to reference Kumar, S., Bhattacharya, S., Dhiman, V., & Mohapatra, S. (2013). Performance evaluation of a wavelet-based pitch detection scheme. International Journal of Speech Technology, Springer, 16(4), 431–437.CrossRef Kumar, S., Bhattacharya, S., Dhiman, V., & Mohapatra, S. (2013). Performance evaluation of a wavelet-based pitch detection scheme. International Journal of Speech Technology, Springer, 16(4), 431–437.CrossRef
go back to reference Kumar, S., Bhattacharya, S., & Patel, P. (2014). A new pitch detection scheme based on ACF and AMDF. In Proceedings of IEEE international conference on recent advanced communication control and computing technology (ICACCCT) (pp. 1235–1240). Ramanathapuram, Tamilnadu, India: Syed Ammal Engg. College. Kumar, S., Bhattacharya, S., & Patel, P. (2014). A new pitch detection scheme based on ACF and AMDF. In Proceedings of IEEE international conference on recent advanced communication control and computing technology (ICACCCT) (pp. 1235–1240). Ramanathapuram, Tamilnadu, India: Syed Ammal Engg. College.
go back to reference Mohapatra, S., Dhiman, V., Kumar, S., & Bhattacharya, S. (2011). A theoretical justification for coincidence of wavelet maxima at a particular scale pair in an Event-based pitch detection method. In Proceedings of IEEE international conference on devices communications (ICDeCom). (pp. 403–406). Ranchi, India: BIT Mesra. Mohapatra, S., Dhiman, V., Kumar, S., & Bhattacharya, S. (2011). A theoretical justification for coincidence of wavelet maxima at a particular scale pair in an Event-based pitch detection method. In Proceedings of IEEE international conference on devices communications (ICDeCom). (pp. 403–406). Ranchi, India: BIT Mesra.
go back to reference Muhammad, G. (2008). Noise robust pitch detection based on extended AMDF. In Proceedings of 8th IEEE international symposium on signal processing and information technology (pp. 133–138). Sarajevo, Bosnia & Herzegovnia Muhammad, G. (2008). Noise robust pitch detection based on extended AMDF. In Proceedings of 8th IEEE international symposium on signal processing and information technology (pp. 133–138). Sarajevo, Bosnia & Herzegovnia
go back to reference Muhammad, G. (2010). Noise-robust pitch detection using auto-correlation function with enhancements. Journal of King Saud University-Computer and Information Sciences, 22, 13–28.CrossRef Muhammad, G. (2010). Noise-robust pitch detection using auto-correlation function with enhancements. Journal of King Saud University-Computer and Information Sciences, 22, 13–28.CrossRef
go back to reference Pirker, G., Wohlmayr, M., Petrik, S., & Pernkopf, F. (2011). A pitch tracking corpus with evaluation on multipitch tracking scenario. Interspeech, 1509–1512. Pirker, G., Wohlmayr, M., Petrik, S., & Pernkopf, F. (2011). A pitch tracking corpus with evaluation on multipitch tracking scenario. Interspeech, 1509–1512.
go back to reference Rabiner, L. R., Cheng, M. J., Rosenberg, A. E., & McGonegal, C. A. (1976). A comparative performance study of several pitch detection algorithms. IEEE Transaction on Audio Signal and Speech Processing, 24(5), 399–417.CrossRef Rabiner, L. R., Cheng, M. J., Rosenberg, A. E., & McGonegal, C. A. (1976). A comparative performance study of several pitch detection algorithms. IEEE Transaction on Audio Signal and Speech Processing, 24(5), 399–417.CrossRef
go back to reference Ross, M. J., Shaffer, H. L., Cohen, A., Freudberg, R., & Manley, H. J. (1974). Average magnitude difference function pitch extractor. IEEE Transaction on Acoustics, Speech, Signal Processing (ASSP), 22(5), 353–362.CrossRef Ross, M. J., Shaffer, H. L., Cohen, A., Freudberg, R., & Manley, H. J. (1974). Average magnitude difference function pitch extractor. IEEE Transaction on Acoustics, Speech, Signal Processing (ASSP), 22(5), 353–362.CrossRef
go back to reference Sondhi, M. M. (1968). New methods of pitch extraction. IEEE Transaction on Audio and Electroacoust, 16(2), 262–266.CrossRef Sondhi, M. M. (1968). New methods of pitch extraction. IEEE Transaction on Audio and Electroacoust, 16(2), 262–266.CrossRef
go back to reference Suma, S. A., & Gurumurthy, K. S. (2010). Novel pitch extraction methods using average magnitude difference (AMDF) for LPC speech coders in noisy environments. In Proceedings of IEEE international conference on signal processing system (ICSPS) (pp. 636–640). Dalian, China Suma, S. A., & Gurumurthy, K. S. (2010). Novel pitch extraction methods using average magnitude difference (AMDF) for LPC speech coders in noisy environments. In Proceedings of IEEE international conference on signal processing system (ICSPS) (pp. 636–640). Dalian, China
go back to reference Zhang, W., Xu, G., & Wang, Y. (2002). Pitch estimation based on circular AMDF. In Proceedings of IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 341–344). Florida, USA Zhang, W., Xu, G., & Wang, Y. (2002). Pitch estimation based on circular AMDF. In Proceedings of IEEE international conference on acoustics, speech and signal processing (ICASSP) (pp. 341–344). Florida, USA
Metadata
Title
Performance evaluation of a ACF-AMDF based pitch detection scheme in real-time
Authors
Sandeep Kumar
Satish Kumar Singh
S. Bhattacharya
Publication date
04-08-2015
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 4/2015
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-015-9296-2

Other articles of this Issue 4/2015

International Journal of Speech Technology 4/2015 Go to the issue