Published in: International Journal of Speech Technology 4/2013

1 December 2013

Performance evaluation of a wavelet-based pitch detection scheme

Authors: Sandeep Kumar, S. Bhattacharya, Vishal Dhiman, Shuvashree Mohapatra


Abstract

A wavelet-based pitch detection scheme reported earlier has been evaluated within a complete speech analysis-synthesis system. The system is compared with analysis-synthesis systems that use pitch detection based on autocorrelation and on cepstral analysis, using three parameters: speech quality, computation time, and memory consumption (for real-time implementation). Results for different speech signals show that the autocorrelation-based pitch detection scheme is the best in terms of speech quality and memory consumption, while the wavelet-based scheme ranks between the other two methods.
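
The abstract names autocorrelation-based pitch detection as one of the comparison methods without describing it. As a rough illustration only, the following is a minimal sketch of the generic technique for a single voiced frame. It is not the authors' implementation: the function name `pitch_autocorr`, the 60–400 Hz search range, and the synthetic test frame are all assumptions made here for the example.

```python
import numpy as np

def pitch_autocorr(frame, fs, fmin=60.0, fmax=400.0):
    """Estimate the pitch (Hz) of one voiced frame from its autocorrelation peak.

    fmin/fmax bound the search range; these defaults are illustrative
    assumptions, not values taken from the paper.
    """
    frame = np.asarray(frame, dtype=float)
    frame = frame - np.mean(frame)          # remove DC offset
    # Full autocorrelation; keep only the non-negative lags.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lag_min = int(fs / fmax)                # shortest pitch period to consider
    lag_max = min(int(fs / fmin), len(ac) - 1)
    # The lag with the strongest self-similarity is taken as the pitch period.
    lag = lag_min + int(np.argmax(ac[lag_min:lag_max + 1]))
    return float(fs / lag)

# Synthetic 40 ms frame at 8 kHz: five harmonics of a 200 Hz fundamental.
fs = 8000
t = np.arange(320) / fs
frame = sum(np.sin(2 * np.pi * 200 * k * t) / k for k in range(1, 6))
print(pitch_autocorr(frame, fs))  # prints 200.0
```

The estimator stores one autocorrelation array per frame and needs no transform, which is the kind of property that matters when memory consumption and computation time are compared across methods.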


Metadata
Title
Performance evaluation of a wavelet-based pitch detection scheme
Authors
Sandeep Kumar
S. Bhattacharya
Vishal Dhiman
Shuvashree Mohapatra
Publication date
1 December 2013
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 4/2013
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-013-9194-4
