Skip to main content
Top
Published in: International Journal of Speech Technology 2/2012

01-06-2012

Overall performance evaluation of adaptive multi rate 06.90 speech codec based on code excited linear prediction algorithm using MATLAB

Authors: Ninad Bhatt, Yogeshwar Kosta

Published in: International Journal of Speech Technology | Issue 2/2012

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Today, the primary constrain in wireless communication system is limited bandwidth and power. Wireless systems involved in transmission of speech envisage that efficient and effective methods need to be developed for maintaining quality-of-speech, especially at the receiving end, with maximum saving of bandwidth and power. Amongst all elements of the communication system (transmitter, channel and receiver), transmission channel (carrier of information/data, also called the medium) is the most critical and plays a key role in the transmission and reception of information/data. Channel conditions decide the quality of speech at receiver. Modeling a channel is a complex task. Many techniques are adopted to mitigate the effect of the channel. AMR (Adaptive Multi Rate) is one such technique that counteracts the deleterious effect of the channel on speech. This technique employs variable bit rate that dynamically switches to specific modes of operation (switching bit rates—called modes of operation) depending upon the channel conditions.
In this paper, the application of Code Excited Linear Prediction (CELP) source coder on speech followed by AMR codec is investigated and studied. An e-test bench using MATLAB is created to implement the CELP based AMR Codec scheme, and the same studied and investigated through a series of simulation. Here, both subjective and objective evaluations are carried out. Objective evaluations are categorized into waveform based, spectral based and perceptual based analysis. The results of the simulations are recorded and compared in various graphs and tables, which include calculation of various parameters like Absolute Error (ABS), Mean Square Error (MSE), Root Mean Square Error (RMSE), Signal to Noise Ratio (SNR), segmental SNR (segSNR) (Y. Hu and P. Loizou in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 1, pp. 153–156, 2006a; Proc. Interspeech, pp. 1447–1450, 2006b), Weighted-Slope Spectral distance (WSS) (Y. Hu and P. Loizou in Speech Commun. 49, 588–601, 2007), Perceptual Evaluation of Speech Quality (PESQ) (ITU-T rec. P.862, 2000), Log-Likelihood Ratio (LLR), Itakura-Saito Distance measure (ISD), Cepstrum Distance Measures (CEP) (V. Turbin and N. Faucheur in Proc. Online Workshop Meas. Speech Audio Quality Netw., pp. 81–84, 2005), Frequency Weighted Segmental SNR (fwSNRseg), Predicted rating of overall Quality (Covl), Rating of Speech Distortion (Csig), Rating of Background Distortion (Cbak) (ITU-T rec. P.835, 2003) and MeanOpinion Score (MOS). Simulation results clearly advocate that, it is possible to producevariable bitrates (tuning to channel conditions) in CELP coder by affecting coefficients of the coder while still maintaining a good quality of speech. Further, higher the bit-rate used, the better is the quality of speech (which can be verified from the results obtained with PESQ and MOS analysis) and at the same time offered simulation delay time also increases.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Bhatt, N., et al. (2010). Implementation & performance analysis of CELP based GSM AMR Codec using MATLAB. In IEEE conference, CNC 2010, Calicut, India. Bhatt, N., et al. (2010). Implementation & performance analysis of CELP based GSM AMR Codec using MATLAB. In IEEE conference, CNC 2010, Calicut, India.
go back to reference Falk, T. H., & Chan, W. (2006). Single-ended speech quality measurement using machine learning methods. IEEE Transactions on Audio, Speech, and Language Processing, 14(6), 1935–1947. CrossRef Falk, T. H., & Chan, W. (2006). Single-ended speech quality measurement using machine learning methods. IEEE Transactions on Audio, Speech, and Language Processing, 14(6), 1935–1947. CrossRef
go back to reference Grundlehner, B., Lecocq, J., Balan, R., & Rosca, J. (2005). Performance assessment method for speech enhancement systems. In Proc. 1st annu. IEEE BENELUX/DSP valley signal process. symp. Grundlehner, B., Lecocq, J., Balan, R., & Rosca, J. (2005). Performance assessment method for speech enhancement systems. In Proc. 1st annu. IEEE BENELUX/DSP valley signal process. symp.
go back to reference Hu, Y., & Loizou, P. (2006a). Subjective comparison of speech enhancement algorithms. In Proc. IEEE int. conf. acoust., speech, signal process. (Vol. 1, pp. 153–156). Hu, Y., & Loizou, P. (2006a). Subjective comparison of speech enhancement algorithms. In Proc. IEEE int. conf. acoust., speech, signal process. (Vol. 1, pp. 153–156).
go back to reference Hu, Y., & Loizou, P. (2006b). Evaluation of objective measures for speech enhancement. In Proc. inter speech (pp. 1447–1450). Hu, Y., & Loizou, P. (2006b). Evaluation of objective measures for speech enhancement. In Proc. inter speech (pp. 1447–1450).
go back to reference Hu, Y., & Loizou, P. (2007). Subjective comparison and evaluation of speech enhancement algorithms. Speech Communication, 49, 588–601. CrossRef Hu, Y., & Loizou, P. (2007). Subjective comparison and evaluation of speech enhancement algorithms. Speech Communication, 49, 588–601. CrossRef
go back to reference Hu, Y., & Loizou, P. C. (2008). Evaluation of objective quality measures for speech enhancement. IEEE Transactions on Audio, Speech, and Language Processing, 16(1). Hu, Y., & Loizou, P. C. (2008). Evaluation of objective quality measures for speech enhancement. IEEE Transactions on Audio, Speech, and Language Processing, 16(1).
go back to reference ITU-T (2000). Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs (ITU-T Rec. P.862, 2001). ITU-T (2000). Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs (ITU-T Rec. P.862, 2001).
go back to reference ITU-T P.835 (2003). Subjective test methodology for evaluating speech communication systems that include noise suppression algorithm (ITU-T Rec. P. 835). ITU-T P.835 (2003). Subjective test methodology for evaluating speech communication systems that include noise suppression algorithm (ITU-T Rec. P. 835).
go back to reference Jarvinen, K. (2000). Standardisation of the adaptive multirate codec. In IEEE conference. Jarvinen, K. (2000). Standardisation of the adaptive multirate codec. In IEEE conference.
go back to reference Salmela, J., & Mattila, V. (2004). New intrusive method for the objective quality evaluation of acoustic noise suppression in mobile communications. In Proc. 116th audio eng. soc. conv., Preprint 6145. Salmela, J., & Mattila, V. (2004). New intrusive method for the objective quality evaluation of acoustic noise suppression in mobile communications. In Proc. 116th audio eng. soc. conv., Preprint 6145.
go back to reference Turbin, V., & Faucheur, N. (2005). A perceptual objective measure for noise reduction systems. In Proc. online workshop meas. speech audio quality netw (pp. 81–84). Turbin, V., & Faucheur, N. (2005). A perceptual objective measure for noise reduction systems. In Proc. online workshop meas. speech audio quality netw (pp. 81–84).
go back to reference Xiao, J., Li, X., & Wan, L. (1998). Software simulation in GSM environment of federal standard 1016 CELP vocoder. In International conference on communication technology, Oct. 22–24, Beijing, China. Xiao, J., Li, X., & Wan, L. (1998). Software simulation in GSM environment of federal standard 1016 CELP vocoder. In International conference on communication technology, Oct. 22–24, Beijing, China.
Metadata
Title
Overall performance evaluation of adaptive multi rate 06.90 speech codec based on code excited linear prediction algorithm using MATLAB
Authors
Ninad Bhatt
Yogeshwar Kosta
Publication date
01-06-2012
Publisher
Springer US
Published in
International Journal of Speech Technology / Issue 2/2012
Print ISSN: 1381-2416
Electronic ISSN: 1572-8110
DOI
https://doi.org/10.1007/s10772-011-9126-0

Other articles of this Issue 2/2012

International Journal of Speech Technology 2/2012 Go to the issue