Skip to main content
Top

2017 | OriginalPaper | Chapter

12. Packet Loss and Concealment

Authors : Tom Bäckström, Jérémie Lecomte

Published in: Speech Coding

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Transmission over real-world networks will occasionally suffer from transmission errors, which can significantly deteriorate the perceived quality of a speech codec. This chapter addresses the problem of transmission errors in packet based voice applications, such as voice over Internet protocol (VoIP). A broad range of techniques for recovery from packet loss on the channel are presented, from channel coding to techniques using speech signal processing methods, as well as both sender-driven and receiver-based methods. The sender based methods include for example retransmission, interleaving and forward error correction (both media-specific as well as media-independent), whereas receiver-based techniques include noise substitution, repetition and synchronisation methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference 3GPP. TS 26.402 Enhanced aacPlus general audio codec; Additional decoder tools (Release 11) (2012) 3GPP. TS 26.402 Enhanced aacPlus general audio codec; Additional decoder tools (Release 11) (2012)
2.
go back to reference Atti, V., Sinder, D.J., Subasingha, S., Rajendran, V., Dewasurendra, D., Chebiyyam, V., Varga, I., Krishnan, V., Schubert, B., Lecomte, J., et al.: Improved error resilience for VoLTE and VoIP with 3GPP EVS channel aware coding. In: Proceedings of the ICASSP, pp. 5713–5717. IEEE (2015) Atti, V., Sinder, D.J., Subasingha, S., Rajendran, V., Dewasurendra, D.,  Chebiyyam, V., Varga, I., Krishnan, V., Schubert, B., Lecomte, J., et al.: Improved error resilience for VoLTE and VoIP with 3GPP EVS channel aware coding. In: Proceedings of the ICASSP, pp. 5713–5717. IEEE (2015)
3.
go back to reference Bäckström, T., Ghido, F., Fischer, J.: Blind recovery of perceptual models in distributed speech and audio coding. In: Proceedings of the Interspeech (2016) Bäckström, T., Ghido, F., Fischer, J.: Blind recovery of perceptual models in distributed speech and audio coding. In: Proceedings of the Interspeech (2016)
4.
go back to reference Benesty, J., Sondhi, M., Huang, Y.: Springer Handbook of Speech Processing. Springer, Heidelberg (2008)CrossRef Benesty, J., Sondhi, M., Huang, Y.: Springer Handbook of Speech Processing. Springer, Heidelberg (2008)CrossRef
5.
go back to reference Bhute, V.P., Shrawankar, U.N.: Speech packet concealment techniques based on time-scale modification for voip. In: ICCSIT 2008, International Conference on Computer Science and Information Technology, pp. 825–828. IEEE (2008) Bhute, V.P., Shrawankar, U.N.: Speech packet concealment techniques based on time-scale modification for voip. In: ICCSIT 2008, International Conference on Computer Science and Information Technology, pp. 825–828. IEEE (2008)
6.
go back to reference Boufounos, P.T., Baraniuk, R.G.: 1-bit compressive sensing. In: CISS 2008, 42nd Annual Conference on Information Sciences and Systems, pp. 16–21. IEEE (2008) Boufounos, P.T., Baraniuk, R.G.: 1-bit compressive sensing. In: CISS 2008, 42nd Annual Conference on Information Sciences and Systems, pp. 16–21. IEEE (2008)
7.
go back to reference Dietz, M., Multrus, M., Eksler, V., Malenovsky, V., Norvell, E., Pobloth, H., Miao, L., Wang, Z., Laaksonen, L., Vasilache, A., Kamamoto, Y., Kikuiri, K., Ragot, S., Faure, J., Ehara, H., Rajendran, V., Atti, V., Sung, H., Oh, E., Yuan, H., Zhu, C.: Overview of the EVS codec architecture. In: Proceedings of the ICASSP, pp. 5698–5702. IEEE (2015) Dietz, M., Multrus, M., Eksler, V., Malenovsky, V., Norvell, E., Pobloth, H., Miao, L., Wang, Z., Laaksonen, L., Vasilache, A., Kamamoto, Y., Kikuiri, K., Ragot, S., Faure, J., Ehara, H., Rajendran, V., Atti, V., Sung, H., Oh, E., Yuan, H., Zhu, C.: Overview of the EVS codec architecture. In: Proceedings of the ICASSP, pp. 5698–5702. IEEE (2015)
8.
go back to reference Fairhurst, G., Wood, L.: RFC3366 Advice to link designers on link Automatic Repeat reQuest (ARQ). IETF (2002) Fairhurst, G., Wood, L.: RFC3366 Advice to link designers on link Automatic Repeat reQuest (ARQ). IETF (2002)
9.
go back to reference Fuchs, G., Multrus, M., Neuendorf, M., Geiger, R.: Mdct-based coder for highly adaptive speech and audio coding. In: Proceedings of the EUSIPCO, pp. 24–28 (2009) Fuchs, G., Multrus, M., Neuendorf, M., Geiger, R.: Mdct-based coder for highly adaptive speech and audio coding. In: Proceedings of the EUSIPCO, pp. 24–28 (2009)
10.
go back to reference Gournay, P., Rousseau, F., Lefebvre, R.: Improved packet loss recovery using late frames for prediction-based speech coders. In: Proceedings of the ICASSP, vol. 1, pp. I–108. IEEE (2003) Gournay, P., Rousseau, F., Lefebvre, R.: Improved packet loss recovery using late frames for prediction-based speech coders. In: Proceedings of the ICASSP, vol. 1, pp. I–108. IEEE (2003)
11.
go back to reference Goyal, V.K.: Multiple description coding: compression meets the network. IEEE Signal Process. Mag. 18(5), 74–93 (2001)CrossRef Goyal, V.K.: Multiple description coding: compression meets the network. IEEE Signal Process. Mag. 18(5), 74–93 (2001)CrossRef
12.
go back to reference Gruber, J., Strawczynski, Leo: Subjective effects of variable delay and speech clipping in dynamically managed voice systems. IEEE Trans. Commun. 33(8), 801–808 (1985)CrossRef Gruber, J., Strawczynski, Leo: Subjective effects of variable delay and speech clipping in dynamically managed voice systems. IEEE Trans. Commun. 33(8), 801–808 (1985)CrossRef
13.
go back to reference Han, S.: Contributions to Improved Hard- and Soft-Decision Decoding in Speech and Audio Codecs. Ph.D. thesis, Braunschweig University of Technology (2016) Han, S.: Contributions to Improved Hard- and Soft-Decision Decoding in Speech and Audio Codecs. Ph.D. thesis, Braunschweig University of Technology (2016)
14.
go back to reference Hess, W.: Pitch determination of speech signals: algorithms and devices. Pitch Determination of Speech Signals. Springer, Heidelberg (2012) Hess, W.: Pitch determination of speech signals: algorithms and devices. Pitch Determination of Speech Signals. Springer, Heidelberg (2012)
15.
go back to reference Laakso, T.I., Valimaki, V., Karjalainen, M., Laine, U.K.: Splitting the unit delay [FIR/all pass filters design]. IEEE Signal Process. Mag. 13(1), 30–60 (1996)CrossRef Laakso, T.I., Valimaki, V., Karjalainen, M., Laine, U.K.: Splitting the unit delay [FIR/all pass filters design]. IEEE Signal Process. Mag. 13(1), 30–60 (1996)CrossRef
16.
go back to reference Lecomte, J., Schnabel, M., Markovic, G., Dietz, M., Neugebauer, B.: Apparatus and method for improved concealment of the adaptive codebook in ACELP-like concealment employing improved pulse resynchronization, 24 December 2014. WO Patent App. PCT/EP2014/062,578 Lecomte, J., Schnabel, M., Markovic, G., Dietz, M., Neugebauer, B.: Apparatus and method for improved concealment of the adaptive codebook in ACELP-like concealment employing improved pulse resynchronization, 24 December 2014. WO Patent App. PCT/EP2014/062,578
17.
go back to reference Lecomte, J., Schnabel, M., Markovic, G., Dietz, M., Neugebauer, B.: Apparatus and method for improved concealment of the adaptive codebook in a CELP-like concealment employing improved pulse resynchronization, 21 April 2016. US Patent 20,160,111,094 Lecomte, J., Schnabel, M., Markovic, G., Dietz, M., Neugebauer, B.: Apparatus and method for improved concealment of the adaptive codebook in a CELP-like concealment employing improved pulse resynchronization, 21 April 2016. US Patent 20,160,111,094
18.
go back to reference Lecomte, J., Vaillancourt, T., Bruhn, S., Sung, H., Peng, K., Kikuiri, K., Wang, B., Subasingha, S., Faure, J.: Packet-loss concealment technology advances in EVS. In: Proceedings of the ICASSP, pp. 5708–5712. IEEE (2015) Lecomte, J., Vaillancourt, T., Bruhn, S., Sung, H., Peng, K., Kikuiri, K., Wang, B., Subasingha, S., Faure, J.: Packet-loss concealment technology advances in EVS. In: Proceedings of the ICASSP, pp. 5708–5712. IEEE (2015)
19.
go back to reference Lee, M.-K., Jung, S.-K., Kang, H.-G., Park, Y.-C., Youn, D.-H.: A packet loss concealment algorithm based on time-scale modification for celp-type speech coders. In: Proceedings of the ICASSP, vol. 1, pp. I–116. IEEE (2003) Lee, M.-K., Jung, S.-K., Kang, H.-G., Park, Y.-C., Youn, D.-H.: A packet loss concealment algorithm based on time-scale modification for celp-type speech coders. In: Proceedings of the ICASSP, vol. 1, pp. I–116. IEEE (2003)
20.
go back to reference Liang, Y.J., Farber, N., Girod, B.: Adaptive playout scheduling using time-scale modification in packet voice communications. In: Proceedings of the ICASSP, vol. 3, pp. 1445–1448. IEEE (2001) Liang, Y.J., Farber, N., Girod, B.: Adaptive playout scheduling using time-scale modification in packet voice communications. In: Proceedings of the ICASSP, vol. 3, pp. 1445–1448. IEEE (2001)
21.
go back to reference Liu, F., Kim, J.W., Kuo, C.-C.J.: Adaptive delay concealment for internet voice applications with packet-based time-scale modification. In: Information Technologies 2000, pp. 91–102. International Society for Optics and Photonics (2001) Liu, F., Kim, J.W., Kuo, C.-C.J.: Adaptive delay concealment for internet voice applications with packet-based time-scale modification. In: Information Technologies 2000, pp. 91–102. International Society for Optics and Photonics (2001)
22.
go back to reference Lochart, G.B., Goodman, D.J.: Reconstruction of missing speech packets by waveform substitution. Signal Process. 3, 357–360 (1986) Lochart, G.B., Goodman, D.J.: Reconstruction of missing speech packets by waveform substitution. Signal Process. 3, 357–360 (1986)
23.
go back to reference Merazka, F.: A comparison of packet loss concealment and control for voice transmission over IP network services. In: 2014 9th International Symposium on Communication Systems, Networks Digital Signal Processing (CSNDSP), pp. 497–501, July 2014 Merazka, F.: A comparison of packet loss concealment and control for voice transmission over IP network services. In: 2014 9th International Symposium on Communication Systems, Networks Digital Signal Processing (CSNDSP), pp. 497–501, July 2014
24.
go back to reference Nagabuchi, H., Kitawaki, N.: Evaluation of coded speech quality degraded by cell loss in ATM networks. Electron. Commun. Jpn. (Part III: Fundam. Electron. Sci.) 75(9), 14–24 (1992)CrossRef Nagabuchi, H., Kitawaki, N.: Evaluation of coded speech quality degraded by cell loss in ATM networks. Electron. Commun. Jpn. (Part III: Fundam. Electron. Sci.) 75(9), 14–24 (1992)CrossRef
25.
go back to reference Perkins, C: RTP payload format for interleaved media. IETF Audio/Video Transport Working Group (1999) Perkins, C: RTP payload format for interleaved media. IETF Audio/Video Transport Working Group (1999)
26.
go back to reference Rizzo, L.: Effective erasure codes for reliable computer communication protocols. ACM SIGCOMM Comput. Commun. Rev. 27(2), 24–36 (1997)CrossRef Rizzo, L.: Effective erasure codes for reliable computer communication protocols. ACM SIGCOMM Comput. Commun. Rev. 27(2), 24–36 (1997)CrossRef
27.
go back to reference Rosenberg, J., Schulzrinne, H.: RFC 2733 An RTP Payload Format for Generic Forward Error Correction. IETF, December 1999 Rosenberg, J., Schulzrinne, H.: RFC 2733 An RTP Payload Format for Generic Forward Error Correction. IETF, December 1999
28.
go back to reference Sanneck, H., Stenger, A., Younes, K.B., Girod, B.: A new technique for audio packet loss concealment. In: GLOBECOM 1996, Communications: The Key to Global Prosperity, Global Telecommunications Conference, pp. 48–52. IEEE (1996) Sanneck, H., Stenger, A., Younes, K.B., Girod, B.: A new technique for audio packet loss concealment. In: GLOBECOM 1996, Communications: The Key to Global Prosperity, Global Telecommunications Conference, pp. 48–52. IEEE (1996)
29.
go back to reference Serizawa, M., Nozawa, Y.: A packet loss concealment method using pitch waveform repetition and internal state update on the decoded speech for the sub-band ADPCM wideband speech codec. In: Proceedings of the IEEE Workshop Speech Coding, pp. 68–70. IEEE (2002) Serizawa, M., Nozawa, Y.: A packet loss concealment method using pitch waveform repetition and internal state update on the decoded speech for the sub-band ADPCM wideband speech codec. In: Proceedings of the IEEE Workshop Speech Coding, pp. 68–70. IEEE (2002)
30.
go back to reference Vaillancourt, T., Jelinek, M., Salami, R., Lefebvre, R.: Efficient frame erasure concealment in predictive speech codecs using glottal pulse resynchronisation. In: Proceedings of the ICASSP, vol. 4, pp. IV–1113. IEEE (2007) Vaillancourt, T., Jelinek, M., Salami, R., Lefebvre, R.: Efficient frame erasure concealment in predictive speech codecs using glottal pulse resynchronisation. In: Proceedings of the ICASSP, vol. 4, pp. IV–1113. IEEE (2007)
31.
go back to reference Verhelst, W., Roelands, M.: An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech. In: Proceedings of the ICASSP, vol. 2, pp. 554–557. IEEE (1993) Verhelst, W., Roelands, M.: An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech. In: Proceedings of the ICASSP, vol. 2, pp. 554–557. IEEE (1993)
32.
go back to reference Warren, R.M.: Auditory perception: An Analysis and Synthesis, vol. 109. Elsevier, Amsterdam (2013) Warren, R.M.: Auditory perception: An Analysis and Synthesis, vol. 109. Elsevier, Amsterdam (2013)
33.
go back to reference Wasem, O.J., Goodman, D.J., Dvorak, C.A., Page, H.G.: The effect of waveform substitution on the quality of pcm packet communications. IEEE Trans. Acoust. Speech, Signal Process. 36(3), 342–348 (1988)CrossRef Wasem, O.J., Goodman, D.J., Dvorak, C.A., Page, H.G.: The effect of waveform substitution on the quality of pcm packet communications. IEEE Trans. Acoust. Speech, Signal Process. 36(3), 342–348 (1988)CrossRef
34.
go back to reference Xiong, Z., Liveris, A.D., Cheng, S.: Distributed source coding for sensor networks. IEEE Signal Process. Mag. 21(5), 80–94 (2004)CrossRef Xiong, Z., Liveris, A.D., Cheng, S.: Distributed source coding for sensor networks. IEEE Signal Process. Mag. 21(5), 80–94 (2004)CrossRef
Metadata
Title
Packet Loss and Concealment
Authors
Tom Bäckström
Jérémie Lecomte
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-50204-5_12