2023 | OriginalPaper | Chapter

Using the Client Cache for Content Encoding: Shared Dictionary Compression for the Web

Authors: Benjamin Wollmer, Wolfram Wingerath, Felix Gessert, Florian Bücklers, Hannes Kuhlmann, Erik Witt, Fabian Panse, Norbert Ritter

Published in: Service-Oriented Computing

Publisher: Springer Nature Switzerland

Abstract

As different approaches have demonstrated in the past, delta encoding and shared dictionary compression can significantly reduce the payload of websites. However, choosing a good dictionary or delta source is still a challenge and has kept delta encoding from becoming practically relevant for today’s web. In this work, we demonstrate that the often prohibitive costs of dictionary generation exhibited by earlier approaches can be avoided by simply using cache entries for content encoding: We divide web pages into different page types and use one actual page of every type as a dictionary to encode pages of the same type. In an experimental evaluation, we show that our approach outperforms current industry standards by a factor of 5 in terms of compression ratio. We discuss optimization and content normalization strategies as well as application scenarios that are possible with our approach, but infeasible with the current state of the art.
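
To make the core idea concrete, the following is a minimal sketch of encoding one page against a cached page of the same type. It is not the authors' implementation: it swaps in DEFLATE with a preset dictionary (the `zdict` parameter of Python's standard `zlib` module) as a stand-in for whatever codec the chapter evaluates, and the page template, product names, and helper function names are hypothetical. Note that DEFLATE only looks back 32 KiB, so this stand-in would use at most the tail of a larger dictionary page.

```python
import zlib


def compress_with_dictionary(page: bytes, dictionary: bytes) -> bytes:
    """Deflate `page` with the compressor primed on `dictionary`."""
    compressor = zlib.compressobj(level=9, zdict=dictionary)
    return compressor.compress(page) + compressor.flush()


def decompress_with_dictionary(payload: bytes, dictionary: bytes) -> bytes:
    """Inverse of compress_with_dictionary; requires the same dictionary."""
    decompressor = zlib.decompressobj(zdict=dictionary)
    return decompressor.decompress(payload) + decompressor.flush()


# Hypothetical template: pages of one type share most of their markup.
TEMPLATE = (
    "<!doctype html><html><head><title>{name} | Example Shop</title>"
    '<link rel="stylesheet" href="/assets/main.css"></head><body>'
    '<nav><a href="/">Home</a><a href="/deals">Deals</a>'
    '<a href="/cart">Cart</a><a href="/account">Account</a></nav>'
    '<main><h1>{name}</h1><p class="price">{price}</p>'
    '<button id="add-to-cart">Add to cart</button></main>'
    "<footer>Imprint | Privacy | Terms and Conditions</footer>"
    "</body></html>"
)

# The page already in the client cache serves as the dictionary.
cached_page = TEMPLATE.format(name="Blue Kettle", price="24.99 EUR").encode()
# The page being requested now has the same type but different content.
requested_page = TEMPLATE.format(name="Red Toaster", price="39.50 EUR").encode()

plain = zlib.compress(requested_page, 9)
with_dictionary = compress_with_dictionary(requested_page, cached_page)
restored = decompress_with_dictionary(with_dictionary, cached_page)

print(len(requested_page), len(plain), len(with_dictionary))
```

Because both sides already hold the cached page, only the small dictionary-relative payload has to travel over the network; the sizes printed here illustrate the mechanism on a toy page and say nothing about the ratios reported in the chapter.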

Footnotes
1
Content delivery networks (CDNs) accelerate content delivery by caching resources that are requested by multiple clients [6]. This is not possible for deltas that are computed for individual users.
 
5
The categorization into page types is implemented by the website owner, e.g., by URL regex.
 
8
We only use pages within the same website as a dictionary, since browsers would prohibit sharing content across different domains.
 
9
A journey describes multiple consecutive page visits of one user.
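
The URL-based categorization mentioned in footnote 5 could be sketched as follows. This is an illustration under stated assumptions, not the scheme used in the chapter: the rule table, the URL patterns, and the function names `page_type` and `pick_dictionary` are all hypothetical, and a real site owner would supply their own patterns.

```python
import re
from typing import Mapping, Optional

# Hypothetical rules mapping URL paths to page types (first match wins).
PAGE_TYPE_RULES = [
    ("product", re.compile(r"^/products/[^/]+/?$")),
    ("category", re.compile(r"^/categories/[^/]+/?$")),
    ("home", re.compile(r"^/$")),
]


def page_type(path: str) -> Optional[str]:
    """Return the page type for a URL path, or None if no rule matches."""
    for name, pattern in PAGE_TYPE_RULES:
        if pattern.search(path):
            return name
    return None


def pick_dictionary(path: str, cache: Mapping[str, bytes]) -> Optional[bytes]:
    """Return a cached page of the same type as `path`, if one exists."""
    wanted = page_type(path)
    if wanted is None:
        return None
    for cached_path, body in cache.items():
        # Any other cached page of the same type can act as the dictionary.
        if cached_path != path and page_type(cached_path) == wanted:
            return body
    return None
```

With a cache that holds `/products/blue-kettle`, a request for `/products/red-toaster` would be matched to that entry, while a request for a page type with no cached representative falls back to compression without a dictionary.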
 
Literature
1.
Alakuijala, J., et al.: Brotli: a general-purpose data compressor. ACM TOIS 37(1), 1–30 (2018)
2.
Collet, Y., Kucherawy, M. (ed.): Zstandard Compression and the 'application/zstd' Media Type. RFC 8878, February 2021
3.
Korn, D., MacDonald, J., Mogul, J., Vo, K.: The VCDIFF Generic Differencing and Compression Data Format. RFC 3284, June 2002
4.
McQuade, B., Mixter, K., Lee, W.H., Butler, J.: A proposal for shared dictionary compression over HTTP (2016)
5.
Mogul, J., et al.: Delta Encoding in HTTP. RFC 3229, January 2002
10.
Wingerath, W., et al.: Speed Kit: A Polyglot & GDPR-Compliant Approach For Caching Personalized Content. In: ICDE, Dallas, Texas (2020)
11.
Wingerath, W., et al.: Beaconnect: continuous web performance A/B-testing at scale. In: Proceedings of the 48th International Conference on Very Large Data Bases (2022)
12.
Wollmer, B., Wingerath, W., Ferrlein, S., Panse, F., Gessert, F., Ritter, N.: The case for cross-entity delta encoding in web compression. In: Proceedings of the 22nd International Conference on Web Engineering (ICWE) (2022)
13.
Wollmer, B., Wingerath, W., Ferrlein, S., Panse, F., Gessert, F., Ritter, N.: The case for cross-entity delta encoding in web compression (extended). J. Web Eng. 22(01), 131–146 (2023)
Metadata
Title: Using the Client Cache for Content Encoding: Shared Dictionary Compression for the Web
Authors: Benjamin Wollmer, Wolfram Wingerath, Felix Gessert, Florian Bücklers, Hannes Kuhlmann, Erik Witt, Fabian Panse, Norbert Ritter
Copyright Year: 2023
DOI: https://doi.org/10.1007/978-3-031-45728-9_3
