waLLMartCache: A Distributed, Multi-tenant and Enhanced Semantic Caching System for LLMs

  • 2025
  • OriginalPaper
  • Chapter
Abstract

This chapter presents waLLMartCache, a caching system designed to serve LLM responses efficiently at scale. It builds on GPTCache by introducing Redis as an L2 cache, enabling distributed processing across multiple nodes, and adding a decision engine that handles code snippets and temporal contexts. It also shows that pre-loading FAQs boosts cache hits. Empirical evidence and ablation studies demonstrate the improvements of waLLMartCache over existing solutions, and the chapter concludes by outlining future directions for enhancing the system.
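
To make the ideas in the abstract concrete, the following is a minimal sketch of a two-tier semantic cache with a bypass rule and FAQ pre-loading. It is not the authors' implementation, and every name in it (`TwoTierCache`, `should_bypass`, `TEMPORAL_WORDS`, the threshold value) is hypothetical. Three simplifying assumptions are made: a plain dictionary stands in for Redis as the shared L2 tier, word-set overlap stands in for embedding similarity, and the decision engine is assumed to simply skip the cache for queries that look like code or depend on the current time. The abstract does not say how the real decision engine treats such queries.

```python
import re

# Hypothetical keyword list; the chapter's actual temporal rules are not given here.
TEMPORAL_WORDS = {"today", "now", "latest", "current", "yesterday", "tomorrow"}
# Hypothetical heuristic for "looks like code".
CODE_PATTERN = re.compile(r"[{};]|\bdef\b|\bimport\b")


def tokens(text):
    """Lowercased word set, used as a crude stand-in for an embedding."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def similarity(a, b):
    """Jaccard overlap of word sets (stand-in for embedding similarity)."""
    ta, tb = tokens(a), tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def should_bypass(query):
    """Assumed decision rule: skip the cache for code or time-sensitive queries."""
    has_code = CODE_PATTERN.search(query) is not None
    has_time = bool(tokens(query) & TEMPORAL_WORDS)
    return has_code or has_time


class TwoTierCache:
    def __init__(self, threshold=0.8):
        self.l1 = {}  # per-node, in-process tier
        self.l2 = {}  # stand-in for a shared store such as Redis
        self.threshold = threshold

    def preload(self, faqs):
        """Seed the shared tier with known question/answer pairs."""
        self.l2.update(faqs)

    def _search(self, tier, query):
        """Return the answer of the most similar cached query, if similar enough."""
        best = max(tier, key=lambda q: similarity(q, query), default=None)
        if best is not None and similarity(best, query) >= self.threshold:
            return tier[best]
        return None

    def get(self, query, llm):
        """Return (answer, source), where source names the tier that answered."""
        if should_bypass(query):
            return llm(query), "bypass"
        hit = self._search(self.l1, query)
        if hit is not None:
            return hit, "l1"
        hit = self._search(self.l2, query)
        if hit is not None:
            self.l1[query] = hit  # promote to the local tier
            return hit, "l2"
        answer = llm(query)
        self.l1[query] = answer
        self.l2[query] = answer
        return answer, "miss"
```

In this sketch a pre-loaded FAQ is answered from the shared tier on first request and from the local tier afterwards, while a query containing a word such as "today" always goes to the model. A production system would replace the dictionary with a real shared store and the word overlap with vector similarity.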

Title
waLLMartCache: A Distributed, Multi-tenant and Enhanced Semantic Caching System for LLMs
Authors
Soumik Dasgupta
Anurag Wagh
Lalitdutt Parsai
Binay Gupta
Geet Vudata
Shally Sangal
Sohom Majumdar
Hema Rajesh
Kunal Banerjee
Anirban Chatterjee
Copyright Year
2025
DOI
https://doi.org/10.1007/978-3-031-78183-4_15