Skip to main content

2017 | OriginalPaper | Buchkapitel

A New Static Web Caching Mechanism Based on Mutual Dependency Between Result Cache and Posting List Cache

verfasst von : Thanh Trinh, Dingming Wu, Joshua Zhexue Huang

Erschienen in: Web Information Systems Engineering – WISE 2017

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Caching is an important optimization technique in search engine architectures. There exist various types of caches, such as result cache, posting list cache, intersection cache, snippet cache, and document cache. However, these caching techniques are studied separately. Although several multiple level caches that integrate different types of caches have been proposed, the relationships among different caches are ignored. In this paper, we study the mutual dependency between the result cache and the posting list cache via empirical experiments and observe duplicate hits in the two types of caches. In order to better utilize the cache space and increase the hit ratio, three algorithms are proposed to implement a static cache mechanism based on the mutual dependency between the result cache and the posting list cache. A series of experiments were conducted on a real data set and the results have demonstrated the improvement of the hit ratio of our proposals.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Altingovde, I.S., Ozcan, R., Ulusoy, Ö.: A cost-aware strategy for query result caching in web search engines. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 628–636. Springer, Heidelberg (2009). doi:10.1007/978-3-642-00958-7_59CrossRef Altingovde, I.S., Ozcan, R., Ulusoy, Ö.: A cost-aware strategy for query result caching in web search engines. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds.) ECIR 2009. LNCS, vol. 5478, pp. 628–636. Springer, Heidelberg (2009). doi:10.​1007/​978-3-642-00958-7_​59CrossRef
2.
Zurück zum Zitat Baeza-Yates, R.A., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: The impact of caching on search engines. In: SIGIR, pp. 183–190 (2007) Baeza-Yates, R.A., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: The impact of caching on search engines. In: SIGIR, pp. 183–190 (2007)
3.
Zurück zum Zitat Baeza-Yates, R.A., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: Design trade-offs for search engine caching. TWEB 2(4), 20:1–20:28 (2008)CrossRef Baeza-Yates, R.A., Gionis, A., Junqueira, F., Murdock, V., Plachouras, V., Silvestri, F.: Design trade-offs for search engine caching. TWEB 2(4), 20:1–20:28 (2008)CrossRef
4.
Zurück zum Zitat Baeza-Yates, R., Jonassen, S.: Modeling static caching in web search engines. In: Baeza-Yates, R., Vries, A.P., Zaragoza, H., Cambazoglu, B.B., Murdock, V., Lempel, R., Silvestri, F. (eds.) ECIR 2012. LNCS, vol. 7224, pp. 436–446. Springer, Heidelberg (2012). doi:10.1007/978-3-642-28997-2_37CrossRef Baeza-Yates, R., Jonassen, S.: Modeling static caching in web search engines. In: Baeza-Yates, R., Vries, A.P., Zaragoza, H., Cambazoglu, B.B., Murdock, V., Lempel, R., Silvestri, F. (eds.) ECIR 2012. LNCS, vol. 7224, pp. 436–446. Springer, Heidelberg (2012). doi:10.​1007/​978-3-642-28997-2_​37CrossRef
5.
Zurück zum Zitat Ceccarelli, D., Lucchese, C., Orlando, S., Perego, R., Silvestri, F.: Caching query-biased snippets for efficient retrieval. In: EDBT, pp. 93–104 (2011) Ceccarelli, D., Lucchese, C., Orlando, S., Perego, R., Silvestri, F.: Caching query-biased snippets for efficient retrieval. In: EDBT, pp. 93–104 (2011)
6.
Zurück zum Zitat Fagni, T., Perego, R., Silvestri, F., Orlando, S.: Boosting the performance of web search engines: caching and prefetching query results by exploiting historical usage data. ACM Trans. Inf. Syst. 24(1), 51–78 (2006)CrossRef Fagni, T., Perego, R., Silvestri, F., Orlando, S.: Boosting the performance of web search engines: caching and prefetching query results by exploiting historical usage data. ACM Trans. Inf. Syst. 24(1), 51–78 (2006)CrossRef
7.
Zurück zum Zitat Gan, Q., Suel, T.: Improved techniques for result caching in web search engines. In: WWW, pp. 431–440 (2009) Gan, Q., Suel, T.: Improved techniques for result caching in web search engines. In: WWW, pp. 431–440 (2009)
8.
Zurück zum Zitat Long, X., Suel, T.: Three-level caching for efficient query processing in large web search engines. World Wide Web 9(4), 369–395 (2006)CrossRef Long, X., Suel, T.: Three-level caching for efficient query processing in large web search engines. World Wide Web 9(4), 369–395 (2006)CrossRef
9.
Zurück zum Zitat Marín, M., Costa, V.G., Gómez-Pantoja, C.: New caching techniques for web search engines. In: HPDC, pp. 215–226 (2010) Marín, M., Costa, V.G., Gómez-Pantoja, C.: New caching techniques for web search engines. In: HPDC, pp. 215–226 (2010)
10.
Zurück zum Zitat Markatos, E.P.: On caching search engine query results. Comput. Commun. 24(2), 137–143 (2001)CrossRef Markatos, E.P.: On caching search engine query results. Comput. Commun. 24(2), 137–143 (2001)CrossRef
11.
Zurück zum Zitat Ozcan, R., Altingövde, I.S., Cambazoglu, B.B., Junqueira, F.P., Ulusoy, Ö.: A five-level static cache architecture for web search engines. Inf. Process. Manage. 48(5), 828–840 (2012)CrossRef Ozcan, R., Altingövde, I.S., Cambazoglu, B.B., Junqueira, F.P., Ulusoy, Ö.: A five-level static cache architecture for web search engines. Inf. Process. Manage. 48(5), 828–840 (2012)CrossRef
12.
Zurück zum Zitat Ozcan, R., Altingövde, I.S., Cambazoglu, B.B., Ulusoy, Ö.: Second chance: a hybrid approach for dynamic result caching and prefetching in search engines. TWEB 8(1), 3:1–3:22 (2013)CrossRef Ozcan, R., Altingövde, I.S., Cambazoglu, B.B., Ulusoy, Ö.: Second chance: a hybrid approach for dynamic result caching and prefetching in search engines. TWEB 8(1), 3:1–3:22 (2013)CrossRef
13.
Zurück zum Zitat Ozcan, R., Altingövde, I.S., Ulusoy, Ö.: Static query result caching revisited. In: WWW, pp. 1169–1170 (2008) Ozcan, R., Altingövde, I.S., Ulusoy, Ö.: Static query result caching revisited. In: WWW, pp. 1169–1170 (2008)
14.
Zurück zum Zitat Ozcan, R., Altingövde, I.S., Ulusoy, Ö.: Cost-aware strategies for query result caching in web search engines. TWEB 5(2), 9:1–9:25 (2011)CrossRef Ozcan, R., Altingövde, I.S., Ulusoy, Ö.: Cost-aware strategies for query result caching in web search engines. TWEB 5(2), 9:1–9:25 (2011)CrossRef
15.
Zurück zum Zitat Podlipnig, S., Böszörményi, L.: A survey of web cache replacement strategies. ACM Comput. Surv. 35(4), 374–398 (2003)CrossRef Podlipnig, S., Böszörményi, L.: A survey of web cache replacement strategies. ACM Comput. Surv. 35(4), 374–398 (2003)CrossRef
16.
Zurück zum Zitat Saraiva, P.C., de Moura, E.S., Fonseca, R.C., Meira, W.Jr., Ribeiro-Neto, B.A., Ziviani, N.: Rank-preserving two-level caching for scalable search engines. In: SIGIR, pp. 51–58 (2001) Saraiva, P.C., de Moura, E.S., Fonseca, R.C., Meira, W.Jr., Ribeiro-Neto, B.A., Ziviani, N.: Rank-preserving two-level caching for scalable search engines. In: SIGIR, pp. 51–58 (2001)
17.
Zurück zum Zitat Tolosa, G., Becchetti, L., Feuerstein, E., Marchetti-Spaccamela, A.: Performance improvements for search systems using an integrated cache of Lists+Intersections. In: Moura, E., Crochemore, M. (eds.) SPIRE 2014. LNCS, vol. 8799, pp. 227–235. Springer, Cham (2014). doi:10.1007/978-3-319-11918-2_22CrossRef Tolosa, G., Becchetti, L., Feuerstein, E., Marchetti-Spaccamela, A.: Performance improvements for search systems using an integrated cache of Lists+Intersections. In: Moura, E., Crochemore, M. (eds.) SPIRE 2014. LNCS, vol. 8799, pp. 227–235. Springer, Cham (2014). doi:10.​1007/​978-3-319-11918-2_​22CrossRef
18.
Zurück zum Zitat Zhang, J., Long, X., Suel, T.: Performance of compressed inverted list caching in search engines. In: WWW, pp. 387–396 (2008) Zhang, J., Long, X., Suel, T.: Performance of compressed inverted list caching in search engines. In: WWW, pp. 387–396 (2008)
19.
Zurück zum Zitat Zhou, W., Li, R., Dong, X., Xu, Z., Xiao, W.: An intersection cache based on frequent itemset mining in large scale search engines. In: IEEE Workshop on Hot Topics in Web Systems and Technologies, pp. 19–24 (2015) Zhou, W., Li, R., Dong, X., Xu, Z., Xiao, W.: An intersection cache based on frequent itemset mining in large scale search engines. In: IEEE Workshop on Hot Topics in Web Systems and Technologies, pp. 19–24 (2015)
Metadaten
Titel
A New Static Web Caching Mechanism Based on Mutual Dependency Between Result Cache and Posting List Cache
verfasst von
Thanh Trinh
Dingming Wu
Joshua Zhexue Huang
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-68786-5_12