Published in: World Wide Web, 21-02-2022

LIFOSS: a learned index scheme for streaming scenarios

Authors: Tong Yu, Guanfeng Liu, An Liu, Zhixu Li, Lei Zhao

Published in: World Wide Web | Issue 1/2023

Abstract

Research on dynamic decision-making over streaming data is in full swing, and indexing methods, an indispensable technology for data management and analysis, are evolving with it. The learned index paradigm has been proposed to replace the traditional B-tree family in some scenarios. Learned indexes have been shown to provide higher lookup efficiency and lower storage overhead than traditional indexes. However, they usually assume that the data is static, or at least that the data distribution does not change, and streaming scenarios break this strong assumption. This paper presents a learned index scheme for streaming scenarios (LIFOSS for short), where workloads insert, delete, and query data arbitrarily. Specifically, LIFOSS consists of three parts: a) an adaptive packed-memory array, which stores data and handles updates with a guaranteed lower bound on performance; b) a middle-layer model group, used to fit the cumulative distribution function of the data; c) a feedback mechanism designed to update the parameters of the model group locally in real time. Extensive experiments on two public datasets show that LIFOSS performs better than the state-of-the-art dynamic learned index method: it reduces lookup latency by at least 6%, its dynamic performance is more stable, and it requires only a tiny amount of extra space.
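The general idea behind a learned index with locally updated models can be illustrated with a minimal Python sketch. This is not the LIFOSS implementation, which the abstract does not specify. Everything here is an assumption made for illustration: the class names `Segment` and `LearnedIndexSketch`, the `seg_size` parameter, the use of plain sorted lists in place of the adaptive packed-memory array, least-squares lines as the "model group", and a naive per-segment refit as a stand-in for the feedback mechanism. Keys are assumed to be unique numbers and the initial key set non-empty.

```python
import bisect


class Segment:
    """One hypothetical model of the group: position ~ slope * key + intercept."""

    def __init__(self, keys):
        self.keys = list(keys)  # sorted keys owned by this segment
        self.refit()

    def refit(self):
        # Least-squares fit of in-segment position against key (a local CDF fit).
        n = len(self.keys)
        if n == 0:
            self.slope, self.intercept, self.err = 0.0, 0.0, 0
            return
        mean_x = sum(self.keys) / n
        mean_y = (n - 1) / 2
        var = sum((x - mean_x) ** 2 for x in self.keys)
        cov = sum((x - mean_x) * (i - mean_y) for i, x in enumerate(self.keys))
        self.slope = cov / var if var else 0.0
        self.intercept = mean_y - self.slope * mean_x
        # Largest prediction error over stored keys bounds the search window.
        self.err = max(abs(self.predict(x) - i) for i, x in enumerate(self.keys))

    def predict(self, key):
        return round(self.slope * key + self.intercept)

    def find(self, key):
        # Search only within [prediction - err, prediction + err].
        p = self.predict(key)
        lo = max(0, p - self.err)
        hi = min(len(self.keys), p + self.err + 1)
        if lo >= hi:
            return None
        i = bisect.bisect_left(self.keys, key, lo, hi)
        return i if i < hi and self.keys[i] == key else None


class LearnedIndexSketch:
    def __init__(self, sorted_keys, seg_size=64):
        # Assumes sorted_keys is non-empty, sorted, and free of duplicates.
        self.segments = [Segment(sorted_keys[i:i + seg_size])
                         for i in range(0, len(sorted_keys), seg_size)]
        # Fixed routing boundaries: the first key of each initial segment.
        self.firsts = [s.keys[0] for s in self.segments]

    def _route(self, key):
        i = bisect.bisect_right(self.firsts, key) - 1
        return self.segments[max(i, 0)]

    def contains(self, key):
        return self._route(key).find(key) is not None

    def insert(self, key):
        seg = self._route(key)
        if seg.find(key) is None:
            bisect.insort(seg.keys, key)
            seg.refit()  # local update: only this segment's parameters change

    def delete(self, key):
        seg = self._route(key)
        i = seg.find(key)
        if i is not None:
            del seg.keys[i]
            seg.refit()  # local update, as for insert
```

In this sketch an insert or delete touches only one segment, so the other models keep their parameters. The full refit on every update costs time linear in the segment size and is used here only for simplicity; the abstract describes a feedback mechanism that updates the parameters in real time, which this sketch does not attempt to reproduce.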

Title: LIFOSS: a learned index scheme for streaming scenarios
Authors: Tong Yu, Guanfeng Liu, An Liu, Zhixu Li, Lei Zhao
Publication date: 21-02-2022
Publisher: Springer US
Published in: World Wide Web / Issue 1/2023
Print ISSN: 1386-145X
Electronic ISSN: 1573-1413
DOI: https://doi.org/10.1007/s11280-022-01021-6
