Skip to main content
Erschienen in: The VLDB Journal 4/2019

19.12.2018 | Regular Paper

Compact representations of temporal databases

verfasst von: Antoon Bronselaer, Christophe Billiet, Robin De Mol, Joachim Nielandt, Guy De Tré

Erschienen in: The VLDB Journal | Ausgabe 4/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper investigates the storage compactness of temporal merges. A temporal merge is a joint representation of a set of snapshots of the same database at different points in time. An axiomatic requirement for temporal merges is that each individual snapshot must be derivable from it. We first show that the usual temporal extension of a database is a special kind of temporal merge called a direct merge. For direct merges, finding the most compact representation is easy for separate relations. However, if snapshots feature foreign key dependencies, the representation may contain more tuples than strictly needed. We therefore study inclusion dependencies in temporal databases and show how to use them while maintaining minimality in our representation. Next, we argue that direct merges imply redundancy when they contain non-contiguous, value-equivalent tuples. We therefore introduce a more general kind of temporal merge known as a coherent merge and study its properties in depth. Deriving a minimal coherent merge from a direct one is shown to be NP-hard. However, several greedy algorithms are demonstrated to provide decent results in quadratic time. Next, an incremental algorithm for construction of coherent merges is presented. We provide experimental results on real-life data sets that support the usefulness of the contributions of this paper.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
By common convention, the concatenation of two sets of attributes signifies the union of those sets.
 
2
Coalescing is the operation of merging value-equivalent tuples with overlapping or meeting validity intervals.
 
3
GRID database: http://​grid.​ac.
 
Literatur
1.
Zurück zum Zitat Jensen, C., Snodgrass, R.: Temporal data management. IEEE Trans. Knowl. Data Eng. 11(1), 36–44 (1999)CrossRef Jensen, C., Snodgrass, R.: Temporal data management. IEEE Trans. Knowl. Data Eng. 11(1), 36–44 (1999)CrossRef
2.
Zurück zum Zitat Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, 3rd edn. Wiley, New York (2013) Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, 3rd edn. Wiley, New York (2013)
4.
Zurück zum Zitat Böhlen, M.: Temporal database system implementations. SIGMOD Rec. 24(4), 53–60 (1995)CrossRef Böhlen, M.: Temporal database system implementations. SIGMOD Rec. 24(4), 53–60 (1995)CrossRef
5.
Zurück zum Zitat Snodgrass, R., Ahn, I.: A taxonomy of time in databases. ACM SIGMOD Rec. 14(4), 236–246 (1985)CrossRef Snodgrass, R., Ahn, I.: A taxonomy of time in databases. ACM SIGMOD Rec. 14(4), 236–246 (1985)CrossRef
6.
Zurück zum Zitat Jensen, C., Snodgrass, R.: Semantics of time-varying information. Inf. Syst. 21(4), 311–352 (1996)CrossRef Jensen, C., Snodgrass, R.: Semantics of time-varying information. Inf. Syst. 21(4), 311–352 (1996)CrossRef
7.
Zurück zum Zitat Özsoyoglu, G., Snodgrass, R.: Temporal and real-time databases: a survey. IEEE Trans. Knowl. Data Eng. 7(4), 513–532 (1995)CrossRef Özsoyoglu, G., Snodgrass, R.: Temporal and real-time databases: a survey. IEEE Trans. Knowl. Data Eng. 7(4), 513–532 (1995)CrossRef
8.
Zurück zum Zitat Jensen, C., et al.: The Consensus Glossary of Temporal Database Concepts February 1998 Version, pp. 367–405. Springer, Berlin (1998) Jensen, C., et al.: The Consensus Glossary of Temporal Database Concepts February 1998 Version, pp. 367–405. Springer, Berlin (1998)
9.
Zurück zum Zitat Lorentzos, N., Johnson, R.: Extending relational algebra to manipulate temporal data. Inf. Syst. 13(3), 289–296 (1988)CrossRefMATH Lorentzos, N., Johnson, R.: Extending relational algebra to manipulate temporal data. Inf. Syst. 13(3), 289–296 (1988)CrossRefMATH
10.
Zurück zum Zitat Navathe, S., Ahmed, R.: A temporal relational model and a query language. Inf. Sci. 49, 147–175 (1989)CrossRefMATH Navathe, S., Ahmed, R.: A temporal relational model and a query language. Inf. Sci. 49, 147–175 (1989)CrossRefMATH
11.
Zurück zum Zitat Ahn, I., Snodgrass, R.: Partitioned storage for temporal databases. Inf. Syst. 13(4), 369–391 (1988)CrossRefMATH Ahn, I., Snodgrass, R.: Partitioned storage for temporal databases. Inf. Syst. 13(4), 369–391 (1988)CrossRefMATH
12.
Zurück zum Zitat Lum, V., Dadam, P., Erbe, R., Guenauer, J., Pistor, P., Walch, G., Werner, H., Woodfill, J.: Design of an Integrated DBMS to Support Advanced Applications, pp. 31–49. Springer, Berlin (1987) Lum, V., Dadam, P., Erbe, R., Guenauer, J., Pistor, P., Walch, G., Werner, H., Woodfill, J.: Design of an Integrated DBMS to Support Advanced Applications, pp. 31–49. Springer, Berlin (1987)
13.
Zurück zum Zitat Gunadhi, H., Segev, A.: A framework for query optimization in temporal databases. In: Proceedings of the 5th International Conference on Statistical and Scientific Database Management, pp. 131–147 (1988) Gunadhi, H., Segev, A.: A framework for query optimization in temporal databases. In: Proceedings of the 5th International Conference on Statistical and Scientific Database Management, pp. 131–147 (1988)
15.
Zurück zum Zitat Salzberg, B., Tsotras, V.: Comparison of access methods for time-evolving data. ACM Comput. Surv. 31(2), 185–221 (1999)CrossRef Salzberg, B., Tsotras, V.: Comparison of access methods for time-evolving data. ACM Comput. Surv. 31(2), 185–221 (1999)CrossRef
17.
Zurück zum Zitat Bebel, B., Krolikowski, Z., Wrembel, R.: Formal approach to modelling a multiversion data warehouse. Bull. Pol. Acad. Sci. Tech. Sci. 54(1), 51–62 (2006)MATH Bebel, B., Krolikowski, Z., Wrembel, R.: Formal approach to modelling a multiversion data warehouse. Bull. Pol. Acad. Sci. Tech. Sci. 54(1), 51–62 (2006)MATH
18.
19.
20.
Zurück zum Zitat Grandi, F., Mandreoli, F.: The valid web: an XML/XSL infrastructure for temporal management of web documents. In: Yakhno, T. (ed.) Lecture Notes in Computer Science 1909—Proceedings of the 1st International Conference on Advances in Information Systems, pp. 294–303. Springer, Berlin (2000) Grandi, F., Mandreoli, F.: The valid web: an XML/XSL infrastructure for temporal management of web documents. In: Yakhno, T. (ed.) Lecture Notes in Computer Science 1909—Proceedings of the 1st International Conference on Advances in Information Systems, pp. 294–303. Springer, Berlin (2000)
21.
Zurück zum Zitat Grandi, F., Mandreoli, F., Tiberio, P.: Temporal modelling and management of normative documents in XML format. Data Knowl. Eng. 54, 327–354 (2005)CrossRef Grandi, F., Mandreoli, F., Tiberio, P.: Temporal modelling and management of normative documents in XML format. Data Knowl. Eng. 54, 327–354 (2005)CrossRef
22.
Zurück zum Zitat Grandi, F., Mandreoli, F., Tiberio, P., Bergonzini, M.: A temporal data model and management system for normative texts in xml format. In: Proceedings of the 5th ACM International Workshop on Web Information and Data Management, pp. 29–36. ACM, New York (2003) Grandi, F., Mandreoli, F., Tiberio, P., Bergonzini, M.: A temporal data model and management system for normative texts in xml format. In: Proceedings of the 5th ACM International Workshop on Web Information and Data Management, pp. 29–36. ACM, New York (2003)
23.
Zurück zum Zitat Chien, S.Y., Tsotras, V.J., Zaniolo, C., Zhang, D.: Efficient complex query support for multiversion xml documents. In: Proceedings of the 8th International Conference on Extending Database Technology, pp. 161–178. Springer, Berlin (2002). https://doi.org/10.1007/3-540-45876-X_12 Chien, S.Y., Tsotras, V.J., Zaniolo, C., Zhang, D.: Efficient complex query support for multiversion xml documents. In: Proceedings of the 8th International Conference on Extending Database Technology, pp. 161–178. Springer, Berlin (2002). https://​doi.​org/​10.​1007/​3-540-45876-X_​12
24.
Zurück zum Zitat Wang, F.: XML-based support for database histories and document versions. Ph.D. Thesis, University of California (2004) Wang, F.: XML-based support for database histories and document versions. Ph.D. Thesis, University of California (2004)
25.
Zurück zum Zitat Wang, F., Zaniolo, C.: Temporal queries in xml document archives and web warehouses. In: Proceedings of the 10th International Symposium on Temporal Representation and Reasoning and the Fourth International Conference on Temporal Logic. IEEE (2003). https://doi.org/10.1109/TIME.2003.1214879 Wang, F., Zaniolo, C.: Temporal queries in xml document archives and web warehouses. In: Proceedings of the 10th International Symposium on Temporal Representation and Reasoning and the Fourth International Conference on Temporal Logic. IEEE (2003). https://​doi.​org/​10.​1109/​TIME.​2003.​1214879
26.
Zurück zum Zitat Ali, K.A., Pokorny, J.: A Comparison of XML-Based Temporal Models, pp. 339–350. Springer, Berlin (2009) Ali, K.A., Pokorny, J.: A Comparison of XML-Based Temporal Models, pp. 339–350. Springer, Berlin (2009)
29.
Zurück zum Zitat Gutierrez, C., Hurtado, C., Vaisman, A.: Introducing time into RDF. IEEE Trans. Knowl. Data Eng. 19(2), 207–218 (2007)CrossRef Gutierrez, C., Hurtado, C., Vaisman, A.: Introducing time into RDF. IEEE Trans. Knowl. Data Eng. 19(2), 207–218 (2007)CrossRef
30.
Zurück zum Zitat Graube, M., Hensel, S., Urbas, L.: R43ples: revisions for triples—an approach for version control in the semantic web. In: LDQ@SEMANTICS, pp. 1–8. CEUR-WS.org (2014) Graube, M., Hensel, S., Urbas, L.: R43ples: revisions for triples—an approach for version control in the semantic web. In: LDQ@SEMANTICS, pp. 1–8. CEUR-WS.org (2014)
31.
Zurück zum Zitat Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970)CrossRefMATH Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970)CrossRefMATH
32.
Zurück zum Zitat Wijsen, J.: Temporal FDs on complex objects. ACM Trans. Database Syst. 24(1), 127–176 (1999)CrossRef Wijsen, J.: Temporal FDs on complex objects. ACM Trans. Database Syst. 24(1), 127–176 (1999)CrossRef
33.
Zurück zum Zitat Wijsen, J.: Temporal Dependencies, pp. 2960–2966. Springer, Berlin (2009) Wijsen, J.: Temporal Dependencies, pp. 2960–2966. Springer, Berlin (2009)
34.
Zurück zum Zitat Casanova, M., Fagin, R., Papadimitriou, C.: Inclusion dependencies and their interaction with functional dependencies. J. Comput. Syst. Sci. 28, 29–59 (1984)MathSciNetCrossRefMATH Casanova, M., Fagin, R., Papadimitriou, C.: Inclusion dependencies and their interaction with functional dependencies. J. Comput. Syst. Sci. 28, 29–59 (1984)MathSciNetCrossRefMATH
35.
Zurück zum Zitat Rescher, N., Manor, R.: On inference from inconsistent premises. Theory Decis. 1, 179–217 (1970)CrossRefMATH Rescher, N., Manor, R.: On inference from inconsistent premises. Theory Decis. 1, 179–217 (1970)CrossRefMATH
38.
Zurück zum Zitat Sakai, S., Togasaki, M., Yamazaki, K.: A note on greedy algorithms for the maximum weighted independent set problem. Discrete Appl. Math. 126(2), 313–322 (2003)MathSciNetCrossRefMATH Sakai, S., Togasaki, M., Yamazaki, K.: A note on greedy algorithms for the maximum weighted independent set problem. Discrete Appl. Math. 126(2), 313–322 (2003)MathSciNetCrossRefMATH
40.
Zurück zum Zitat Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96), pp. 226–231. AAAI Press, New York (1996) Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96), pp. 226–231. AAAI Press, New York (1996)
41.
Zurück zum Zitat Allen, J.F.: Maintaining knowledge about temporal intervals. Commun. ACM 26(11), 832–843 (1983)CrossRefMATH Allen, J.F.: Maintaining knowledge about temporal intervals. Commun. ACM 26(11), 832–843 (1983)CrossRefMATH
Metadaten
Titel
Compact representations of temporal databases
verfasst von
Antoon Bronselaer
Christophe Billiet
Robin De Mol
Joachim Nielandt
Guy De Tré
Publikationsdatum
19.12.2018
Verlag
Springer Berlin Heidelberg
Erschienen in
The VLDB Journal / Ausgabe 4/2019
Print ISSN: 1066-8888
Elektronische ISSN: 0949-877X
DOI
https://doi.org/10.1007/s00778-018-0535-4

Weitere Artikel der Ausgabe 4/2019

The VLDB Journal 4/2019 Zur Ausgabe

Premium Partner