Skip to main content
Top
Published in: The VLDB Journal 4/2018

12-06-2018 | Regular Paper

Multi-schema-version data management: data independence in the twenty-first century

Authors: Kai Herrmann, Hannes Voigt, Torben Bach Pedersen, Wolfgang Lehner

Published in: The VLDB Journal | Issue 4/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Agile software development allows us to continuously evolve and run a software system. However, this is not possible in databases, as established methods are very expensive, error-prone, and far from agile. We present InVerDa, a multi-schema-version database management system (MSVDB) for agile database development. MSVDBs realize co-existing schema versions within one database, where each schema version behaves like a regular single-schema database and write operations are propagated between schema versions. Developers use a relationally complete and bidirectional database evolution language (BiDEL) to easily evolve existing schema versions to new ones. BiDEL scripts are more robust, orders of magnitude shorter, and cause only a small performance overhead compared to handwritten SQL scripts. We formally guarantee data independence: no matter how the data of the co-existing schema versions is physically materialized, each schema version is guaranteed to behave like a regular database. Since, the chosen physical materialization significantly determines the overall performance, we equip database administrators with an advisor that proposes an optimized materialization for the current workload, which can improve the performance by orders of magnitude compared to naïve solutions. To our best knowledge, we are the first to facilitate agile evolution of production databases with full support of co-existing schema versions and formally guaranteed data independence.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Online demo available at www.​inverda.​de.
 
Literature
1.
go back to reference Agrawal, S., Chaudhuri, S., Narasayya, V.R.: Automated selection of materialized views and indexes in SQL databases. In: Proceedings of the 26th International Conference on Very Large Data Bases, 10–14 September 2000, pp. 496–505. Morgan Kaufmann Publishers Inc, San Francisco (2000) Agrawal, S., Chaudhuri, S., Narasayya, V.R.: Automated selection of materialized views and indexes in SQL databases. In: Proceedings of the 26th International Conference on Very Large Data Bases, 10–14 September 2000, pp. 496–505. Morgan Kaufmann Publishers Inc, San Francisco (2000)
2.
go back to reference Agrawal, S., Chaudhuri, S., Narasayya, V.: Materialized view and index selection tool for Microsoft SQL server 2000. ACM SIGMOD Rec. 30(2), 608 (2001)CrossRef Agrawal, S., Chaudhuri, S., Narasayya, V.: Materialized view and index selection tool for Microsoft SQL server 2000. ACM SIGMOD Rec. 30(2), 608 (2001)CrossRef
3.
go back to reference Arora, M., Gosain, A.: Schema evolution for data warehouse: a survey. IJCA 22(5), 6–14 (2011)CrossRef Arora, M., Gosain, A.: Schema evolution for data warehouse: a survey. IJCA 22(5), 6–14 (2011)CrossRef
4.
go back to reference Bentley, P., Corne, D.: Creative Evolutionary Systems. Morgan Kaufmann, San Francisco (2002) Bentley, P., Corne, D.: Creative Evolutionary Systems. Morgan Kaufmann, San Francisco (2002)
6.
go back to reference Brodie, M.L., Liu, J.T.: The power and limits of relational technology in the age of information ecosystems. In: On the Move to Meaningful Internet Systems (OTM), Lecture Notes in Computer Science. Springer, Berlin (2010) Brodie, M.L., Liu, J.T.: The power and limits of relational technology in the age of information ecosystems. In: On the Move to Meaningful Internet Systems (OTM), Lecture Notes in Computer Science. Springer, Berlin (2010)
7.
go back to reference Curino, C., Moon, H.J., Deutsch, A., Zaniolo, C.: Automating the database schema evolution process. VLDB J. 22(1), 73–98 (2013)CrossRef Curino, C., Moon, H.J., Deutsch, A., Zaniolo, C.: Automating the database schema evolution process. VLDB J. 22(1), 73–98 (2013)CrossRef
8.
go back to reference Curino, C., Moon, H.J., Zaniolo, C.: Graceful database schema evolution: the PRISM workbench. PVLDB 1(1), 761–772 (2008) Curino, C., Moon, H.J., Zaniolo, C.: Graceful database schema evolution: the PRISM workbench. PVLDB 1(1), 761–772 (2008)
9.
go back to reference Curino, C., Moon, H.J., Tanca, L., Zaniolo, C.: Schema evolution in wikipedia-Toward a web information system benchmark. In: Proceedings of the Tenth International Conference on Enterprise Information Systems (ICEIS), Volume DISI, Barcelona, Spain, 12–16 June 2008, pp. 323–332 (2008) Curino, C., Moon, H.J., Tanca, L., Zaniolo, C.: Schema evolution in wikipedia-Toward a web information system benchmark. In: Proceedings of the Tenth International Conference on Enterprise Information Systems (ICEIS), Volume DISI, Barcelona, Spain, 12–16 June 2008, pp. 323–332 (2008)
10.
go back to reference Domínguez, E., Lloret, J., Rubio, Á.L., Zapata, M.A.: MeDEA: a database evolution architecture with traceability. DKE 65(3), 419–441 (2008)CrossRef Domínguez, E., Lloret, J., Rubio, Á.L., Zapata, M.A.: MeDEA: a database evolution architecture with traceability. DKE 65(3), 419–441 (2008)CrossRef
11.
go back to reference Floudas, C.A., Pardalos, P.M., Adjiman, C., Esposito, W.R., Gümüs, Z.H., Harding, S.T., Klepeis, J.L., Meyer, C.A., Schweiger, C.A.: Handbook of Test Problems in Local and Global Optimization. Nonconvex Optimization and Its Applications. Springer, New York (2013)MATH Floudas, C.A., Pardalos, P.M., Adjiman, C., Esposito, W.R., Gümüs, Z.H., Harding, S.T., Klepeis, J.L., Meyer, C.A., Schweiger, C.A.: Handbook of Test Problems in Local and Global Optimization. Nonconvex Optimization and Its Applications. Springer, New York (2013)MATH
12.
go back to reference Gupta, A., Mumick, I.S.: Materialized Views: Techniques, Implementations, and Applications. MIT Press, Cambridge (1999) Gupta, A., Mumick, I.S.: Materialized Views: Techniques, Implementations, and Applications. MIT Press, Cambridge (1999)
14.
go back to reference Herrmann, K., Voigt, Hv , Behrend, A., Lehner, W.: CoDEL—a relationally complete language for database evolution. In: ADBIS (2015) Herrmann, K., Voigt, Hv , Behrend, A., Lehner, W.: CoDEL—a relationally complete language for database evolution. In: ADBIS (2015)
15.
go back to reference Herrmann, K., Voigt, H., Behrend, A., Rausch, J., Lehner,W.: Living in parallel realities: Co-existing schema versions with a bidirectional database evolution language. In: Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD), Chicago, 14–19 May 2017, pp. 1101–1116 (2017). https://doi.org/10.1145/3035918.3064046 Herrmann, K., Voigt, H., Behrend, A., Rausch, J., Lehner,W.: Living in parallel realities: Co-existing schema versions with a bidirectional database evolution language. In: Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD), Chicago, 14–19 May 2017, pp. 1101–1116 (2017). https://​doi.​org/​10.​1145/​3035918.​3064046
18.
go back to reference McKinna, J.: Complements witness consistency. In: Bx (2016) McKinna, J.: Complements witness consistency. In: Bx (2016)
19.
go back to reference Mistry, H., Roy, P., Sudarshan, S., Ramamritham, K.: Materialized view selection and maintenance using multi-query optimization. In: Proceedings of the 2001 international conference on Management of data (SIGMOD), Santa Barbara, CA, USA, 21–24 May 2001, pp. 307–318 (2001). https://doi.org/10.1145/375663.375703 Mistry, H., Roy, P., Sudarshan, S., Ramamritham, K.: Materialized view selection and maintenance using multi-query optimization. In: Proceedings of the 2001 international conference on Management of data (SIGMOD), Santa Barbara, CA, USA, 21–24 May 2001, pp. 307–318 (2001). https://​doi.​org/​10.​1145/​375663.​375703
20.
go back to reference Moon, H.J., Curino, C., Ham, M., Zaniolo, C.: PRIMA—archiving and querying historical data with evolving schemas. In: SIGMOD (2009) Moon, H.J., Curino, C., Ham, M., Zaniolo, C.: PRIMA—archiving and querying historical data with evolving schemas. In: SIGMOD (2009)
21.
go back to reference Rahm, E., Bernstein, P.A.: An online bibliography on schema evolution. ACM SIGMOD Rec. 35(4), 30–31 (2006)CrossRef Rahm, E., Bernstein, P.A.: An online bibliography on schema evolution. ACM SIGMOD Rec. 35(4), 30–31 (2006)CrossRef
22.
go back to reference Roddick, J.F.: SQL/SE—a query language extension for databases supporting schema evolution. ACM SIGMOD Rec. 21(2), 10–16 (1992)CrossRef Roddick, J.F.: SQL/SE—a query language extension for databases supporting schema evolution. ACM SIGMOD Rec. 21(2), 10–16 (1992)CrossRef
23.
go back to reference Roddick, J.F.: A survey of schema versioning issues for database systems. Inf. Softw. Technol. 37(7), 383–393 (1995)CrossRef Roddick, J.F.: A survey of schema versioning issues for database systems. Inf. Softw. Technol. 37(7), 383–393 (1995)CrossRef
24.
go back to reference Skoulis, I., Vassiliadis, P., Zarras, A.V.: Open-Source databases: within, outside, or beyond Lehman’s laws of software evolution?. In: Advanced Information Systems Engineering - 26th International Conference, CAiSE 2014, Thessaloniki, Greece, 16–20 June 2014, pp. 379–393 (2014). https://doi.org/10.1007/978-3-319-07881-6_26 Skoulis, I., Vassiliadis, P., Zarras, A.V.: Open-Source databases: within, outside, or beyond Lehman’s laws of software evolution?. In: Advanced Information Systems Engineering - 26th International Conference, CAiSE 2014, Thessaloniki, Greece, 16–20 June 2014, pp. 379–393 (2014). https://​doi.​org/​10.​1007/​978-3-319-07881-6_​26
25.
go back to reference Terwilliger, J.F., Cleve, A., Curino, C.: How clean is your sandbox? - Towards a unified theoretical framework for incremental bidirectional transformations In: Theory and Practice of Model Transformations -5th International Conference, ICMT 2012, Prague, Czech Republic, 28–29 May 2012, pp. 1–23 (2012). https://doi.org/10.1007/978-3-642-30476-7_1 Terwilliger, J.F., Cleve, A., Curino, C.: How clean is your sandbox? - Towards a unified theoretical framework for incremental bidirectional transformations In: Theory and Practice of Model Transformations -5th International Conference, ICMT 2012, Prague, Czech Republic, 28–29 May 2012, pp. 1–23 (2012). https://​doi.​org/​10.​1007/​978-3-642-30476-7_​1
26.
go back to reference Wall, B., Angryk, R.: Minimal data sets vs. synchronized data copies in a schema and data versioning system. In: Proceedings of the 4th workshop on Workshop for Ph.D. students in information and knowledge management (IPKM), Glasgow, United Kingdom, 24–28 October 2011, pp. 67–74 (2011). https://doi.org/10.1145/2065003.2065017 Wall, B., Angryk, R.: Minimal data sets vs. synchronized data copies in a schema and data versioning system. In: Proceedings of the 4th workshop on Workshop for Ph.D. students in information and knowledge management (IPKM), Glasgow, United Kingdom, 24–28 October 2011, pp. 67–74 (2011). https://​doi.​org/​10.​1145/​2065003.​2065017
27.
go back to reference Wentao, W., Chi, Y., Hacígümüş, H., Naughton, J.F.: Towards predicting query execution time for concurrent and dynamic database workloads. PVLDB 6(10), 925–936 (2013) Wentao, W., Chi, Y., Hacígümüş, H., Naughton, J.F.: Towards predicting query execution time for concurrent and dynamic database workloads. PVLDB 6(10), 925–936 (2013)
Metadata
Title
Multi-schema-version data management: data independence in the twenty-first century
Authors
Kai Herrmann
Hannes Voigt
Torben Bach Pedersen
Wolfgang Lehner
Publication date
12-06-2018
Publisher
Springer Berlin Heidelberg
Published in
The VLDB Journal / Issue 4/2018
Print ISSN: 1066-8888
Electronic ISSN: 0949-877X
DOI
https://doi.org/10.1007/s00778-018-0508-7

Other articles of this Issue 4/2018

The VLDB Journal 4/2018 Go to the issue

Premium Partner