Abstract
Supporting legacy applications when the database schema evolves represents a long-standing challenge of practical and theoretical importance. Recent work has produced algorithms and systems that automate the process of data migration and query adaptation; however, the problems of evolving integrity constraints and supporting legacy updates under schema and integrity constraint evolution are significantly more difficult and have thus far remained unsolved. In this paper, we address this issue by introducing a formal evolution model for the database schema structure and its integrity constraints, and using it to derive update mapping techniques akin to the rewriting techniques used for queries. Thus, we (i) propose a new set of Integrity Constraints Modification Operators (ICMOs), (ii) characterize the impact of structural schema changes on integrity constraints, (iii) devise representations that enable the rewriting of updates, and (iv) develop a unified approach for query and update rewriting under constraints. We then describe the implementation of these techniques provided by our PRISM++ system. The effectiveness of PRISM++ and its enabling technology has been verified on a testbed containing evolution histories of several scientific databases and web information systems, including the Genetic DB Ensembl (410+ schema versions in 9 years) and Wikipedia (240+ schema versions in 6 years).
Index Terms
- Update rewriting and integrity constraint maintenance in a schema evolution support system: PRISM++