ABSTRACT
We address the problem of generating a mediated schema from a set of relational data source schemas and conjunctive queries that specify where those schemas overlap. Unlike past approaches that generate only the mediated schema, our algorithm also generates view definitions, i.e., source-to-mediated schema mappings.
Our main goal is to understand the requirements that a mediated schema and views should satisfy, such as completeness, preservation of overlapping information, normalization, and minimality. We show how these requirements influence the detailed structure of schemas and view definitions that are produced. We introduce a normal form for mediated schemas and view definitions, show how to generate them, and prove that schemas and views in this normal form satisfy our requirements.
The view definitions in our normal form use stylized GLAV mappings, for which query rewriting is easier than for general GLAV mappings. We demonstrate the efficiency of query rewriting in a prototype implementation.
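For readers unfamiliar with the term, the general shape of a GLAV (global-and-local-as-view) mapping can be sketched as follows. This is the standard textbook form, not the stylized form introduced in the paper, and the relation names (Emp, Dept, Works, Manages) are hypothetical, chosen only for illustration.

```latex
% A GLAV mapping relates a conjunctive query over the source schemas
% to a conjunctive query over the mediated schema:
\[
  \forall \bar{x}\;\bigl(\exists \bar{y}\;\phi_S(\bar{x},\bar{y})
    \;\rightarrow\; \exists \bar{z}\;\psi_M(\bar{x},\bar{z})\bigr)
\]
% Hypothetical instance: sources Emp(name, dept) and Dept(dept, mgr);
% mediated relations Works(name, dept) and Manages(mgr, dept).
\[
  \forall n, m\;\bigl(\exists d\;(\mathit{Emp}(n,d) \wedge \mathit{Dept}(d,m))
    \;\rightarrow\; \exists d'\;(\mathit{Works}(n,d') \wedge \mathit{Manages}(m,d'))\bigr)
\]
```

When the left-hand side is a single source relation, the mapping reduces to a LAV (local-as-view) mapping; when the right-hand side is a single mediated relation, it reduces to a GAV (global-as-view) mapping.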
Index Terms
- Schema merging and mapping creation for relational sources