skip to main content
10.1145/1353343.1353357acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article
Free Access

Schema merging and mapping creation for relational sources

Published:25 March 2008Publication History

ABSTRACT

We address the problem of generating a mediated schema from a set of relational data source schemas and conjunctive queries that specify where those schemas overlap. Unlike past approaches that generate only the mediated schema, our algorithm also generates view definitions, i.e., source-to-mediated schema mappings.

Our main goal is to understand the requirements that a mediated schema and views should satisfy, such as completeness, preservation of overlapping information, normalization, and minimality. We show how these requirements influence the detailed structure of schemas and view definitions that are produced. We introduce a normal form for mediated schemas and view definitions, show how to generate them, and prove that schemas and views in this normal form satisfy our requirements.

The view definitions in our normal form use stylized GLAV mappings, for which query rewriting is easier than general GLAV mappings. We demonstrate the efficiency of query rewriting in a prototype implementation.

References

  1. C. Batini, M. Lenzerini, and S. B. Navathe, "A Comparative Analysis of Methodologies for Database Schema Integration," ACM Computing Surveys 18(4), pp. 323--364, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. P. A. Bernstein, "Applying Model Management to Classical Meta Data Problems," CIDR 2003, pp. 209--220.Google ScholarGoogle Scholar
  3. J. Biskup and B. Convent, "A formal view integration method," SIGMOD 1986, pp. 398--407. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. Buneman, S. B. Davidson, and A. Kosky, "Theoretical Aspects of Schema Merging," EDBT 1992, pp. 152--167. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Cali, D. Calvanese, G. D. Giacomo, and M. Lenzerini, "On the Expressive Power of Data Integration Systems," ER 2002, pp. 338--350. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. A. Cassanova and V. M. P. Vidal, "Towards a Sound View Integration Methodology," PODS 1983, pp. 36--47. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Gubanov, P. A. Bernstein, M. Moshchuk, "Model Management Engine for Data Integration with Reverse-Engineering Support," ICDE 2008, to appear. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Y. Halevy, "Answering Queries Using Views: A Survey," VLDB J. 10(4), pp. 270--294, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. Hull, "Relative Information Capacity of Simple Relational Database Schemata," SIAM J. Comput. 15(3): 856--886, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Illinois Semantic Integration Archive. http://pages.cs.wisc.edu/~anhai/wisc-si-archive/.Google ScholarGoogle Scholar
  11. M. Lenzerini, "Data Integration: A Theoretical Perspective," PODS 2002, pp. 233--246. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Y. Levy, A. O. Mendelzon, Y. Sagiv, and D. Srivastava, "Answering Queries Using Views," PODS 1995, pp. 95--104. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Y. Levy, A. Rajaraman, and J. J. Ordille, "Querying Heterogeneous Information Sources Using Source Descriptions," VLDB 1996, pp. 251--262. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. P. McBrien, A. Poulovassilis, "Data Integration by Bi-Directional Schema Transformation Rules," ICDE 2003, 227--238.Google ScholarGoogle Scholar
  15. S. Melnik, P. A. Bernstein, A. Halevy, and E. Rahm, "Supporting Executable Mappings in Model Management," SIGMOD 2005, pp. 167--178. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Melnik, E. Rahm, and P. A. Bernstein, "Rondo: A Programming Platform for Generic Model Management," SIGMOD 2003, pp. 193--204. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. R. J. Miller, Y. E. Ioannidis, and R. Ramakrishnan, "The Use of Information Capacity in Schema Integration and Translation," VLDB 1993, pp. 120--133. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. R. Pottinger, Processing Queries and Merging Schemas in Support of Data Integration, PhD thesis, Univ. of Washington, 2004, http://www.cs.ubc.ca/~rap/publications/thesis.pdf Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. Pottinger and A. Levy, "A Scalable Algorithm for Answering Queries Using Views," VLDB 2000, pp. 484--495. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R. A. Pottinger and P. A. Bernstein, "Merging Models Based on Given Correspondences," VLDB 2003, pp. 862--873. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. A. Pottinger and A. Y. Halevy, "MiniCon: A scalable algorithm for answering queries using views," VLDB J.10 (2--3), pp. 182--198, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. S. Spaccapietra and C. Parent, "View Integration: A Step Forward in Solving Structural Conflicts," TKDE 6(2), pp. 258--274, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Schema merging and mapping creation for relational sources

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            EDBT '08: Proceedings of the 11th international conference on Extending database technology: Advances in database technology
            March 2008
            762 pages
            ISBN:9781595939265
            DOI:10.1145/1353343

            Copyright © 2008 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 25 March 2008

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate7of10submissions,70%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader