DOI: 10.1145/1989323.1989337
research-article

Leveraging query logs for schema mapping generation in U-MAP

Published: 12 June 2011

ABSTRACT

In this paper, we introduce U-MAP, a new system for schema mapping generation. U-MAP builds upon and extends existing schema mapping techniques. However, it mitigates some key problems in this area, which have not been previously addressed. The key tenet of U-MAP is to exploit the usage information extracted from the query logs associated with the schemas being mapped. We describe our experience in applying our proposed system to realistic datasets from the retail and life sciences domains. Our results demonstrate the effectiveness and efficiency of U-MAP compared to traditional approaches.

        Published in

        SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
        June 2011
        1364 pages
        ISBN: 9781450306614
        DOI: 10.1145/1989323

        Copyright © 2011 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Acceptance Rates

        Overall Acceptance Rate: 785 of 4,003 submissions, 20%
