ABSTRACT
In this paper, we introduce U-MAP, a new system for schema mapping generation. U-MAP builds upon and extends existing schema mapping techniques. However, it mitigates some key problems in this area, which have not been previously addressed. The key tenet of U-MAP is to exploit the usage information extracted from the query logs associated with the schemas being mapped. We describe our experience in applying our proposed system to realistic datasets from the retail and life sciences domains. Our results demonstrate the effectiveness and efficiency of U-MAP compared to traditional approaches.
- B. Alexe, L. Chiticariu, R. J. Miller, and W. C. Tan. Muse: Mapping understanding and design by example. In ICDE, 2008. Google ScholarDigital Library
- B. Alexe, W. C. Tan, and Y. Velegrakis. Stbenchmark: towards a benchmark for mapping systems. PVLDB, 1(1):230--244, 2008. Google ScholarDigital Library
- Y. An, A. Borgida, R. J. Miller, and J. Mylopoulos. A semantic approach to discovering schema mapping expressions. In ICDE, 2007.Google ScholarCross Ref
- I. Baxter et al. PIIMS: An Integrated Functional Genomics Platform. Plant Physiology, 143:600--611, 2007.Google Scholar
- P. Bernstein, A. Halevy, and R. Pottinger. A vision for management of complex models. SIGMOD Record, 29(4):55--63, 2000. Google ScholarDigital Library
- P. Bohannon, E. Elnahrawy, W. Fan, and M. Flaster. Putting context into schema matching. In VLDB, 2006. Google ScholarDigital Library
- A. Bonifati et al. Heptox: Marrying xml and hetergeneity in your p2p databases. In VLDB, 2005. Google ScholarDigital Library
- L. Cabibbo. On keys, foreign keys and nullable attributes in relational mapping systems. In EDBT, pages 263--274, 2009. Google ScholarDigital Library
- R. Dhamankar et al. imap: Discovering complex mappings between database schemas. In SIGMOD, 2004. Google ScholarDigital Library
- H. Elmeleegy, M. Ouzzani, and A. Elmagarmid. Usage-based schema matching. In ICDE, 2008. Google ScholarDigital Library
- R. Fagin et al. Clio: Schema mapping creation and data exchange. Conceptual Modeling: Foundations and Applications, pages 198--236, 2009. Google ScholarDigital Library
- R. Fagin, P. G. Kolaitis, and L. Popa. Data exchange: Getting to the core. ACM TODS, 30(1):174--210, 2005. Google ScholarDigital Library
- A. Fuxman, M. A. Hernandez, H. Ho, R. J. Miller, P. Papotti, and L. Popa. Netsed mappings: Schema mapping reloaded. In VLDB, 2006. Google ScholarDigital Library
- G. Gottlob, R. Pichler, and V. Savenkov. Normalization and optimization of schema mappings. PVLDB, 2(1):1102--1113, 2009. Google ScholarDigital Library
- A. Halevy, Z. G. Ives, J. Madhavan, P. Mork, D. Suciu, and I. Tatarinov. The piazza peer data management system. TKDE, 16(7):787--798, 2004. Google ScholarDigital Library
- Z. Kedad and M. Bouzeghoub. Discovering view expressions from a multi-source information system. In CoopIS, 1999. Google ScholarDigital Library
- D. Maier and A. Mendelzon. Testing implications of data dependencies. ACM TODS, 4:455--469, 1979. Google ScholarDigital Library
- B. Marnette, G. Mecca, and P. Papotti. Scalable data exchange with functional dependencies. PVLDB, 3(1):105--116, 2010. Google ScholarDigital Library
- G. Mecca, P. Papotti, and S. Raunich. Core schema mappings. In SIGMOD, 2009. Google ScholarDigital Library
- G. Mecca, P. Papotti, S. Raunich, and M. Buoncristiano. Concise and expressive mappings with spicy. PVLDB, 2(2):1582--1585, 2009. Google ScholarDigital Library
- R. J. Miller, L. M. Haas, and M. A. Hernández. Schema mapping as query discovery. In VLDB, 2000. Google ScholarDigital Library
- T. Milo and S. Zohar. Using schema matching to simplify heterogeneous data translation. In VLDB, 1998. Google ScholarDigital Library
- L. Popa, Y. Velegrakis, R. J. Miller, M. A. Hernandez, and R. Fagin. Translating web data. In VLDB, 2002. Google ScholarDigital Library
- E. Rahm and P. A. Bernstein. A survey of approaches to automatic schema matching. VLDB J., 10(4):334--350, 2001. Google ScholarDigital Library
- M. Stonebraker. Implementation of integrity constraints and views by query modification. In SIGMOD, 1975. Google ScholarDigital Library
- B. ten Cate et al. Laconic schema mappings: Computing the core with sql queries. PVLDB, 2(1):1006--1017, 2009. Google ScholarDigital Library
- The TPC-W benchmark. http://tpc.org/tpcw.Google Scholar
- http://www.ece.wisc.edu/ pharm/tpcw.shtml.Google Scholar
- C. Yu and L. Popa. Constraint-based xml rewriting for data integration. In SIGMOD, 2004. Google ScholarDigital Library
- C. Yu and L. Popa. Semantic adaptation of schema mappings when schemas evolve. In VLDB, 2005. Google ScholarDigital Library
Index Terms
- Leveraging query logs for schema mapping generation in U-MAP
Recommendations
Structural characterizations of schema-mapping languages
ICDT '09: Proceedings of the 12th International Conference on Database TheorySchema mappings are declarative specifications that describe the relationship between two database schemas. In recent years, there has been an extensive study of schema mappings and of their applications to several different data inter-operability tasks,...
Towards a theory of schema-mapping optimization
PODS '08: Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systemsA schema mapping is a high-level specification that describes the relationship between two database schemas. As schema mappings constitute the essential building blocks of data exchange and data integration, an extensive investigation of the foundations ...
Quasi-inverses of schema mappings
Schema mappings are high-level specifications that describe the relationship between two database schemas. Two operators on schema mappings, namely the composition operator and the inverse operator, are regarded as especially important. Progress on the ...
Comments