Skip to main content

2011 | OriginalPaper | Buchkapitel

5. Discovery and Correctness of Schema Mapping Transformations

verfasst von : Angela Bonifati, Giansalvatore Mecca, Paolo Papotti, Yannis Velegrakis

Erschienen in: Schema Matching and Mapping

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Schema mapping is becoming pervasive in all data transformation, exchange, and integration tasks. It brings to the surface the problem of differences and mismatches between heterogeneous formats and models, respectively, used in source and target databases to be mapped one to another. In this chapter, we start by describing the problem of schema mapping, its background, and technical implications. Then, we outline the early schema mapping systems, along with the new generation of schema mapping tools. Moving from the former to the latter entailed a dramatic change in the performance of mapping generation algorithms. Finally, we conclude the chapter by revisiting the query answering techniques allowed by the mappings, and by discussing useful applications and future and current developments of schema mapping tools.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
We refer to naive chase rather than to the standard chase used in Fagin et al. (2005a), since the naive chase is much simpler and rather straightforward to implement in SQL. Such chase is sometimes calles oblivious chase, e.g., in Marnette (2009).
 
2
While Skolem terms are usually nested, for the sake of simplicity here we only consider flat terms.
 
3
We do not distinguish here between Σ st and Σ t and consider Σ as a set of generic constraints.
 
Literatur
.
Zurück zum Zitat Abiteboul S, Duschka OM (1998) Complexity of answering queries using materialized views. In: PODS. ACM, NY, pp 254–263 Abiteboul S, Duschka OM (1998) Complexity of answering queries using materialized views. In: PODS. ACM, NY, pp 254–263
.
Zurück zum Zitat Abiteboul S, Cluet S, Milo T (1997) Correspondence and translation for heterogeneous data. In: ICDT, Delphi, Greece. Springer, London, pp 351–363 Abiteboul S, Cluet S, Milo T (1997) Correspondence and translation for heterogeneous data. In: ICDT, Delphi, Greece. Springer, London, pp 351–363
.
Zurück zum Zitat Abu-Hamdeh R, Cordy J, Martin T (1994) Schema translation using structural transformation. In: CASCON. IBM Press, pp 202–215 Abu-Hamdeh R, Cordy J, Martin T (1994) Schema translation using structural transformation. In: CASCON. IBM Press, pp 202–215
.
Zurück zum Zitat Amano S, Libkin L, Murlak F (2009) XML schema mappings. In: PODS. ACM, NY, pp 33–42 Amano S, Libkin L, Murlak F (2009) XML schema mappings. In: PODS. ACM, NY, pp 33–42
.
Zurück zum Zitat An Y, Borgida A, Miller R, Mylopoulos J (2007) In: Proceedings of the 23rd International Conference on Data Engineering, ICDE 2007, April 15–20, 2007, The Marmara Hotel, Istanbul, Turkey An Y, Borgida A, Miller R, Mylopoulos J (2007) In: Proceedings of the 23rd International Conference on Data Engineering, ICDE 2007, April 15–20, 2007, The Marmara Hotel, Istanbul, Turkey
.
Zurück zum Zitat Atzeni P, Torlone R (1995) Schema translation between heterogeneous data models in a lattice framework. In: Data semantics conference. Chapman & Hall, London, pp 345–364 Atzeni P, Torlone R (1995) Schema translation between heterogeneous data models in a lattice framework. In: Data semantics conference. Chapman & Hall, London, pp 345–364
.
Zurück zum Zitat Atzeni P, Torlone R (1997) MDM: A multiple-data model tool for the management of heterogeneous database schemes. In: SIGMOD. ACM, NY, pp 528–531 Atzeni P, Torlone R (1997) MDM: A multiple-data model tool for the management of heterogeneous database schemes. In: SIGMOD. ACM, NY, pp 528–531
.
Zurück zum Zitat Beeri C, Milo T (1999) Schemas for intergration and translation of structured and semi-structured data. In: ICDT. Springer, London, pp 296–313 Beeri C, Milo T (1999) Schemas for intergration and translation of structured and semi-structured data. In: ICDT. Springer, London, pp 296–313
.
Zurück zum Zitat Bonifati A, Chang EQ, Ho T, Lakshmanan L, Pottinger R (2005) HePToX: Marrying XML and heterogeneity in your P2P databases. In: VLDB. VLDB Endowment, pp 1267–1270 Bonifati A, Chang EQ, Ho T, Lakshmanan L, Pottinger R (2005) HePToX: Marrying XML and heterogeneity in your P2P databases. In: VLDB. VLDB Endowment, pp 1267–1270
.
Zurück zum Zitat Bonifati A, Mecca G, Pappalardo A, Raunich S, Summa G (2008) Schema mapping verification: The spicy way. In: EDBT. ACM, NY, pp 85–96 Bonifati A, Mecca G, Pappalardo A, Raunich S, Summa G (2008) Schema mapping verification: The spicy way. In: EDBT. ACM, NY, pp 85–96
.
Zurück zum Zitat Bonifati A, Chang EQ, Ho T, Lakshmanan L, Pottinger R, Chung Y (2010) Schema mapping and query translation in heterogeneous P2P XML databases. VLDB J 19(2):231–256CrossRef Bonifati A, Chang EQ, Ho T, Lakshmanan L, Pottinger R, Chung Y (2010) Schema mapping and query translation in heterogeneous P2P XML databases. VLDB J 19(2):231–256CrossRef
.
Zurück zum Zitat Cabibbo L (2009) On keys, foreign keys and nullable attributes in relational mapping systems. In: EDBT. ACM, NY, pp 263–274 Cabibbo L (2009) On keys, foreign keys and nullable attributes in relational mapping systems. In: EDBT. ACM, NY, pp 263–274
.
Zurück zum Zitat Calì A, Gottlob G, Lukasiewicz T (2009a) Datalog ± : A unified approach to ontologies and integrity constraints. In: ICDT. ACM, NY, pp 14–30 Calì A, Gottlob G, Lukasiewicz T (2009a) Datalog ± : A unified approach to ontologies and integrity constraints. In: ICDT. ACM, NY, pp 14–30
.
Zurück zum Zitat Calì A, Gottlob G, Lukasiewicz T (2009b) A general datalog-based framework for tractable query answering over ontologies. In: PODS. ACM, NY, pp 77–86 Calì A, Gottlob G, Lukasiewicz T (2009b) A general datalog-based framework for tractable query answering over ontologies. In: PODS. ACM, NY, pp 77–86
.
Zurück zum Zitat Calvanese D, De Giacomo G, Lenzerini M, Rosati R (2004) Logical foundations of peer-to-peer data integration. In: ACM PODS. ACM, NY, pp 241–251 Calvanese D, De Giacomo G, Lenzerini M, Rosati R (2004) Logical foundations of peer-to-peer data integration. In: ACM PODS. ACM, NY, pp 241–251
.
Zurück zum Zitat Chandra AK, Merlin PM (1977) Optimal implementation of conjunctive queries in relational data bases. In: STOC. ACM, NY, pp 77–90 Chandra AK, Merlin PM (1977) Optimal implementation of conjunctive queries in relational data bases. In: STOC. ACM, NY, pp 77–90
.
Zurück zum Zitat Chiticariu L (2005) Computing the core in data exchange: Algorithmic issues. MS Project Report, unpublished manuscript Chiticariu L (2005) Computing the core in data exchange: Algorithmic issues. MS Project Report, unpublished manuscript
.
Zurück zum Zitat Cluet S, Delobel C, Siméon J, Smaga K (1998) Your mediators need data conversion! In: SIGMOD. ACM, NY, pp 177–188 Cluet S, Delobel C, Siméon J, Smaga K (1998) Your mediators need data conversion! In: SIGMOD. ACM, NY, pp 177–188
.
Zurück zum Zitat Davidson S, Kosky A (1997) IEEE Computer Society. In: Proceedings of the Thirteenth International Conference on Data Engineering, April 7–11, 1997 Birmingham UK Davidson S, Kosky A (1997) IEEE Computer Society. In: Proceedings of the Thirteenth International Conference on Data Engineering, April 7–11, 1997 Birmingham UK
.
Zurück zum Zitat Deutsch A, Popa L, Tannen V (1999) Physical data independence, constraints, and optimization with universal plans. In: VLDB. Morgan Kaufmann, CA, pp 459–470 Deutsch A, Popa L, Tannen V (1999) Physical data independence, constraints, and optimization with universal plans. In: VLDB. Morgan Kaufmann, CA, pp 459–470
.
Zurück zum Zitat Fagin R (2007) Inverting schema mappings. ACM TODS 32(4) Fagin R (2007) Inverting schema mappings. ACM TODS 32(4)
.
Zurück zum Zitat Fagin R, Kolaitis P, Nash A, Popa L (2008) Towards a theory of schema-mapping optimization. In: ACM PODS. ACM, NY, pp 33–42 Fagin R, Kolaitis P, Nash A, Popa L (2008) Towards a theory of schema-mapping optimization. In: ACM PODS. ACM, NY, pp 33–42
.
Zurück zum Zitat Fagin R, Haas LM, Hernandez M, Miller RJ, Popa L, Velegrakis Y (2009) Clio: Schema mapping creation and data exchange. In: Borgida A, Chaudhri V, Giorgini P, Yu E (eds) Conceptual modeling: Foundations and applications. Springer, Heidelberg, pp 198–236CrossRef Fagin R, Haas LM, Hernandez M, Miller RJ, Popa L, Velegrakis Y (2009) Clio: Schema mapping creation and data exchange. In: Borgida A, Chaudhri V, Giorgini P, Yu E (eds) Conceptual modeling: Foundations and applications. Springer, Heidelberg, pp 198–236CrossRef
.
Zurück zum Zitat Fuxman A, Hernández MA, Howard CT, Miller RJ, Papotti P, Popa L (2006) Nested mappings: Schema mapping reloaded. In: VLDB. VLDB Endowment, pp 67–78 Fuxman A, Hernández MA, Howard CT, Miller RJ, Papotti P, Popa L (2006) Nested mappings: Schema mapping reloaded. In: VLDB. VLDB Endowment, pp 67–78
.
Zurück zum Zitat Gottlob G, Pichler R, Savenkov V (2009) Normalization and optimization of schema mappings. PVLDB 2(1):1102–1113 Gottlob G, Pichler R, Savenkov V (2009) Normalization and optimization of schema mappings. PVLDB 2(1):1102–1113
.
Zurück zum Zitat Haas LM (2007) Lecture Notes in Computer Science, vol. 4353. In: ICDT, Springer. Haas LM (2007) Lecture Notes in Computer Science, vol. 4353. In: ICDT, Springer.
.
Zurück zum Zitat Halevy AY (2010) Technical perspective – schema mappings: Rules for mixing data. Commun CACM 53(1):100CrossRef Halevy AY (2010) Technical perspective – schema mappings: Rules for mixing data. Commun CACM 53(1):100CrossRef
.
Zurück zum Zitat Hernández MA, Papotti P, Tan WC (2008) Data exchange with data-metadata translations. PVLDB 1(1):260–273 Hernández MA, Papotti P, Tan WC (2008) Data exchange with data-metadata translations. PVLDB 1(1):260–273
.
Zurück zum Zitat Hull R, Yoshikawa M (1990) ILOG: Declarative creation and manipulation of object identifiers. In: VLDB. Morgan Kaufmann, CA, pp 455–468 Hull R, Yoshikawa M (1990) ILOG: Declarative creation and manipulation of object identifiers. In: VLDB. Morgan Kaufmann, CA, pp 455–468
.
Zurück zum Zitat Ives ZG, Halevy AY, Mork P, Tatarinov I (2004) Piazza: Mediation and integration infrastructure for semantic web data. J Web Sem 1(2):155–175CrossRef Ives ZG, Halevy AY, Mork P, Tatarinov I (2004) Piazza: Mediation and integration infrastructure for semantic web data. J Web Sem 1(2):155–175CrossRef
.
Zurück zum Zitat Ives ZG, Green TJ, Karvounarakis G, Taylor NE, Tannen V, Talukdar PP, Jacob M, Pereira F(2008) The orchestra collaborative data sharing system. SIGMOD Rec 37(3):26–32CrossRef Ives ZG, Green TJ, Karvounarakis G, Taylor NE, Tannen V, Talukdar PP, Jacob M, Pereira F(2008) The orchestra collaborative data sharing system. SIGMOD Rec 37(3):26–32CrossRef
.
Zurück zum Zitat Jiang H, Ho H, Popa L, Han W (2007) Mapping-driven XML transformation. In: WWW conference. ACM, NY, pp 1063–1072 Jiang H, Ho H, Popa L, Han W (2007) Mapping-driven XML transformation. In: WWW conference. ACM, NY, pp 1063–1072
.
Zurück zum Zitat Levy AY, Mendelzon A, Sagiv Y, Srivastava D (1995) Proceedings of the fourteenth ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems. ACM Press, San Jose, California, May 22–25, 1995 Levy AY, Mendelzon A, Sagiv Y, Srivastava D (1995) Proceedings of the fourteenth ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems. ACM Press, San Jose, California, May 22–25, 1995
.
Zurück zum Zitat Maier D, Mendelzon AO, Sagiv Y (1979) Testing implications of data dependencies. ACM TODS 4(4):455–469CrossRef Maier D, Mendelzon AO, Sagiv Y (1979) Testing implications of data dependencies. ACM TODS 4(4):455–469CrossRef
.
Zurück zum Zitat Marnette B (2009) Generalized schema mappings: From termination to tractability. In: ACM PODS. ACM, NY, pp 13–22 Marnette B (2009) Generalized schema mappings: From termination to tractability. In: ACM PODS. ACM, NY, pp 13–22
.
Zurück zum Zitat Marnette B, Mecca G, Papotti P (2010) Scalable data exchange with functional dependencies. PVLDB 3(1):106–116 Marnette B, Mecca G, Papotti P (2010) Scalable data exchange with functional dependencies. PVLDB 3(1):106–116
.
Zurück zum Zitat Mecca G, Papotti P, Raunich S (2009a) Core schema mappings. In: SIGMOD. ACM, NY, pp 655–668 Mecca G, Papotti P, Raunich S (2009a) Core schema mappings. In: SIGMOD. ACM, NY, pp 655–668
.
Zurück zum Zitat Mecca G, Papotti P, Raunich S, Buoncristiano M (2009b) Concise and expressive mappings with + Spicy. PVLDB 2(2):1582–1585 Mecca G, Papotti P, Raunich S, Buoncristiano M (2009b) Concise and expressive mappings with + Spicy. PVLDB 2(2):1582–1585
.
Zurück zum Zitat Melnik S, Bernstein P, Halevy A, Rahm E (2005) Supporting executable mappings in model management. In: SIGMOD. ACM, NY, pp 167–178 Melnik S, Bernstein P, Halevy A, Rahm E (2005) Supporting executable mappings in model management. In: SIGMOD. ACM, NY, pp 167–178
.
Zurück zum Zitat Miller RJ, Haas LM, Hernandez MA (2000) Schema mapping as query discovery. In: VLDB. Morgan Kaufmann, CA, pp 77–99 Miller RJ, Haas LM, Hernandez MA (2000) Schema mapping as query discovery. In: VLDB. Morgan Kaufmann, CA, pp 77–99
.
Zurück zum Zitat Milo T, Zohar S (1998) Using schema matching to simplify heterogeneous data translation. In: VLDB. Morgan Kaufmann, CA, pp 122–133 Milo T, Zohar S (1998) Using schema matching to simplify heterogeneous data translation. In: VLDB. Morgan Kaufmann, CA, pp 122–133
.
Zurück zum Zitat Popa L (2000) Object/relational query optimization with chase and backchase. PhD thesis, University of Pennsylvania Popa L (2000) Object/relational query optimization with chase and backchase. PhD thesis, University of Pennsylvania
.
Zurück zum Zitat Popa L, Tannen V (1999) An equational chase for path-conjunctive queries, constraints, and views. In: ICDT. Springer, London, pp 39–57 Popa L, Tannen V (1999) An equational chase for path-conjunctive queries, constraints, and views. In: ICDT. Springer, London, pp 39–57
.
Zurück zum Zitat Popa L, Velegrakis Y, Miller RJ, Hernandez MA, Fagin R (2002) Translating web data. In: VLDB. VLDB Endowment, pp 598–609 Popa L, Velegrakis Y, Miller RJ, Hernandez MA, Fagin R (2002) Translating web data. In: VLDB. VLDB Endowment, pp 598–609
.
Zurück zum Zitat Pottinger R, Halevy A (2001) Minicon: A scalable algorithm for answering queries using views. VLDB J 10(2–3):182–198MATH Pottinger R, Halevy A (2001) Minicon: A scalable algorithm for answering queries using views. VLDB J 10(2–3):182–198MATH
.
Zurück zum Zitat Raffio A, Braga D, Ceri S, Papotti P, Hernández MA (2008) Clip: A visual language for explicit schema mappings. In: ICDE. IEEE Computer Society, Washington, DC, pp 30–39 Raffio A, Braga D, Ceri S, Papotti P, Hernández MA (2008) Clip: A visual language for explicit schema mappings. In: ICDE. IEEE Computer Society, Washington, DC, pp 30–39
.
Zurück zum Zitat Rahm E, Bernstein PA (2001) A survey of approaches to automatic schema matching. VLDB J 10:334–350MATHCrossRef Rahm E, Bernstein PA (2001) A survey of approaches to automatic schema matching. VLDB J 10:334–350MATHCrossRef
.
Zurück zum Zitat Savenkov V, Pichler R (2008) Towards practical feasibility of core computation in data exchange. In: LPAR. Springer, Heidelberg, pp 62–78 Savenkov V, Pichler R (2008) Towards practical feasibility of core computation in data exchange. In: LPAR. Springer, Heidelberg, pp 62–78
.
Zurück zum Zitat Shu NC, Housel BC, Taylor RW, Ghosh SP, Lum VY (1977) EXPRESS: A data extraction, processing and restructuring system. ACM TODS 2(2):134–174CrossRef Shu NC, Housel BC, Taylor RW, Ghosh SP, Lum VY (1977) EXPRESS: A data extraction, processing and restructuring system. ACM TODS 2(2):134–174CrossRef
.
Zurück zum Zitat ten Cate B, Kolaitis PG (2009) Structural characterizations of schema-mapping languages. In: ICDT. ACM, NY, pp 63–72 ten Cate B, Kolaitis PG (2009) Structural characterizations of schema-mapping languages. In: ICDT. ACM, NY, pp 63–72
.
Zurück zum Zitat ten Cate B, Chiticariu L, Kolaitis P, Tan WC (2009) Laconic schema mappings: Computing core universal solutions by means of SQL queries. PVLDB 2(1):1006–1017 ten Cate B, Chiticariu L, Kolaitis P, Tan WC (2009) Laconic schema mappings: Computing core universal solutions by means of SQL queries. PVLDB 2(1):1006–1017
.
Zurück zum Zitat Tork-Roth M, Schwarz PM (1997) Don’t scrap it, wrap it! A wrapper architecture for legacy data sources. In: VLDB. Morgan Kaufmann, CA, pp 266–275 Tork-Roth M, Schwarz PM (1997) Don’t scrap it, wrap it! A wrapper architecture for legacy data sources. In: VLDB. Morgan Kaufmann, CA, pp 266–275
.
Zurück zum Zitat Velegrakis Y (2005) Managing schema mappings in highly heterogeneous environments. PhD thesis, University of Toronto Velegrakis Y (2005) Managing schema mappings in highly heterogeneous environments. PhD thesis, University of Toronto
.
Zurück zum Zitat Yu C, Popa L (2004) Constraint-based XML query rewriting for data integration. In: SIGMOD conference. ACM, NY, pp 371–382 Yu C, Popa L (2004) Constraint-based XML query rewriting for data integration. In: SIGMOD conference. ACM, NY, pp 371–382
Metadaten
Titel
Discovery and Correctness of Schema Mapping Transformations
verfasst von
Angela Bonifati
Giansalvatore Mecca
Paolo Papotti
Yannis Velegrakis
Copyright-Jahr
2011
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-642-16518-4_5