Skip to main content

2011 | OriginalPaper | Buchkapitel

14. Data Integration

verfasst von : Sonia Bergamaschi, Domenico Beneventano, Francesco Guerra, Mirko Orsini

Erschienen in: Handbook of Conceptual Modeling

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Given the many data integration approaches, a complete and exhaustive comparison of all the research activities is not possible. In this chapter we will present an overview of the most relevant research activities and ideas in the field investigated in the last 20 years. We will also introduce the MOMIS system, a framework to perform information extraction and integration from both structured and semistructured data sources, that is one of the most interesting results of our research activity. An open source version of the MOMIS system was delivered by the academic startup DataRiver (www.datariver.it).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
See for example Talend, http://​www.​talend.​com, an open source ETL and data integration system.
 
2
We use classes for including both the object-oriented and relational models.
 
4
www.service-architecture.com/database/articles/odmg_3_0.html.
 
5
A global attribute mapped onto only one source is a particular case of a homogeneous attribute.
 
Literatur
1.
Zurück zum Zitat Abiteboul S, Buneman P, Suciu D (1999) Data on the Web: from relations to semistructured data and XML. Morgan Kaufmann, San Francisco Abiteboul S, Buneman P, Suciu D (1999) Data on the Web: from relations to semistructured data and XML. Morgan Kaufmann, San Francisco
2.
Zurück zum Zitat Ananthakrishna R, Chaudhuri S, Ganti V (2002) Eliminating fuzzy duplicates in data warehouses. In Proceedings of the 28th international conference on Very Large Bases, Hong Kong, China, VLDB Endowment, p 586–597 Ananthakrishna R, Chaudhuri S, Ganti V (2002) Eliminating fuzzy duplicates in data warehouses. In Proceedings of the 28th international conference on Very Large Bases, Hong Kong, China, VLDB Endowment, p 586–597
3.
Zurück zum Zitat Arens Y, Knoblock CA (1993) Sims: retrieving and integrating information from multiple sources. In: Buneman P, Jajodia S (eds) Proceedings of the 1993 ACM SIGMOD international conference on management of data, Washington, DC, 26–28 May 1993. ACM, New York, pp 562–563CrossRef Arens Y, Knoblock CA (1993) Sims: retrieving and integrating information from multiple sources. In: Buneman P, Jajodia S (eds) Proceedings of the 1993 ACM SIGMOD international conference on management of data, Washington, DC, 26–28 May 1993. ACM, New York, pp 562–563CrossRef
4.
Zurück zum Zitat Aumueller D, Do HH, Massmann S, Rahm E (2005) Schema and ontology matching with coma++. In: Özcan F (ed) SIGMOD conference. ACM, New York, pp 906–908 Aumueller D, Do HH, Massmann S, Rahm E (2005) Schema and ontology matching with coma++. In: Özcan F (ed) SIGMOD conference. ACM, New York, pp 906–908
5.
Zurück zum Zitat Batini C, Lenzerini M, Navathe SB (1986) A comparative analysis of methodologies for database schema integration. ACM Comput Surv 18(4):323–364CrossRef Batini C, Lenzerini M, Navathe SB (1986) A comparative analysis of methodologies for database schema integration. ACM Comput Surv 18(4):323–364CrossRef
6.
Zurück zum Zitat Baumgartner R, Flesca S, Gottlob G (2001) Declarative information extraction, web crawling, and recursive wrapping with lixto. In: Eiter T, Faber W, Truszczynski M (eds) LPNMR. Lecture notes in computer science, vol 2173. Springer, Berlin, pp 21–41 Baumgartner R, Flesca S, Gottlob G (2001) Declarative information extraction, web crawling, and recursive wrapping with lixto. In: Eiter T, Faber W, Truszczynski M (eds) LPNMR. Lecture notes in computer science, vol 2173. Springer, Berlin, pp 21–41
7.
Zurück zum Zitat Benassi R, Bergamaschi S, Fergnani A, Miselli D (2004) Extending a lexicon ontology for intelligent information integration. In: Proceedings of the 16th Eureopean conference on artificial intelligence (ECAI’2004), pp 278–282 Benassi R, Bergamaschi S, Fergnani A, Miselli D (2004) Extending a lexicon ontology for intelligent information integration. In: Proceedings of the 16th Eureopean conference on artificial intelligence (ECAI’2004), pp 278–282
8.
Zurück zum Zitat Beneventano D, Bergamaschi S (2007) Semantic search engines based on data integration systems. In: Cardoso J (ed) Semantic Web services: theory, tools and applications. IGI Global, Hershey, pp 317–341CrossRef Beneventano D, Bergamaschi S (2007) Semantic search engines based on data integration systems. In: Cardoso J (ed) Semantic Web services: theory, tools and applications. IGI Global, Hershey, pp 317–341CrossRef
9.
Zurück zum Zitat Beneventano D, Bergamaschi S, Guerra F, Vincini M (2003) Synthesizing an integrated ontology. IEEE Internet Comput 7(5):42–51CrossRef Beneventano D, Bergamaschi S, Guerra F, Vincini M (2003) Synthesizing an integrated ontology. IEEE Internet Comput 7(5):42–51CrossRef
11.
Zurück zum Zitat Beneventano D, Bergamaschi S, Sorrentino S (2009) Extending wordnet with compound nouns for semi-automatic annotation in data integration systems. In: Proceedings of the international conference on natural language processing and knowledge engineering (NLP–KE), 24–27 September 2009, Dalian, China, pp 1–8 Beneventano D, Bergamaschi S, Sorrentino S (2009) Extending wordnet with compound nouns for semi-automatic annotation in data integration systems. In: Proceedings of the international conference on natural language processing and knowledge engineering (NLP–KE), 24–27 September 2009, Dalian, China, pp 1–8
12.
Zurück zum Zitat Beneventano D, Bergamaschi S, Vincini M, Orsini M, Nana RC (2007) Query translation on heterogeneous sources in momis data transformation systems. In: VLDB 3rd international workshop on database interoperability (InterDB 2007) Beneventano D, Bergamaschi S, Vincini M, Orsini M, Nana RC (2007) Query translation on heterogeneous sources in momis data transformation systems. In: VLDB 3rd international workshop on database interoperability (InterDB 2007)
13.
Zurück zum Zitat Beneventano D, Gennaro C, Guerra F (2008) A methodology for building and querying an ontology representing data and multimedia sources. In: ODBIS, pp 37–40 Beneventano D, Gennaro C, Guerra F (2008) A methodology for building and querying an ontology representing data and multimedia sources. In: ODBIS, pp 37–40
14.
Zurück zum Zitat Beneventano D, Guerra F, Maurino A, Palmonari M, Pasi G, Sala A (2009) Unified semantic search of data and services. In: Proceedings of the 3rd international conference on metadata and semantic research (MTSR 2009), Milan, Italy, 1–2 October 2009. Communications in computer and information science, vol 46. Springer, Berlin, pp 95–107 Beneventano D, Guerra F, Maurino A, Palmonari M, Pasi G, Sala A (2009) Unified semantic search of data and services. In: Proceedings of the 3rd international conference on metadata and semantic research (MTSR 2009), Milan, Italy, 1–2 October 2009. Communications in computer and information science, vol 46. Springer, Berlin, pp 95–107
15.
Zurück zum Zitat Beneventano D, Lenzerini M (2005) Final release of the system prototype for query management. Sewasie, deliverable D3.5, Dipartimento di Ingegneria dell’Informazione. http://dbgroup.unimo.it/TechnicalReport/D3.5Final.pdf Beneventano D, Lenzerini M (2005) Final release of the system prototype for query management. Sewasie, deliverable D3.5, Dipartimento di Ingegneria dell’Informazione. http://​dbgroup.​unimo.​it/​TechnicalReport/​D3.​5Final.​pdf
16.
Zurück zum Zitat den Bercken JV, Blohsfeld B, Dittrich JP, Krämer J, Schäfer T, Schneider M, Seeger B (2001) Xxl – a library approach to supporting efficient implementations of advanced database queries. In: Apers PMG, Atzeni P, Ceri S, Paraboschi S, Ramamohanarao K, Snodgrass RT (eds) VLDB, pp 39–48. Morgan Kaufmann, San Francisco den Bercken JV, Blohsfeld B, Dittrich JP, Krämer J, Schäfer T, Schneider M, Seeger B (2001) Xxl – a library approach to supporting efficient implementations of advanced database queries. In: Apers PMG, Atzeni P, Ceri S, Paraboschi S, Ramamohanarao K, Snodgrass RT (eds) VLDB, pp 39–48. Morgan Kaufmann, San Francisco
17.
Zurück zum Zitat Bergamaschi S, Castano S, Vincini M, Beneventano D (2001) Semantic integration of heterogeneous information sources. Data Knowl Eng 36(3):215–249MATHCrossRef Bergamaschi S, Castano S, Vincini M, Beneventano D (2001) Semantic integration of heterogeneous information sources. Data Knowl Eng 36(3):215–249MATHCrossRef
18.
Zurück zum Zitat Bergamaschi S, Maurino A (2009) Toward a unified view of data and services. In: Vossen G, Long DDE, Yu JX (eds) Proceedings of the 10th international conference on Web information systems engineering (WISE 2009), Poznan, Poland, 5–7 October 2009. Lecture notes in computer science, vol 5802. Springer, Berlin, pp 11–12 Bergamaschi S, Maurino A (2009) Toward a unified view of data and services. In: Vossen G, Long DDE, Yu JX (eds) Proceedings of the 10th international conference on Web information systems engineering (WISE 2009), Poznan, Poland, 5–7 October 2009. Lecture notes in computer science, vol 5802. Springer, Berlin, pp 11–12
19.
Zurück zum Zitat Bernstein PA, Melnik S, Petropoulos M, Quix C (2004) Industrial-strength schema matching. SIGMOD Rec 33(4):38–43CrossRef Bernstein PA, Melnik S, Petropoulos M, Quix C (2004) Industrial-strength schema matching. SIGMOD Rec 33(4):38–43CrossRef
20.
Zurück zum Zitat Bertossi LE, Chomicki J (2003) Query answering in inconsistent databases. In: Chomicki J, van der Meyden R, Saake G (eds) Logics for emerging applications of databases. Springer, Berlin, pp 43–83 Bertossi LE, Chomicki J (2003) Query answering in inconsistent databases. In: Chomicki J, van der Meyden R, Saake G (eds) Logics for emerging applications of databases. Springer, Berlin, pp 43–83
21.
Zurück zum Zitat Bleiholder J, Draba K, Naumann F (2007) Fusem – exploring different semantics of data fusion. In: Koch C, Gehrke J, Garofalakis MN, Srivastava D, Aberer K, Deshpande A, Florescu D, Chan CY, Ganti V, Kanne CC, Klas W, Neuhold EJ (eds) VLDB. ACM, New York, pp 1350–1353 Bleiholder J, Draba K, Naumann F (2007) Fusem – exploring different semantics of data fusion. In: Koch C, Gehrke J, Garofalakis MN, Srivastava D, Aberer K, Deshpande A, Florescu D, Chan CY, Ganti V, Kanne CC, Klas W, Neuhold EJ (eds) VLDB. ACM, New York, pp 1350–1353
22.
Zurück zum Zitat Bleiholder J, Naumann F (2008) Data fusion. ACM Comput Surv 41(1):1–41CrossRef Bleiholder J, Naumann F (2008) Data fusion. ACM Comput Surv 41(1):1–41CrossRef
23.
Zurück zum Zitat Bressan S, Goh CH, Levina N, Madnick SE, Shah A, Siegel M (2000) Context knowledge representation and reasoning in the context interchange system. Appl Intell 13(2):165–180CrossRef Bressan S, Goh CH, Levina N, Madnick SE, Shah A, Siegel M (2000) Context knowledge representation and reasoning in the context interchange system. Appl Intell 13(2):165–180CrossRef
24.
Zurück zum Zitat CalÌ A, Calvanese D, Giacomo GD, Lenzerini M (2002) Data integration under integrity constraints. In: Proceedings of the 14th international conference on advanced information systems engineering (CAiSE ’02). Springer, London, pp 262–279 CalÌ A, Calvanese D, Giacomo GD, Lenzerini M (2002) Data integration under integrity constraints. In: Proceedings of the 14th international conference on advanced information systems engineering (CAiSE ’02). Springer, London, pp 262–279
25.
Zurück zum Zitat CalÌ A, Lembo D, Rosati R (2003) Query rewriting and answering under constraints in data integration systems. In: Gottlob G, Walsh T (eds) Proceedings of the international joint conference on artificial intelligence. Morgan Kaufmann, pp 16–21 CalÌ A, Lembo D, Rosati R (2003) Query rewriting and answering under constraints in data integration systems. In: Gottlob G, Walsh T (eds) Proceedings of the international joint conference on artificial intelligence. Morgan Kaufmann, pp 16–21
26.
Zurück zum Zitat Calvanese D, Giacomo GD, Lembo D, Lenzerini M, Rosati R (2004) What to ask to a peer: ontology-based query reformulation. In: Dubois D, Welty CA, Williams MA (eds) Principles of Knowledge Representation and Reasoning. Proceedings of the Nineth International Conference (KR2004), Whistler, Canada, June 2–4 2004, AAAi Press, Menlo Park, pp 469–478 Calvanese D, Giacomo GD, Lembo D, Lenzerini M, Rosati R (2004) What to ask to a peer: ontology-based query reformulation. In: Dubois D, Welty CA, Williams MA (eds) Principles of Knowledge Representation and Reasoning. Proceedings of the Nineth International Conference (KR2004), Whistler, Canada, June 2–4 2004, AAAi Press, Menlo Park, pp 469–478
27.
Zurück zum Zitat Castano S, Ferrara A, Lorusso D, Montanelli S (2008) On the ontology instance matching problem. In: DEXA workshops. IEEE Computer Society, Washington, DC, pp 180–184 Castano S, Ferrara A, Lorusso D, Montanelli S (2008) On the ontology instance matching problem. In: DEXA workshops. IEEE Computer Society, Washington, DC, pp 180–184
28.
Zurück zum Zitat Chaudhuri S, Ganjam K, Ganti V, Motwani R (2003) Robust and efficient fuzzy match for online data cleaning. In: SIGMOD conference, pp 313–324 Chaudhuri S, Ganjam K, Ganti V, Motwani R (2003) Robust and efficient fuzzy match for online data cleaning. In: SIGMOD conference, pp 313–324
29.
Zurück zum Zitat Chen K, Madhavan J, Halevy AY (2009) Exploring schema repositories with schemr. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD 2009), Providence, RI, 29 June–2 July 2009. ACM, New York, pp 1095–1098 Chen K, Madhavan J, Halevy AY (2009) Exploring schema repositories with schemr. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD 2009), Providence, RI, 29 June–2 July 2009. ACM, New York, pp 1095–1098
30.
Zurück zum Zitat Crescenzi V, Mecca G, Merialdo P (2001) Automatic web information extraction in the roadrunner system. In: Arisawa H, Kambayashi Y, Kumar V, Mayr HC, Hunt I (eds) ER (workshops). Lecture notes in computer science, vol 2465. Springer, Berlin, pp 264–277 Crescenzi V, Mecca G, Merialdo P (2001) Automatic web information extraction in the roadrunner system. In: Arisawa H, Kambayashi Y, Kumar V, Mayr HC, Hunt I (eds) ER (workshops). Lecture notes in computer science, vol 2465. Springer, Berlin, pp 264–277
31.
Zurück zum Zitat Euzenat J, Shvaiko P (2007) Ontology matching. Springer, HeidelbergMATH Euzenat J, Shvaiko P (2007) Ontology matching. Springer, HeidelbergMATH
32.
Zurück zum Zitat Fagin R, Haas LM, Hernández MA, Miller RJ, Popa L, Velegrakis Y (2009) Clio: schema mapping creation and data exchange. In: Borgida A, Chaudhri VK, Giorgini P, Yu ESK (eds) Conceptual modeling: foundations and applications. Lecture notes in computer science, vol 5600. Springer, Berlin, pp 198–236CrossRef Fagin R, Haas LM, Hernández MA, Miller RJ, Popa L, Velegrakis Y (2009) Clio: schema mapping creation and data exchange. In: Borgida A, Chaudhri VK, Giorgini P, Yu ESK (eds) Conceptual modeling: foundations and applications. Lecture notes in computer science, vol 5600. Springer, Berlin, pp 198–236CrossRef
33.
Zurück zum Zitat Fagin R, Kolaitis PG, Miller RJ, Popa L (2005) Data exchange: semantics and query answering. Theor Comput Sci 336(1):89–124MathSciNetMATHCrossRef Fagin R, Kolaitis PG, Miller RJ, Popa L (2005) Data exchange: semantics and query answering. Theor Comput Sci 336(1):89–124MathSciNetMATHCrossRef
34.
Zurück zum Zitat Geist I (2004) Index-based keyword search in mediator systems. In: Lindner W, Mesiti M, Türker C, Tzitzikas Y, Vakali A (eds) EDBT workshops. Lecture notes in computer science, vol 3268. Springer, Berlin, pp 24–33 Geist I (2004) Index-based keyword search in mediator systems. In: Lindner W, Mesiti M, Türker C, Tzitzikas Y, Vakali A (eds) EDBT workshops. Lecture notes in computer science, vol 3268. Springer, Berlin, pp 24–33
35.
Zurück zum Zitat Giunchiglia F, Yatskevich M, Shvaiko P (2007) Semantic matching: algorithms and implementation. J Data Semant 9:1–38 Giunchiglia F, Yatskevich M, Shvaiko P (2007) Semantic matching: algorithms and implementation. J Data Semant 9:1–38
36.
Zurück zum Zitat Gottlob G, Koch C, Baumgartner R, Herzog M, Flesca S (2004) The lixto data extraction project – back and forth between theory and practice. In: Deutsch A (ed) PODS. ACM, New York, pp 1–12CrossRef Gottlob G, Koch C, Baumgartner R, Herzog M, Flesca S (2004) The lixto data extraction project – back and forth between theory and practice. In: Deutsch A (ed) PODS. ACM, New York, pp 1–12CrossRef
37.
Zurück zum Zitat Greco G, Greco S, Zumpano E (2003) A logical framework for querying and repairing inconsistent databases. IEEE Trans Knowl Data Eng 15(6):1389–1408CrossRef Greco G, Greco S, Zumpano E (2003) A logical framework for querying and repairing inconsistent databases. IEEE Trans Knowl Data Eng 15(6):1389–1408CrossRef
38.
Zurück zum Zitat Guerra F, Bergamaschi S, Orsini M, Sala A, Sartori C (2009) Keymantic: a keyword-based search engine using structural knowledge. In: Cordeiro J, Filipe J (eds) ICEIS, vol 1, pp 241–246 Guerra F, Bergamaschi S, Orsini M, Sala A, Sartori C (2009) Keymantic: a keyword-based search engine using structural knowledge. In: Cordeiro J, Filipe J (eds) ICEIS, vol 1, pp 241–246
39.
40.
Zurück zum Zitat Halevy AY, Ives ZG, Madhavan J, Mork P, Suciu D, Tatarinov I (2004) The piazza peer data management system. IEEE Trans Knowl Data Eng 16(7):787–798CrossRef Halevy AY, Ives ZG, Madhavan J, Mork P, Suciu D, Tatarinov I (2004) The piazza peer data management system. IEEE Trans Knowl Data Eng 16(7):787–798CrossRef
41.
Zurück zum Zitat Hammer J, Stonebraker M, Topsakal O (2005) Thalia: test harness for the assessment of legacy information integration approaches. In: ICDE, pp 485–486 Hammer J, Stonebraker M, Topsakal O (2005) Thalia: test harness for the assessment of legacy information integration approaches. In: ICDE, pp 485–486
42.
Zurück zum Zitat Heimbigner D, McLeod D (1985) A federated architecture for information management. ACM Trans Inf Syst 3(3):253–278CrossRef Heimbigner D, McLeod D (1985) A federated architecture for information management. ACM Trans Inf Syst 3(3):253–278CrossRef
43.
Zurück zum Zitat Hull R (1997) Managing semantic heterogeneity in databases: a theoretical perspective. In: PODS, pp 51–61 Hull R (1997) Managing semantic heterogeneity in databases: a theoretical perspective. In: PODS, pp 51–61
44.
Zurück zum Zitat Inmon WH (1992) Building the data warehouse. QED Information Sciences, Wellesley Inmon WH (1992) Building the data warehouse. QED Information Sciences, Wellesley
45.
Zurück zum Zitat Klein MCA, Fensel D, Kiryakov A, Ognyanov D (2002) Ontology versioning and change detection on the web. In: Gómez-Pérez A, Benjamins VR (eds) EKAW. Lecture notes in computer science, vol 2473. Springer, Berlin, pp 197–212 Klein MCA, Fensel D, Kiryakov A, Ognyanov D (2002) Ontology versioning and change detection on the web. In: Gómez-Pérez A, Benjamins VR (eds) EKAW. Lecture notes in computer science, vol 2473. Springer, Berlin, pp 197–212
46.
Zurück zum Zitat Köpcke H, Rahm E (2010) Frameworks for entity matching: a comparison. Data Knowl Eng 69(2):197–210CrossRef Köpcke H, Rahm E (2010) Frameworks for entity matching: a comparison. Data Knowl Eng 69(2):197–210CrossRef
47.
Zurück zum Zitat Laender AHF, Ribeiro-Neto BA, da Silva AS (2002) Debye – data extraction by example. Data Knowl Eng 40(2):121–154MATHCrossRef Laender AHF, Ribeiro-Neto BA, da Silva AS (2002) Debye – data extraction by example. Data Knowl Eng 40(2):121–154MATHCrossRef
48.
Zurück zum Zitat Lenzerini M (2002) Data integration: a theoretical perspective. In: Popa L (ed) PODS. ACM, New York, pp 233–246 Lenzerini M (2002) Data integration: a theoretical perspective. In: Popa L (ed) PODS. ACM, New York, pp 233–246
49.
Zurück zum Zitat Levy AY, Rajaraman A, Ordille JJ (1996) Querying heterogeneous information sources using source descriptions. In: Vijayaraman Tm, Buchmann AP, Mohan C, Sarda NL (eds) VLDB. Morgan Kaufmann, San Francisco, pp 251–262 Levy AY, Rajaraman A, Ordille JJ (1996) Querying heterogeneous information sources using source descriptions. In: Vijayaraman Tm, Buchmann AP, Mohan C, Sarda NL (eds) VLDB. Morgan Kaufmann, San Francisco, pp 251–262
50.
Zurück zum Zitat Li C, Yerneni R, Vassalos V, Garcia-Molina H, Papakonstantinou Y, Ullman JD, Valiveti M (1998) Capability based mediation in tsimmis. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD 1998), 2–4 June 1998, Seattle. ACM Press, New York, pp 564–566CrossRef Li C, Yerneni R, Vassalos V, Garcia-Molina H, Papakonstantinou Y, Ullman JD, Valiveti M (1998) Capability based mediation in tsimmis. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD 1998), 2–4 June 1998, Seattle. ACM Press, New York, pp 564–566CrossRef
51.
Zurück zum Zitat Lin J, Mendelzon AO (1998) Merging databases under constraints. Int J Cooperative Inf Syst 7(1):55–76CrossRef Lin J, Mendelzon AO (1998) Merging databases under constraints. Int J Cooperative Inf Syst 7(1):55–76CrossRef
52.
Zurück zum Zitat Ludäscher B, Himmeröder R, Lausen G, May W, Schlepphorst C (1998) Managing semistructured data with florid: a deductive object-oriented perspective. Inf Syst 23(8):589–613CrossRef Ludäscher B, Himmeröder R, Lausen G, May W, Schlepphorst C (1998) Managing semistructured data with florid: a deductive object-oriented perspective. Inf Syst 23(8):589–613CrossRef
53.
Zurück zum Zitat Madhavan J, Bernstein PA, Doan A, Halevy AY (2005) Corpus-based schema matching. In: ICDE, pp 57–68 Madhavan J, Bernstein PA, Doan A, Halevy AY (2005) Corpus-based schema matching. In: ICDE, pp 57–68
54.
Zurück zum Zitat Madhavan J, Cohen S, Dong XL, Halevy AY, Jeffery SR, Ko D, Yu C (2007) Web-scale data integration: you can afford to pay as you go. In: CIDR, pp 342–350. www.crdrdb.org Madhavan J, Cohen S, Dong XL, Halevy AY, Jeffery SR, Ko D, Yu C (2007) Web-scale data integration: you can afford to pay as you go. In: CIDR, pp 342–350. www.​crdrdb.​org
55.
Zurück zum Zitat Mecca G, Papotti P, Raunich S (2009) Core schema mappings. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD 2009), Providence, RI, 29 June–2 July 2009. ACM, New York, pp 655–668 Mecca G, Papotti P, Raunich S (2009) Core schema mappings. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD 2009), Providence, RI, 29 June–2 July 2009. ACM, New York, pp 655–668
56.
Zurück zum Zitat Melnik S, Garcia-Molina H, Rahm E (2002) Similarity flooding: A versatile graph matching algorithm and its application to schema matching. In: ICDE, pp 117–128. IEEE Computer Society, Washington, DC Melnik S, Garcia-Molina H, Rahm E (2002) Similarity flooding: A versatile graph matching algorithm and its application to schema matching. In: ICDE, pp 117–128. IEEE Computer Society, Washington, DC
57.
Zurück zum Zitat Mena E, Illarramendi A, Kashyap V, Sheth AP (2000) Observer: an approach for query processing in global information systems based on interoperation across pre-existing ontologies. Distrib Parallel Databases 8(2):223–271CrossRef Mena E, Illarramendi A, Kashyap V, Sheth AP (2000) Observer: an approach for query processing in global information systems based on interoperation across pre-existing ontologies. Distrib Parallel Databases 8(2):223–271CrossRef
59.
Zurück zum Zitat Miller RJ (1998) Using schematically heterogeneous structures. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD 1998), 2–4 June 1998, Seattle. ACM Press, New York, pp 189–200CrossRef Miller RJ (1998) Using schematically heterogeneous structures. In: Proceedings of the ACM SIGMOD international conference on management of data (SIGMOD 1998), 2–4 June 1998, Seattle. ACM Press, New York, pp 189–200CrossRef
60.
Zurück zum Zitat Myllymaki J (2002) Effective web data extraction with standard xml technologies. Comput Netw 39(5):635–644CrossRef Myllymaki J (2002) Effective web data extraction with standard xml technologies. Comput Netw 39(5):635–644CrossRef
61.
Zurück zum Zitat Naumann F, Bilke A, Bleiholder J, Weis M (2006) Data fusion in three steps: resolving schema, tuple, and value inconsistencies. IEEE Data Eng Bull 29(2):21–31 Naumann F, Bilke A, Bleiholder J, Weis M (2006) Data fusion in three steps: resolving schema, tuple, and value inconsistencies. IEEE Data Eng Bull 29(2):21–31
62.
Zurück zum Zitat Naumann F, Freytag JC, Leser U (2004) Completeness of integrated information sources. Inf Syst 29(7):583–615CrossRef Naumann F, Freytag JC, Leser U (2004) Completeness of integrated information sources. Inf Syst 29(7):583–615CrossRef
63.
Zurück zum Zitat Naumann F, Häussler M (2002) Declarative data merging with conflict resolution. In: Fisher C, Davidson BN (eds) IQ, pp 212–224. MIT, Cambridge Naumann F, Häussler M (2002) Declarative data merging with conflict resolution. In: Fisher C, Davidson BN (eds) IQ, pp 212–224. MIT, Cambridge
64.
Zurück zum Zitat Noy NF (2004) Semantic integration: a survey of ontology-based approaches. SIGMOD Rec 33(4):65–70CrossRef Noy NF (2004) Semantic integration: a survey of ontology-based approaches. SIGMOD Rec 33(4):65–70CrossRef
65.
Zurück zum Zitat Noy NF, Doan A, Halevy AY (2005) Semantic integration. AI Mag 26(1):7–10 Noy NF, Doan A, Halevy AY (2005) Semantic integration. AI Mag 26(1):7–10
66.
Zurück zum Zitat Po L, Sorrentino S, Bergamaschi S, Beneventano D (2009) Lexical knowledge extraction: an effective approach to schema and ontology matching. In: European conference on knowledge management (ECKM 2009), 3–4 September 2009, Vicenza, Italy Po L, Sorrentino S, Bergamaschi S, Beneventano D (2009) Lexical knowledge extraction: an effective approach to schema and ontology matching. In: European conference on knowledge management (ECKM 2009), 3–4 September 2009, Vicenza, Italy
67.
Zurück zum Zitat Popa L, Velegrakis Y, Miller RJ, Hernández MA, Fagin R (2002) Translating web data. In: VLDB. Morgan Kaufmann, San Francisco, pp 598–609 Popa L, Velegrakis Y, Miller RJ, Hernández MA, Fagin R (2002) Translating web data. In: VLDB. Morgan Kaufmann, San Francisco, pp 598–609
68.
Zurück zum Zitat Pottinger R, Bernstein PA (2002) Creating a mediated schema based on initial correspondences. IEEE Data Eng Bull 25(3):26–31 Pottinger R, Bernstein PA (2002) Creating a mediated schema based on initial correspondences. IEEE Data Eng Bull 25(3):26–31
69.
Zurück zum Zitat Pottinger R, Bernstein PA (2008) Schema merging and mapping creation for relational sources. In: Kemper A, Valduriez P, Mouaddib N, Teubner J, Bouzeghoub M, Markl V, Amsaleg L, Manolescu I (eds) EDBT. ACM international conference proceeding series, vol 261. ACM, New York, pp 73–84CrossRef Pottinger R, Bernstein PA (2008) Schema merging and mapping creation for relational sources. In: Kemper A, Valduriez P, Mouaddib N, Teubner J, Bouzeghoub M, Markl V, Amsaleg L, Manolescu I (eds) EDBT. ACM international conference proceeding series, vol 261. ACM, New York, pp 73–84CrossRef
70.
Zurück zum Zitat Rahm E, Bernstein PA (2001) A survey of approaches to automatic schema matching. VLDB J 10(4):334–350MATHCrossRef Rahm E, Bernstein PA (2001) A survey of approaches to automatic schema matching. VLDB J 10(4):334–350MATHCrossRef
71.
Zurück zum Zitat Roth MT, Arya M, Haas LM, Carey MJ, Cody WF, Fagin R, Schwarz PM, II JT, Wimmers EL (eds) The garlic project. In: Jagadish HV, Mumick IS (eds) SIGMOD conference. ACM Press, New York, p 557 Roth MT, Arya M, Haas LM, Carey MJ, Cody WF, Fagin R, Schwarz PM, II JT, Wimmers EL (eds) The garlic project. In: Jagadish HV, Mumick IS (eds) SIGMOD conference. ACM Press, New York, p 557
72.
Zurück zum Zitat Roth MT, Schwarz PM (1997) Don’t scrap it, wrap it! a wrapper architecture for legacy data sources. In: Jarke M, Carey MJ, Dittrich KR, Lochovsky FH, Loucopoulos P, Jeusfeld MA (eds) VLDB. Morgan Kaufmann, San Francisco, pp 266–275 Roth MT, Schwarz PM (1997) Don’t scrap it, wrap it! a wrapper architecture for legacy data sources. In: Jarke M, Carey MJ, Dittrich KR, Lochovsky FH, Loucopoulos P, Jeusfeld MA (eds) VLDB. Morgan Kaufmann, San Francisco, pp 266–275
73.
Zurück zum Zitat Sahuguet A, Azavant F (2001) Building intelligent web applications using lightweight wrappers. Data Knowl Eng 36(3):283–316MATHCrossRef Sahuguet A, Azavant F (2001) Building intelligent web applications using lightweight wrappers. Data Knowl Eng 36(3):283–316MATHCrossRef
74.
Zurück zum Zitat Sarawagi S (2008) Information extraction. Found Trends Databases 1(3):261–377CrossRef Sarawagi S (2008) Information extraction. Found Trends Databases 1(3):261–377CrossRef
75.
Zurück zum Zitat Sattler KU, Geist I, Schallehn E (2005) Concept-based querying in mediator systems. VLDB J 14(1):97–111CrossRef Sattler KU, Geist I, Schallehn E (2005) Concept-based querying in mediator systems. VLDB J 14(1):97–111CrossRef
76.
Zurück zum Zitat Shafer G (1976) A mathematical theory of evidence. Princeton University Press, PrincetonMATH Shafer G (1976) A mathematical theory of evidence. Princeton University Press, PrincetonMATH
77.
Zurück zum Zitat Shvaiko P, Euzenat J (2008) Ten challenges for ontology matching. In: Meersman R, Tari Z (eds) OTM conferences (2). Lecture notes in computer science, vol 5332. Springer, Berlin, pp 1164–1182 Shvaiko P, Euzenat J (2008) Ten challenges for ontology matching. In: Meersman R, Tari Z (eds) OTM conferences (2). Lecture notes in computer science, vol 5332. Springer, Berlin, pp 1164–1182
78.
Zurück zum Zitat Sorrentino S, Bergamaschi S, Alberto C (2009) Dealing with uncertainty in lexical annotation. In: Poster ER, Demo session 2009, in Special issue of Journal of Theoretical and Applied Informatics (Revista de Informatica Terica e Aplicada RITA) 2009. (An extended version of this paper has been submitted to the “Semantic Integration of Data, Multimedia, and Services” special issue of Information Systems Journal) Sorrentino S, Bergamaschi S, Alberto C (2009) Dealing with uncertainty in lexical annotation. In: Poster ER, Demo session 2009, in Special issue of Journal of Theoretical and Applied Informatics (Revista de Informatica Terica e Aplicada RITA) 2009. (An extended version of this paper has been submitted to the “Semantic Integration of Data, Multimedia, and Services” special issue of Information Systems Journal)
79.
Zurück zum Zitat Sorrentino S, Bergamaschi S, Gawinecki M, Po L (2009) Schema normalization for improving schema matching. In: ER ’09: Proceedings of the 28th international conference on conceptual modeling. Springer, Berlin, pp 280–293. [An extended version of this paper has been submitted to the ER special issue of Data and Knowledge Engineering (DKE) Journal] Sorrentino S, Bergamaschi S, Gawinecki M, Po L (2009) Schema normalization for improving schema matching. In: ER ’09: Proceedings of the 28th international conference on conceptual modeling. Springer, Berlin, pp 280–293. [An extended version of this paper has been submitted to the ER special issue of Data and Knowledge Engineering (DKE) Journal]
80.
Zurück zum Zitat Tejada S, Knoblock CA, Minton S (2001) Learning object identification rules for information integration. Inf Syst 26(8):607–633MATHCrossRef Tejada S, Knoblock CA, Minton S (2001) Learning object identification rules for information integration. Inf Syst 26(8):607–633MATHCrossRef
81.
Zurück zum Zitat Ullman JD (1997) Information integration using logical views. In: Afrati FN, Kolaitis PG (eds) ICDT. Lecture notes in computer science, vol 1186. Springer, Berlin, pp 19–40 Ullman JD (1997) Information integration using logical views. In: Afrati FN, Kolaitis PG (eds) ICDT. Lecture notes in computer science, vol 1186. Springer, Berlin, pp 19–40
82.
Zurück zum Zitat Ullman JD, Garcia-Molina H, Widom J (2001) Database systems: the complete book. Prentice-Hall, Upper Saddle River Ullman JD, Garcia-Molina H, Widom J (2001) Database systems: the complete book. Prentice-Hall, Upper Saddle River
83.
Zurück zum Zitat Vossen P (ed) (1998) EuroWordNet: a multilingual database with lexical semantic networks. Kluwer, NorwellMATH Vossen P (ed) (1998) EuroWordNet: a multilingual database with lexical semantic networks. Kluwer, NorwellMATH
84.
Zurück zum Zitat Wiederhold G (1992) Mediators in the architecture of future information systems. IEEE Comput 25(3):38–49CrossRef Wiederhold G (1992) Mediators in the architecture of future information systems. IEEE Comput 25(3):38–49CrossRef
85.
Zurück zum Zitat Wiederhold G (1993) Intelligent integration of information. In: Proceedings of the 1993 ACM SIGMOD international conference on management of data, Washington, DC, 26–28 May 1993. ACM Press, New York, pp 434–437CrossRef Wiederhold G (1993) Intelligent integration of information. In: Proceedings of the 1993 ACM SIGMOD international conference on management of data, Washington, DC, 26–28 May 1993. ACM Press, New York, pp 434–437CrossRef
Metadaten
Titel
Data Integration
verfasst von
Sonia Bergamaschi
Domenico Beneventano
Francesco Guerra
Mirko Orsini
Copyright-Jahr
2011
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-642-15865-0_14

Premium Partner