2011 | OriginalPaper | Chapter

14. Data Integration

Authors: Sonia Bergamaschi, Domenico Beneventano, Francesco Guerra, Mirko Orsini

Published in: Handbook of Conceptual Modeling

Publisher: Springer Berlin Heidelberg

Abstract

Given the many data integration approaches, a complete and exhaustive comparison of all the research activities is not possible. In this chapter we present an overview of the most relevant research activities and ideas investigated in the field over the last 20 years. We also introduce the MOMIS system, a framework for information extraction and integration from both structured and semistructured data sources, which is one of the most interesting results of our research activity. An open source version of the MOMIS system was delivered by the academic startup DataRiver (www.datariver.it).

Footnotes
1
See for example Talend, http://www.talend.com, an open source ETL and data integration system.
 
2
We use classes for including both the object-oriented and relational models.
 
4
www.service-architecture.com/database/articles/odmg_3_0.html.
 
5
A global attribute mapped onto only one source is a particular case of a homogeneous attribute.
 
Metadata
Title
Data Integration
Authors
Sonia Bergamaschi
Domenico Beneventano
Francesco Guerra
Mirko Orsini
Copyright Year
2011
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-642-15865-0_14
