Skip to main content

2016 | OriginalPaper | Buchkapitel

Semantic Integration of Open-Data Tables

verfasst von : Asha Subramanian, Ved Kurien Mathai, Vikkurthi Manikanta, Janaki Vinesh Joshi, Srinath Srinivasa

Erschienen in: On the Move to Meaningful Internet Systems: OTM 2016 Conferences

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

With vast amounts of tabular data freely available under several Open-Data initiatives, semantic integration of such datasets is a pressing need. Multiple research efforts have addressed the problem of annotating tabular data. However, to the best of our knowledge, they do not adequately address the problem of semantic integration of tables. A given collection of tables can be semantically integrated along several perspectives or themes. This makes semantic integration a “divergent aggregation” problem. Most existing approaches have focused on interpreting a single table, or rewriting tables to describe an overarching theme that is already provided. In this work, we address semantic integration along two levels: Theme identification (identifying dominant topics or perspectives through which the data can be characterized) and Schematic characterization (classes, relationships and instances that best characterize the data within the theme). The theme need not be represented by a single column, and may span across multiple columns or tables. We use Linked Open data (LOD) cloud to map ontologies that best suit the datasets. Our work also identifies incoherent datasets where a given collection may not have common topics. In such cases we are able to provide guidance on the intersection of semantic footprints of the tables for a judicious selection of the datasets for semantic integration.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Auer, S., Dietzold, S., Lehmann, J., Hellmann, S., Aumueller, D.: Triplify: light-weight linked data publication from relational databases. In: 18th International Conference on World Wide Web, pp. 621–630. ACM, April 2009 Auer, S., Dietzold, S., Lehmann, J., Hellmann, S., Aumueller, D.: Triplify: light-weight linked data publication from relational databases. In: 18th International Conference on World Wide Web, pp. 621–630. ACM, April 2009
3.
Zurück zum Zitat Daiber, J., Jakob, M., Hokamp, C., Mendes, P.N.: Improving efficiency and accuracy in multilingual entity extraction. In: Proceedings of the 9th International Conference on Semantic Systems, pp. 121–124. ACM, September 2013 Daiber, J., Jakob, M., Hokamp, C., Mendes, P.N.: Improving efficiency and accuracy in multilingual entity extraction. In: Proceedings of the 9th International Conference on Semantic Systems, pp. 121–124. ACM, September 2013
4.
Zurück zum Zitat Ding, L., DiFranzo, D., Graves, A., Michaelis, J., Li, X., McGuinness, D.L., Hendler, J.: Data-gov Wiki: towards linking government data. In: AAAI Spring Symposium: Linked data Meets Artificial Intelligence, vol. 10, p. 1 (2010) Ding, L., DiFranzo, D., Graves, A., Michaelis, J., Li, X., McGuinness, D.L., Hendler, J.: Data-gov Wiki: towards linking government data. In: AAAI Spring Symposium: Linked data Meets Artificial Intelligence, vol. 10, p. 1 (2010)
5.
Zurück zum Zitat Ding, L., Lebo, T., Erickson, J.S., DiFranzo, D., Williams, G.T., Li, X., Michaelis, J., Graves, A., Zheng, J.G., Shangguan, Z., Flores, J.: TWC LOGD: A portal for linked open government data ecosystems. Web Semant. Sci. Serv. Agents World Wide Web 9(3), 325–333 (2011)CrossRef Ding, L., Lebo, T., Erickson, J.S., DiFranzo, D., Williams, G.T., Li, X., Michaelis, J., Graves, A., Zheng, J.G., Shangguan, Z., Flores, J.: TWC LOGD: A portal for linked open government data ecosystems. Web Semant. Sci. Serv. Agents World Wide Web 9(3), 325–333 (2011)CrossRef
6.
Zurück zum Zitat Ermilov, I., Auer, S., Stadler, C.: Csv2rdf: User-driven csv to rdf mass conversion framework. In: ISEM, vol. 13, pp. 04–06, September 2013 Ermilov, I., Auer, S., Stadler, C.: Csv2rdf: User-driven csv to rdf mass conversion framework. In: ISEM, vol. 13, pp. 04–06, September 2013
7.
Zurück zum Zitat Han, L., Finin, T., Parr, C., Sachs, J., Joshi, A.: RDF123: from spreadsheets to RDF. In: Sheth, A., Staab, S., Dean, M., Paolucci, M., Maynard, D., Finin, T., Thirunarayan, K. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 451–466. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88564-1_29 CrossRef Han, L., Finin, T., Parr, C., Sachs, J., Joshi, A.: RDF123: from spreadsheets to RDF. In: Sheth, A., Staab, S., Dean, M., Paolucci, M., Maynard, D., Finin, T., Thirunarayan, K. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 451–466. Springer, Heidelberg (2008). doi:10.​1007/​978-3-540-88564-1_​29 CrossRef
9.
Zurück zum Zitat Limaye, G., Sarawagi, S., Chakrabarti, S.: Annotating and searching web tables using entities, types and relationships. Proc. VLDB Endowment 3(1–2), 1338–1347 (2010)CrossRef Limaye, G., Sarawagi, S., Chakrabarti, S.: Annotating and searching web tables using entities, types and relationships. Proc. VLDB Endowment 3(1–2), 1338–1347 (2010)CrossRef
10.
Zurück zum Zitat Marshall, M.S., Boyce, R., Deus, H.F., Zhao, J., Willighagen, E.L., Samwald, M., Pichler, E., Hajagos, J., Prud’hommeaux, E., Stephens, S.: Emerging practices for mapping and linking life sciences data using RDF A case series. Web Semant. Sci. Serv. Agents World Wide Web 14, 2–13 (2012)CrossRef Marshall, M.S., Boyce, R., Deus, H.F., Zhao, J., Willighagen, E.L., Samwald, M., Pichler, E., Hajagos, J., Prud’hommeaux, E., Stephens, S.: Emerging practices for mapping and linking life sciences data using RDF A case series. Web Semant. Sci. Serv. Agents World Wide Web 14, 2–13 (2012)CrossRef
11.
Zurück zum Zitat Miles, A., Matthews, B., Wilson, M., Brickley, D.: Core: Simple knowledge organisation for the web. In: International Conference on Dublin Core and Metadata Applications (2005) Miles, A., Matthews, B., Wilson, M., Brickley, D.: Core: Simple knowledge organisation for the web. In: International Conference on Dublin Core and Metadata Applications (2005)
12.
Zurück zum Zitat Mulwad, V.V.: TABEL A Domain Independent and Extensible Framework for Inferring the Semantics of Tables (Doctoral dissertation, University of Maryland) (2015) Mulwad, V.V.: TABEL A Domain Independent and Extensible Framework for Inferring the Semantics of Tables (Doctoral dissertation, University of Maryland) (2015)
13.
Zurück zum Zitat Sekhavat, Y.A., Di Paolo, F., Barbosa, D., Merialdo, P.: Knowledge base augmentation using tabular data. In: LDOW (2014) Sekhavat, Y.A., Di Paolo, F., Barbosa, D., Merialdo, P.: Knowledge base augmentation using tabular data. In: LDOW (2014)
14.
Zurück zum Zitat Srinivasa, S., Agrawal, S., Jog, C., Deshmukh, J.: Characterizing open utilitarian knowledge. In: Proceedings of the First IKDD Conference on Data Sciences (CoDS 2014), New Delhi, India, March 2014 Srinivasa, S., Agrawal, S., Jog, C., Deshmukh, J.: Characterizing open utilitarian knowledge. In: Proceedings of the First IKDD Conference on Data Sciences (CoDS 2014), New Delhi, India, March 2014
15.
Zurück zum Zitat Subramanian, A.: Inferencing in the large characterizing semantic integration of open tabular data. In: ISWC-DC 2015 The ISWC 2015 Doctoral Consortium, pp. 74–81. CEUR-WS.org, Pennsylvania (2015) Subramanian, A.: Inferencing in the large characterizing semantic integration of open tabular data. In: ISWC-DC 2015 The ISWC 2015 Doctoral Consortium, pp. 74–81. CEUR-WS.org, Pennsylvania (2015)
16.
Zurück zum Zitat Subramanian, A., Srinivasa, S., Kumar, P., Vignesh, S.: Semantic integration of structured data powered by linked open data. In: Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, p. 13. ACM, Cyprus (2015) Subramanian, A., Srinivasa, S., Kumar, P., Vignesh, S.: Semantic integration of structured data powered by linked open data. In: Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, p. 13. ACM, Cyprus (2015)
17.
Zurück zum Zitat Unbehauen, J., Hellmann, S., Auer, S., Stadler, C.: Knowledge extraction from structured sources. In: Ceri, S., Brambilla, M. (eds.) Search Computing. LNCS, vol. 7538, pp. 34–52. Springer, Heidelberg (2012). doi:10.1007/978-3-642-34213-4_3 CrossRef Unbehauen, J., Hellmann, S., Auer, S., Stadler, C.: Knowledge extraction from structured sources. In: Ceri, S., Brambilla, M. (eds.) Search Computing. LNCS, vol. 7538, pp. 34–52. Springer, Heidelberg (2012). doi:10.​1007/​978-3-642-34213-4_​3 CrossRef
18.
Zurück zum Zitat Zhang, Z.: Start small, build complete: Effective and efficient semantic table interpretation using tableminer. Semant. Web J. Under Transparent Rev. (2014) Zhang, Z.: Start small, build complete: Effective and efficient semantic table interpretation using tableminer. Semant. Web J. Under Transparent Rev. (2014)
Metadaten
Titel
Semantic Integration of Open-Data Tables
verfasst von
Asha Subramanian
Ved Kurien Mathai
Vikkurthi Manikanta
Janaki Vinesh Joshi
Srinath Srinivasa
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-48472-3_35