Published in: Journal of Intelligent Information Systems 2/2019

3 November 2018

SemiLD: mediator-based framework for keyword search over semi-structured and linked data

Authors: Mohamed Kettouch, Cristina Luca, Mike Hobbs

Abstract

The Linked Data initiative has fundamentally changed how knowledge is shared over the Web. Its primary aim is to improve the interoperability and semantics of published data by following a set of recommendations. Still, many valuable data sources have not migrated to this new data space and continue to publish semi-structured data. As a result, new challenges arise in accessing and integrating the two kinds of data sources and their models. This paper explores and identifies some of the major challenges, such as the continuous expansion and dynamism of a heterogeneous, autonomous, yet connected web of data. It addresses them by proposing SemiLD, a mediator-based framework that integrates heterogeneous semi-structured and Linked Data sources on the fly. The approach is implemented in a highly automated keyword search system that retrieves its input from various SPARQL endpoints and web APIs. The evaluation of the system illustrates the high precision, recall, and performance of the proposed approach.
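To make the mediator idea concrete, the following is a minimal sketch of the general mediator/wrapper pattern, not the SemiLD implementation itself, whose internals the abstract does not describe. All names (`Result`, `TripleWrapper`, `JsonWrapper`, `Mediator`) and the sample data are hypothetical. The SPARQL endpoint and the web API are replaced by in-memory stand-ins so that the example runs without network access, and keyword matching is reduced to a case-insensitive substring test.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Result:
    """Common result shape that every wrapper maps its hits onto (hypothetical)."""
    source: str     # name of the wrapper that produced the hit
    entity: str     # identifier of the matching entity
    attribute: str  # predicate or field name that matched
    value: str      # matched literal value


class TripleWrapper:
    """Stand-in for a Linked Data source: a list of (subject, predicate, object) triples."""

    def __init__(self, name: str, triples: Iterable[tuple]):
        self.name = name
        self.triples = list(triples)

    def search(self, keyword: str) -> Iterable[Result]:
        needle = keyword.lower()
        for subject, predicate, obj in self.triples:
            if needle in obj.lower():
                yield Result(self.name, subject, predicate, obj)


class JsonWrapper:
    """Stand-in for a semi-structured web API: a list of JSON-like records."""

    def __init__(self, name: str, records: Iterable[dict], id_field: str = "id"):
        self.name = name
        self.records = list(records)
        self.id_field = id_field

    def search(self, keyword: str) -> Iterable[Result]:
        needle = keyword.lower()
        for record in self.records:
            for field, value in record.items():
                # Skip the identifier field; match on the remaining fields only.
                if field != self.id_field and needle in str(value).lower():
                    yield Result(self.name, str(record[self.id_field]), field, str(value))


class Mediator:
    """Forwards one keyword query to every wrapper and merges the results."""

    def __init__(self, wrappers: Iterable):
        self.wrappers = list(wrappers)

    def search(self, keyword: str) -> List[Result]:
        merged: List[Result] = []
        for wrapper in self.wrappers:
            merged.extend(wrapper.search(keyword))
        return merged


# Hypothetical sample data for both kinds of sources.
linked = TripleWrapper("linked", [
    ("ex:Paris", "rdfs:label", "Paris"),
    ("ex:Berlin", "rdfs:label", "Berlin"),
])
api = JsonWrapper("api", [
    {"id": 1, "city": "Paris", "country": "France"},
    {"id": 2, "city": "Rome", "country": "Italy"},
])

mediator = Mediator([linked, api])
hits = mediator.search("paris")
for hit in hits:
    print(hit)
```

The point of the design is that callers see only `Mediator.search` and the shared `Result` shape. Each wrapper hides the data model of its source, so a new source could be supported by adding a wrapper rather than changing the query interface.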

Metadata
Title
SemiLD: mediator-based framework for keyword search over semi-structured and linked data
Authors
Mohamed Kettouch
Cristina Luca
Mike Hobbs
Publication date
3 November 2018
Publisher
Springer US
Published in
Journal of Intelligent Information Systems / Issue 2/2019
Print ISSN: 0925-9902
Electronic ISSN: 1573-7675
DOI
https://doi.org/10.1007/s10844-018-0536-1
