Skip to main content

2017 | OriginalPaper | Buchkapitel

The Qanary Ecosystem: Getting New Insights by Composing Question Answering Pipelines

verfasst von : Dennis Diefenbach, Kuldeep Singh, Andreas Both, Didier Cherix, Christoph Lange, Sören Auer

Erschienen in: Web Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The field of Question Answering (QA) is very multi-disciplinary as it requires expertise from a large number of areas such as natural language processing (NLP), artificial intelligence, machine learning, information retrieval, speech recognition and semantic technologies. In the past years a large number of QA systems were proposed using approaches from different fields and focusing on particular tasks in the QA process. Unfortunately, most of these systems cannot be easily reused, extended, and results cannot be easily reproduced since the systems are mostly implemented in a monolithic fashion, lack standardized interfaces and are often not open source or available as Web services. To address these issues we developed the knowledge-based Qanary methodology for choreographing QA pipelines distributed over the Web. Qanary employs the qa vocabulary as an exchange format for typical QA components. As a result, QA systems can be built using the Qanary methodology in a simpler, more flexible and standardized way while becoming knowledge-driven instead of being process-oriented. This paper presents the components and services that are integrated using the qa vocabulary and the Qanary methodology within the Qanary ecosystem. Moreover, we show how the Qanary ecosystem can be used to analyse QA processes to detect weaknesses and research gaps. We illustrate this by focusing on the Entity Linking (EL) task w.r.t. textual natural language input, which is a fundamental step in most QA processes. Additionally, we contribute the first EL benchmark for QA, as open source. Our main goal is to show how the research community can use Qanary to gain new insights into QA processes.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
9
W3C Candidate Recommendation 2016-09-06, http://​www.​w3.​org/​TR/​annotation-model.
 
15
The configuration property spring.boot.admin.url defines the endpoint of the central component (and can be injected dynamically).
 
21
Note that resources of the type http://​dbpedia.​org/​ontology/​* would match a DBpedia property or class and therefore are not considered here.
 
22
This definition of the gold standard ignores the order of the URIs. In practice this definition rarely causes problems, but in theory one could construct counter-examples that pinpoint the limitations. Imagine the question “What German actors were not born in Germany?”, and imagine that the word “German” got linked to the entity dbr:Germany and “Germany” to dbr:Germans - clearly a wrong linking, but “correct” w.r.t. our gold standard definitions. However, in QALD (release 6) there are no questions in which such a mix-up is likely to happen.
 
23
In the corner cases where the number of system answers or the number of gold standard answers is zero we follow the same rules that are used in the QALD evaluation; see https://​github.​com/​ag-sc/​QALD/​blob/​master/​6/​scripts/​evaluation.​rb.
 
Literatur
1.
Zurück zum Zitat Atdag, S., Labatut, V.: A comparison of named entity recognition tools applied to biographical texts. In: 2nd International Conference on Systems and Computer Science (ICSCS) (2013) Atdag, S., Labatut, V.: A comparison of named entity recognition tools applied to biographical texts. In: 2nd International Conference on Systems and Computer Science (ICSCS) (2013)
2.
Zurück zum Zitat Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76298-0_52 CrossRef Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). doi:10.​1007/​978-3-540-76298-0_​52 CrossRef
3.
Zurück zum Zitat Both, A., Diefenbach, D., Singh, K., Shekarpour, S., Cherix, D., Lange, C.: Qanary – a methodology for vocabulary-driven open question answering systems. In: The Semantic Web. Latest Advances and New Domains: 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, 29 May–2 June 2016, Proceedings (2016) Both, A., Diefenbach, D., Singh, K., Shekarpour, S., Cherix, D., Lange, C.: Qanary – a methodology for vocabulary-driven open question answering systems. In: The Semantic Web. Latest Advances and New Domains: 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, 29 May–2 June 2016, Proceedings (2016)
4.
Zurück zum Zitat Both, A., Ngonga Ngomo, A.-C., Usbeck, R., Lukovnikov, D., Lemke, C., Speicher, M.: A service-oriented search framework for full text, geospatial and semantic search. In: Proceedings of the 10th International Conference on Semantic Systems, SEM 2014, pp. 65–72. ACM (2014) Both, A., Ngonga Ngomo, A.-C., Usbeck, R., Lukovnikov, D., Lemke, C., Speicher, M.: A service-oriented search framework for full text, geospatial and semantic search. In: Proceedings of the 10th International Conference on Semantic Systems, SEM 2014, pp. 65–72. ACM (2014)
5.
Zurück zum Zitat Cabrio, E., Cojan, J., Aprosio, A.P., Magnini, B., Lavelli, A., Gandon, F.: QAKiS: an open domain QA system based on relational patterns. In: Glimm, B., Huynh, D. (eds.) Proceedings of the ISWC 2012 Posters & Demonstrations Track, vol. 914, CEUR Workshop Proceedings (2012). CEUR-WS.org Cabrio, E., Cojan, J., Aprosio, A.P., Magnini, B., Lavelli, A., Gandon, F.: QAKiS: an open domain QA system based on relational patterns. In: Glimm, B., Huynh, D. (eds.) Proceedings of the ISWC 2012 Posters & Demonstrations Track, vol. 914, CEUR Workshop Proceedings (2012). CEUR-WS.​org
6.
Zurück zum Zitat Diefenbach, D., Amjad, S., Both, A., Singh, K., Maret, P.: Trill: a reusable front-end for QA systems. In: ESWC P&D (2017) Diefenbach, D., Amjad, S., Both, A., Singh, K., Maret, P.: Trill: a reusable front-end for QA systems. In: ESWC P&D (2017)
7.
Zurück zum Zitat Diefenbach, D., Singh, K., Maret, P.: Wdaqua-core0: a question answering component for the research community. In: ESWC, 7th Open Challenge on Question Answering over Linked Data (QALD-7) (2017) Diefenbach, D., Singh, K., Maret, P.: Wdaqua-core0: a question answering component for the research community. In: ESWC, 7th Open Challenge on Question Answering over Linked Data (QALD-7) (2017)
8.
Zurück zum Zitat Dima, C.: Answering natural language questions with intui3. In: CLEF (Working Notes) (2014) Dima, C.: Answering natural language questions with intui3. In: CLEF (Working Notes) (2014)
9.
Zurück zum Zitat Ferrández, Ó., Spurk, C., Kouylekov, M., Dornescu, I., Ferrández, S., Negri, M., Izquierdo, R., Tomás, D., Orasan, C., Neumann, G., Magnini, B., González, J.L.V.: The QALL-ME framework: a specifiable-domain multilingual question answering architecture. Web Semant. Sci. Serv. Agents World Wide Web 9(2), 137–145 (2011)CrossRef Ferrández, Ó., Spurk, C., Kouylekov, M., Dornescu, I., Ferrández, S., Negri, M., Izquierdo, R., Tomás, D., Orasan, C., Neumann, G., Magnini, B., González, J.L.V.: The QALL-ME framework: a specifiable-domain multilingual question answering architecture. Web Semant. Sci. Serv. Agents World Wide Web 9(2), 137–145 (2011)CrossRef
10.
Zurück zum Zitat Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by Gibbs sampling. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL 2005 (2005) Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by Gibbs sampling. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL 2005 (2005)
11.
Zurück zum Zitat Freitas, A., Oliveira, J., Curry, E., O’Riain, S., da Silva, J.: Treo: combining entity-search, spreading activation and semantic relatedness for querying linked data. In: 1st Workshop on Question Answering over Linked Data (QALD-2011) (2011) Freitas, A., Oliveira, J., Curry, E., O’Riain, S., da Silva, J.: Treo: combining entity-search, spreading activation and semantic relatedness for querying linked data. In: 1st Workshop on Question Answering over Linked Data (QALD-2011) (2011)
12.
Zurück zum Zitat Lytra, I., Vidal, M.-E., Lange, C., Auer, S., Demidova, E.: WDAqua - answering questions using web data. In: Mannens, E., Dragoni, M., Nixon, L., Corcho, O. (eds.) EU Project Networking (2016) Lytra, I., Vidal, M.-E., Lange, C., Auer, S., Demidova, E.: WDAqua - answering questions using web data. In: Mannens, E., Dragoni, M., Nixon, L., Corcho, O. (eds.) EU Project Networking (2016)
13.
Zurück zum Zitat Marx, E., Usbeck, R., Ngonga Ngomo, A., Höffner, K., Lehmann, J., Auer, S.: Towards an open question answering architecture. In: 10th International Conference on Semantic Systems (2014) Marx, E., Usbeck, R., Ngonga Ngomo, A., Höffner, K., Lehmann, J., Auer, S.: Towards an open question answering architecture. In: 10th International Conference on Semantic Systems (2014)
14.
Zurück zum Zitat Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems, I-Semantics 2011 (2011) Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems, I-Semantics 2011 (2011)
16.
Zurück zum Zitat Rizzo, G., Troncy, R.: NERD: a framework for unifying named entity recognition and disambiguation extraction tools. In: 13th Conference of the European Chapter of the Association for Computational Linguistics (2012) Rizzo, G., Troncy, R.: NERD: a framework for unifying named entity recognition and disambiguation extraction tools. In: 13th Conference of the European Chapter of the Association for Computational Linguistics (2012)
17.
Zurück zum Zitat Shekarpour, S., Marx, E., Ngonga Ngomo, A.-C., Auer, S.: SINA: semantic interpretation of user queries for question answering on interlinked data. Web Semant. Sci. Serv. Agents WWW 30, 39–51 (2015)CrossRef Shekarpour, S., Marx, E., Ngonga Ngomo, A.-C., Auer, S.: SINA: semantic interpretation of user queries for question answering on interlinked data. Web Semant. Sci. Serv. Agents WWW 30, 39–51 (2015)CrossRef
18.
Zurück zum Zitat Singh, K., Both, A., Diefenbach, D., Shekarpour, S.: Towards a message-driven vocabulary for promoting the interoperability of question answering systems. In: 2016 IEEE Tenth International Conference on Semantic Computing (ICSC) (2016) Singh, K., Both, A., Diefenbach, D., Shekarpour, S.: Towards a message-driven vocabulary for promoting the interoperability of question answering systems. In: 2016 IEEE Tenth International Conference on Semantic Computing (ICSC) (2016)
19.
Zurück zum Zitat Singh, K., Both, A., Diefenbach, D., Shekarpour, S., Cherix, D., Lange, C.: Qanary-the fast track to create a question answering system with linked data technology. In: The Semantic Web: ESWC 2016 Satellite Events, Heraklion, Crete, Greece, 29 May–2 June 2016, Revised Selected Papers (2016) Singh, K., Both, A., Diefenbach, D., Shekarpour, S., Cherix, D., Lange, C.: Qanary-the fast track to create a question answering system with linked data technology. In: The Semantic Web: ESWC 2016 Satellite Events, Heraklion, Crete, Greece, 29 May–2 June 2016, Revised Selected Papers (2016)
20.
Zurück zum Zitat Speck, R., Ngonga Ngomo, A.-C.: Ensemble learning for named entity recognition. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 519–534. Springer, Cham (2014). doi:10.1007/978-3-319-11964-9_33 Speck, R., Ngonga Ngomo, A.-C.: Ensemble learning for named entity recognition. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 519–534. Springer, Cham (2014). doi:10.​1007/​978-3-319-11964-9_​33
21.
Zurück zum Zitat Unger, C., Forascu, C., Lopez, V., Ngonga Ngomo, A., Cabrio, E., Cimiano, P., Walter, S.: Question answering over linked data (QALD-5). In: CLEF (Working Notes) (2015) Unger, C., Forascu, C., Lopez, V., Ngonga Ngomo, A., Cabrio, E., Cimiano, P., Walter, S.: Question answering over linked data (QALD-5). In: CLEF (Working Notes) (2015)
22.
Zurück zum Zitat Usbeck, R., Ngonga Ngomo, A.-C., Röder, M., Gerber, D., Coelho, S.A., Auer, S., Both, A.: AGDISTIS - graph-based disambiguation of named entities using linked data. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 457–471. Springer, Cham (2014). doi:10.1007/978-3-319-11964-9_29 Usbeck, R., Ngonga Ngomo, A.-C., Röder, M., Gerber, D., Coelho, S.A., Auer, S., Both, A.: AGDISTIS - graph-based disambiguation of named entities using linked data. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 457–471. Springer, Cham (2014). doi:10.​1007/​978-3-319-11964-9_​29
23.
Zurück zum Zitat Usbeck, R., Röder, M., Ngonga Ngomo, A., Baron, C., Both, A., Brümmer, M., Ceccarelli, D., Cornolti, M., Cherix, D., Eickmann, B., Ferragina, P., Lemke, C., Moro, A., Navigli, R., Piccinno, F., Rizzo, G., Sack, H., Speck, R., Troncy, R., Waitelonis, J., Wesemann, L.: GERBIL: general entity annotator benchmarking framework. In: 24th International Conference on World Wide Web (2015) Usbeck, R., Röder, M., Ngonga Ngomo, A., Baron, C., Both, A., Brümmer, M., Ceccarelli, D., Cornolti, M., Cherix, D., Eickmann, B., Ferragina, P., Lemke, C., Moro, A., Navigli, R., Piccinno, F., Rizzo, G., Sack, H., Speck, R., Troncy, R., Waitelonis, J., Wesemann, L.: GERBIL: general entity annotator benchmarking framework. In: 24th International Conference on World Wide Web (2015)
Metadaten
Titel
The Qanary Ecosystem: Getting New Insights by Composing Question Answering Pipelines
verfasst von
Dennis Diefenbach
Kuldeep Singh
Andreas Both
Didier Cherix
Christoph Lange
Sören Auer
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-60131-1_10