Skip to main content

2020 | OriginalPaper | Buchkapitel

Question Answering When Knowledge Bases are Incomplete

verfasst von : Camille Pradel, Damien Sileo, Álvaro Rodrigo, Anselmo Peñas, Eneko Agirre

Erschienen in: Experimental IR Meets Multilinguality, Multimodality, and Interaction

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

While systems for question answering over knowledge bases (KB) continue to progress, real world usage requires systems that are robust to incomplete KBs. Dependence on the closed world assumption is highly problematic, as in many practical cases the information is constantly evolving and KBs cannot keep up. In this paper we formalize a typology of missing information in knowledge bases, and present a dataset based on the Spider KB question answering dataset, where we deliberately remove information from several knowledge bases, in this case implemented as relational databases (The dataset and the code to reproduce experiments are available at https://​github.​com/​camillepradel/​IDK.). Our dataset, called IDK (Incomplete Data in Knowledge base question answering), allows to perform studies on how to detect and recover from such cases. The analysis shows that simple baselines fail to detect most of the unanswerable questions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
2
 
3
Plus domain and range properties, and labels language tags we did not include in our definition for the sake of simplicity.
 
4
We used Magnitude [12] in order to query the embeddings in a way that is robust to minor morphological word differences.
 
Literatur
1.
Zurück zum Zitat Artzi, Y., Zettlemoyer, L.: Weakly supervised learning of semantic parsers for mapping instructions to actions. Trans. Assoc. Comput. Linguist. 1, 49–62 (2013)CrossRef Artzi, Y., Zettlemoyer, L.: Weakly supervised learning of semantic parsers for mapping instructions to actions. Trans. Assoc. Comput. Linguist. 1, 49–62 (2013)CrossRef
2.
Zurück zum Zitat Berant, J., Chou, A., Frostig, R., Liang, P.: Semantic parsing on Freebase from question-answer pairs. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, pp. 1533–1544. Association for Computational Linguistics, October 2013. https://www.aclweb.org/anthology/D13-1160 Berant, J., Chou, A., Frostig, R., Liang, P.: Semantic parsing on Freebase from question-answer pairs. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, pp. 1533–1544. Association for Computational Linguistics, October 2013. https://​www.​aclweb.​org/​anthology/​D13-1160
3.
Zurück zum Zitat Bogin, B., Gardner, M., Berant, J.: Representing schema structure with graph neural networks for text-to-SQL parsing. In: ACL (2019) Bogin, B., Gardner, M., Berant, J.: Representing schema structure with graph neural networks for text-to-SQL parsing. In: ACL (2019)
4.
Zurück zum Zitat Chebotko, A., Lu, S., Fotouhi, F.: Semantics preserving SPARQL-to-SQL translation. Data Knowl. Eng. 68, 973–1000 (2009)CrossRef Chebotko, A., Lu, S., Fotouhi, F.: Semantics preserving SPARQL-to-SQL translation. Data Knowl. Eng. 68, 973–1000 (2009)CrossRef
5.
6.
Zurück zum Zitat Guo, J., et al.: Towards complex text-to-SQL in cross-domain database with intermediate representation. arXiv preprint arXiv:1905.08205 (2019). Version 2 Guo, J., et al.: Towards complex text-to-SQL in cross-domain database with intermediate representation. arXiv preprint arXiv:​1905.​08205 (2019). Version 2
8.
Zurück zum Zitat Joshi, M., Choi, E., Weld, D.S., Zettlemoyer, L.: Triviaqa: a large scale distantly supervised challenge dataset for reading comprehension. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Vancouver, Canada. Association for Computational Linguistics, July 2017 Joshi, M., Choi, E., Weld, D.S., Zettlemoyer, L.: Triviaqa: a large scale distantly supervised challenge dataset for reading comprehension. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Vancouver, Canada. Association for Computational Linguistics, July 2017
10.
Zurück zum Zitat Mikolov, T., Grave, E., Bojanowski, P., Puhrsch, C., Joulin, A.: Advances in pre-training distributed word representations. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association (ELRA), May 2018. https://www.aclweb.org/anthology/L18-1008 Mikolov, T., Grave, E., Bojanowski, P., Puhrsch, C., Joulin, A.: Advances in pre-training distributed word representations. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association (ELRA), May 2018. https://​www.​aclweb.​org/​anthology/​L18-1008
13.
Zurück zum Zitat Peñas, A., Veron, M., Pradel, C., Otegi, A., Echegoyen, G., Rodrigo, A.: Continuous learning for question answering. In: Proceedings of the 10th International Workshop on Spoken Dialog Systems (IWSDS 2019) - DSLL Special Session (2019) Peñas, A., Veron, M., Pradel, C., Otegi, A., Echegoyen, G., Rodrigo, A.: Continuous learning for question answering. In: Proceedings of the 10th International Workshop on Spoken Dialog Systems (IWSDS 2019) - DSLL Special Session (2019)
17.
Zurück zum Zitat Soussi, N., Bahaj, M.: Semantics preserving SQL-to-SPARQL query translation for nested right and left outer join. J. Appl. Res. Technol. 15(5), 504–512 (2017)CrossRef Soussi, N., Bahaj, M.: Semantics preserving SQL-to-SPARQL query translation for nested right and left outer join. J. Appl. Res. Technol. 15(5), 504–512 (2017)CrossRef
18.
Zurück zum Zitat Yu, T., et al.: CoSQL: a conversational text-to-SQL challenge towards cross-domain natural language interfaces to databases. In: Proceedings of EMNLP 2019 (2019) Yu, T., et al.: CoSQL: a conversational text-to-SQL challenge towards cross-domain natural language interfaces to databases. In: Proceedings of EMNLP 2019 (2019)
20.
Zurück zum Zitat Yu, T., et al.: SParC: cross-domain semantic parsing in context. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy. Association for Computational Linguistics (2019) Yu, T., et al.: SParC: cross-domain semantic parsing in context. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy. Association for Computational Linguistics (2019)
22.
Zurück zum Zitat Zhong, V., Xiong, C., Socher, R.: Seq2SQL: generating structured queries from natural language using reinforcement learning. arXiv preprint arXiv:1709.00103 (2017) Zhong, V., Xiong, C., Socher, R.: Seq2SQL: generating structured queries from natural language using reinforcement learning. arXiv preprint arXiv:​1709.​00103 (2017)
Metadaten
Titel
Question Answering When Knowledge Bases are Incomplete
verfasst von
Camille Pradel
Damien Sileo
Álvaro Rodrigo
Anselmo Peñas
Eneko Agirre
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-58219-7_4