Skip to main content
Top

2023 | OriginalPaper | Chapter

Visconde: Multi-document QA with GPT-3 and Neural Reranking

Authors : Jayr Pereira, Robson Fidalgo, Roberto Lotufo, Rodrigo Nogueira

Published in: Advances in Information Retrieval

Publisher: Springer Nature Switzerland

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper proposes a question-answering system that can answer questions whose supporting evidence is spread over multiple (potentially long) documents. The system, called Visconde, uses a three-step pipeline to perform the task: decompose, retrieve, and aggregate. The first step decomposes the question into simpler questions using a few-shot large language model (LLM). Then, a state-of-the-art search engine is used to retrieve candidate passages from a large collection for each decomposed question. In the final step, we use the LLM in a few-shot setting to aggregate the contents of the passages into the final answer. The system is evaluated on three datasets: IIRC, Qasper, and StrategyQA. Results suggest that current retrievers are the main bottleneck and that readers are already performing at the human level as long as relevant passages are provided. The system is also shown to be more effective when the model is induced to give explanations before answering a question. Code is available at https://​github.​com/​neuralmind-ai/​visconde.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
The name is a homage to Visconde de Sabugosa a fictional character invented by Monteiro Lobato that is a corn cob doll whose wisdom comes from reading books.
 
2
We used the 3 billion parameters version, whose checkpoint is available at https://​huggingface.​co/​castorini/​monot5-3b-msmarco-10k.
 
3
We used this model as our sentence encoder: sentence-transformers/msmarco-bert-base-dot-v5.
 
Literature
14.
go back to reference Khashabi, D., et al.: Unifiedqa: crossing format boundaries with a single QA system (2020) Khashabi, D., et al.: Unifiedqa: crossing format boundaries with a single QA system (2020)
15.
go back to reference Khashabi, D., Kordi, Y., Hajishirzi, H.: Unifiedqa-v2: stronger generalization via broader cross-format training. arXiv preprint arXiv:2202.12359 (2022) Khashabi, D., Kordi, Y., Hajishirzi, H.: Unifiedqa-v2: stronger generalization via broader cross-format training. arXiv preprint arXiv:​2202.​12359 (2022)
17.
go back to reference Lazaridou, A., Gribovskaya, E., Stokowiec, W., Grigorev, N.: Internet-augmented language models through few-shot prompting for open-domain question answering. arXiv preprint arXiv:2203.05115 (2022) Lazaridou, A., Gribovskaya, E., Stokowiec, W., Grigorev, N.: Internet-augmented language models through few-shot prompting for open-domain question answering. arXiv preprint arXiv:​2203.​05115 (2022)
19.
go back to reference Lin, J., Ma, X., Lin, S.C., Yang, J.H., Pradeep, R., Nogueira, R.: Pyserini: a python toolkit for reproducible information retrieval research with sparse and dense representations. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 2356–2362. SIGIR 2021, Association for Computing Machinery, New York, NY, USA (2021). https://doi.org/10.1145/3404835.3463238 Lin, J., Ma, X., Lin, S.C., Yang, J.H., Pradeep, R., Nogueira, R.: Pyserini: a python toolkit for reproducible information retrieval research with sparse and dense representations. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 2356–2362. SIGIR 2021, Association for Computing Machinery, New York, NY, USA (2021). https://​doi.​org/​10.​1145/​3404835.​3463238
23.
go back to reference Perez, E., Lewis, P., Yih, W.T., Cho, K., Kiela, D.: Unsupervised question decomposition for question answering. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 8864–8880 (2020) Perez, E., Lewis, P., Yih, W.T., Cho, K., Kiela, D.: Unsupervised question decomposition for question answering. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 8864–8880 (2020)
24.
go back to reference Press, O., Zhang, M., Min, S., Schmidt, L., Smith, N.A., Lewis, M.: Measuring and narrowing the compositionality gap in language models. arXiv preprint arXiv:2210.03350 (2022) Press, O., Zhang, M., Min, S., Schmidt, L., Smith, N.A., Lewis, M.: Measuring and narrowing the compositionality gap in language models. arXiv preprint arXiv:​2210.​03350 (2022)
25.
go back to reference Raffel, C., et al.: Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21(140), 1–67 (2020)MathSciNetMATH Raffel, C., et al.: Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21(140), 1–67 (2020)MathSciNetMATH
27.
go back to reference Robertson, S.E., Walker, S., Jones, S., Hancock-Beaulieu, M., Gatford, M.: Okapi at trec-3. In: TREC (1994) Robertson, S.E., Walker, S., Jones, S., Hancock-Beaulieu, M., Gatford, M.: Okapi at trec-3. In: TREC (1994)
28.
go back to reference Sachan, D.S., Lewis, M., Yogatama, D., Zettlemoyer, L., Pineau, J., Zaheer, M.: Questions are all you need to train a dense passage retriever. arXiv preprint arXiv:2206.10658 (2022) Sachan, D.S., Lewis, M., Yogatama, D., Zettlemoyer, L., Pineau, J., Zaheer, M.: Questions are all you need to train a dense passage retriever. arXiv preprint arXiv:​2206.​10658 (2022)
36.
go back to reference Yang, Z., et al.: Hotpotqa: a dataset for diverse, explainable multi-hop question answering. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2369–2380 (2018) Yang, Z., et al.: Hotpotqa: a dataset for diverse, explainable multi-hop question answering. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 2369–2380 (2018)
Metadata
Title
Visconde: Multi-document QA with GPT-3 and Neural Reranking
Authors
Jayr Pereira
Robson Fidalgo
Roberto Lotufo
Rodrigo Nogueira
Copyright Year
2023
DOI
https://doi.org/10.1007/978-3-031-28238-6_44