Skip to main content

2022 | OriginalPaper | Buchkapitel

Passage Retrieval on Structured Documents Using Graph Attention Networks

verfasst von : Lucas Albarede, Philippe Mulhem, Lorraine Goeuriot, Claude Le Pape-Gardeux, Sylvain Marie, Trinidad Chardin-Segui

Erschienen in: Advances in Information Retrieval

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Passage Retrieval systems aim at retrieving and ranking small text units according to their estimated relevance to a query. A usual practice is to consider the context a passage appears in (its containing document, neighbour passages, etc.) to improve its relevance estimation. In this work, we study the use of Graph Attention Networks (GATs), a graph node embedding method, to perform passage contextualization. More precisely, we first propose a document graph representation based on several inter- and intra-document relations. Then, we investigate two ways of leveraging the use of GATs on this representation in order to incorporate contextual information for passage retrieval. We evaluate our approach on a Passage Retrieval task for structured documents: CLEF-IP2013. Our results show that our document graph representation coupled with the expressive power of GATs allows for a better context representation leading to improved performances.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Andersson, L., Lupu, M., Palotti, J.A., Hanbury, A., Rauber, A.: When is the time ripe for natural language processing for patent passage retrieval? In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM 2016, pp. 1453–1462. Association for Computing Machinery, New York (2016). https://doi.org/10.1145/2983323.2983858 Andersson, L., Lupu, M., Palotti, J.A., Hanbury, A., Rauber, A.: When is the time ripe for natural language processing for patent passage retrieval? In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM 2016, pp. 1453–1462. Association for Computing Machinery, New York (2016). https://​doi.​org/​10.​1145/​2983323.​2983858
3.
Zurück zum Zitat Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate (2016) Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate (2016)
6.
10.
Zurück zum Zitat Guo, J., et al.: A deep look into neural ranking models for information retrieval. Inf. Process. Manag. 57(6), 102067 (2020) Guo, J., et al.: A deep look into neural ranking models for information retrieval. Inf. Process. Manag. 57(6), 102067 (2020)
11.
Zurück zum Zitat Han, F., Niu, D., Lai, K., Guo, W., He, Y., Xu, Y.: Inferring search queries from web documents via a graph-augmented sequence to attention network. In: The World Wide Web Conference, WWW 2019, pp. 2792–2798. Association for Computing Machinery, New York (2019). https://doi.org/10.1145/3308558.3313746 Han, F., Niu, D., Lai, K., Guo, W., He, Y., Xu, Y.: Inferring search queries from web documents via a graph-augmented sequence to attention network. In: The World Wide Web Conference, WWW 2019, pp. 2792–2798. Association for Computing Machinery, New York (2019). https://​doi.​org/​10.​1145/​3308558.​3313746
12.
Zurück zum Zitat Karpukhin, V., et al.: Dense passage retrieval for open-domain question answering (2020) Karpukhin, V., et al.: Dense passage retrieval for open-domain question answering (2020)
14.
Zurück zum Zitat Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2017) Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization (2017)
16.
Zurück zum Zitat Li, X., et al.: Learning better representations for neural information retrieval with graph information. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, CIKM 2020, pp. 795–804. Association for Computing Machinery, New York (2020). https://doi.org/10.1145/3340531.3411957 Li, X., et al.: Learning better representations for neural information retrieval with graph information. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management, CIKM 2020, pp. 795–804. Association for Computing Machinery, New York (2020). https://​doi.​org/​10.​1145/​3340531.​3411957
17.
Zurück zum Zitat Macdonald, C., McCreadie, R., Santos, R.L., Ounis, I.: From puppy to maturity: experiences in developing terrier. In: Proceedings of OSIR at SIGIR, pp. 60–63 (2012) Macdonald, C., McCreadie, R., Santos, R.L., Ounis, I.: From puppy to maturity: experiences in developing terrier. In: Proceedings of OSIR at SIGIR, pp. 60–63 (2012)
19.
Zurück zum Zitat Mahdabi, P., Gerani, S., Huang, J.X., Crestani, F.: Leveraging conceptual lexicon: query disambiguation using proximity information for patent retrieval. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2013, pp. 113–122. Association for Computing Machinery, New York (2013). https://doi.org/10.1145/2484028.2484056 Mahdabi, P., Gerani, S., Huang, J.X., Crestani, F.: Leveraging conceptual lexicon: query disambiguation using proximity information for patent retrieval. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2013, pp. 113–122. Association for Computing Machinery, New York (2013). https://​doi.​org/​10.​1145/​2484028.​2484056
21.
Zurück zum Zitat Murdock, V., Croft, W.B.: A translation model for sentence retrieval. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pp. 684–691. Association for Computational Linguistics, Vancouver (2005). https://www.aclweb.org/anthology/H05-1086 Murdock, V., Croft, W.B.: A translation model for sentence retrieval. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pp. 684–691. Association for Computational Linguistics, Vancouver (2005). https://​www.​aclweb.​org/​anthology/​H05-1086
23.
Zurück zum Zitat Norozi, M.A., Arvola, P.: Kinship contextualization: Utilizing the preceding and following structural elements. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2013, pp. 837–840. Association for Computing Machinery, New York (2013). https://doi.org/10.1145/2484028.2484111 Norozi, M.A., Arvola, P.: Kinship contextualization: Utilizing the preceding and following structural elements. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2013, pp. 837–840. Association for Computing Machinery, New York (2013). https://​doi.​org/​10.​1145/​2484028.​2484111
24.
Zurück zum Zitat Norozi, M.A., Arvola, P., de Vries, A.P.: Contextualization using hyperlinks and internal hierarchical structure of wikipedia documents. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM 2012, pp. 734–743. Association for Computing Machinery, New York (2012). https://doi.org/10.1145/2396761.2396855 Norozi, M.A., Arvola, P., de Vries, A.P.: Contextualization using hyperlinks and internal hierarchical structure of wikipedia documents. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM 2012, pp. 734–743. Association for Computing Machinery, New York (2012). https://​doi.​org/​10.​1145/​2396761.​2396855
25.
Zurück zum Zitat Norozi, M.A., de Vries, A.P., Arvola, P.: Contextualization from the bibliographic structure (2012) Norozi, M.A., de Vries, A.P., Arvola, P.: Contextualization from the bibliographic structure (2012)
27.
Zurück zum Zitat Robertson, S., Walker, S., Jones, S., Hancock-Beaulieu, M., Gatford, M.: Okapi at trec-3, pp. 109–126 (1996) Robertson, S., Walker, S., Jones, S., Hancock-Beaulieu, M., Gatford, M.: Okapi at trec-3, pp. 109–126 (1996)
28.
Zurück zum Zitat Sheetrit, E., Shtok, A., Kurland, O.: A passage-based approach to learning to rank documents (2019) Sheetrit, E., Shtok, A., Kurland, O.: A passage-based approach to learning to rank documents (2019)
29.
Zurück zum Zitat Vaswani, A., et al.: Attention is all you need (2017) Vaswani, A., et al.: Attention is all you need (2017)
30.
Zurück zum Zitat Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks (2018) Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks (2018)
32.
Zurück zum Zitat Xiong, L., et al.: Approximate nearest neighbor negative contrastive learning for dense text retrieval (2020) Xiong, L., et al.: Approximate nearest neighbor negative contrastive learning for dense text retrieval (2020)
33.
Zurück zum Zitat Xue, X., Croft, W.B.: Automatic query generation for patent search. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, pp. 2037–2040. Association for Computing Machinery, New York (2009). https://doi.org/10.1145/1645953.1646295 Xue, X., Croft, W.B.: Automatic query generation for patent search. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, CIKM 2009, pp. 2037–2040. Association for Computing Machinery, New York (2009). https://​doi.​org/​10.​1145/​1645953.​1646295
34.
Zurück zum Zitat Yu, J., et al.: Modeling text with graph convolutional network for cross-modal information retrieval (2018) Yu, J., et al.: Modeling text with graph convolutional network for cross-modal information retrieval (2018)
Metadaten
Titel
Passage Retrieval on Structured Documents Using Graph Attention Networks
verfasst von
Lucas Albarede
Philippe Mulhem
Lorraine Goeuriot
Claude Le Pape-Gardeux
Sylvain Marie
Trinidad Chardin-Segui
Copyright-Jahr
2022
DOI
https://doi.org/10.1007/978-3-030-99739-7_2