Skip to main content

2018 | OriginalPaper | Buchkapitel

Key Terms Guided Expansion for Verbose Queries in Medical Domain

verfasst von : Yue Wang, Hui Fang

Erschienen in: Information Retrieval Technology

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Due to the complex nature of medical concepts and information need, the queries tend to be verbose in medical domain. Verbose queries lead to sub-optimal performance since the current search engine promotes the results covering every query term, but not the truly important ones. Key term extraction has been studied to solve this problem, but another problem, i.e., vocabulary gap between query and documents, need to be discussed. Although various query expansion techniques have been well studied for the vocabulary gap problem, existing methods suffer different drawbacks such as inefficiency and expansion term mismatch. In this work, we propose to solve this problem by following the intuition that the surrounding contexts of the important terms in the original query should also be essential for retrieval. Specifically, we first identify the key terms from the verbose query and then locate the contexts of these key terms in the original document collection. The terms in the contexts are weighted and aggregated to select the expansion terms. We conduct experiments with five TREC data collections using the proposed methods. The results show that the improvement of the retrieval performance of proposed method is statistically significant comparing with the baseline methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Roberts, K., Demner-Fushman, D., Voorhees, E.M., Hersh, W.R.: Overview of the TREC 2016 clinical decision support track. In: TREC (2016) Roberts, K., Demner-Fushman, D., Voorhees, E.M., Hersh, W.R.: Overview of the TREC 2016 clinical decision support track. In: TREC (2016)
2.
Zurück zum Zitat Gupta, M., Bendersky, M.: Information retrieval with verbose queries. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2015, pp. 1121–1124 (2015) Gupta, M., Bendersky, M.: Information retrieval with verbose queries. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2015, pp. 1121–1124 (2015)
4.
Zurück zum Zitat Bendersky, M., Croft, W.B.: Discovering key concepts in verbose queries. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2008, pp. 491–498 (2008) Bendersky, M., Croft, W.B.: Discovering key concepts in verbose queries. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2008, pp. 491–498 (2008)
5.
Zurück zum Zitat Turney, P.D.: Learning algorithms for keyphrase extraction. Inf. Retrieval 2(4), 303–336 (2000)CrossRef Turney, P.D.: Learning algorithms for keyphrase extraction. Inf. Retrieval 2(4), 303–336 (2000)CrossRef
6.
Zurück zum Zitat Balasubramanian, N., Kumaran, G., Carvalho, V.R.: Exploring reductions for long web queries. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2010, pp. 571–578 (2010) Balasubramanian, N., Kumaran, G., Carvalho, V.R.: Exploring reductions for long web queries. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2010, pp. 571–578 (2010)
7.
Zurück zum Zitat Wang, Y., Fang, H.: Extracting useful information from clinical notes. In: TREC 2016 (2016) Wang, Y., Fang, H.: Extracting useful information from clinical notes. In: TREC 2016 (2016)
8.
Zurück zum Zitat Paik, J.H., Oard, D.W.: A fixed-point method for weighting terms in verbose informational queries. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, pp. 131–140 (2014) Paik, J.H., Oard, D.W.: A fixed-point method for weighting terms in verbose informational queries. In: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, pp. 131–140 (2014)
9.
Zurück zum Zitat Zhao, L., Callan, J.: Term necessity prediction. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, CIKM 2010, pp. 259–268 (2010) Zhao, L., Callan, J.: Term necessity prediction. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, CIKM 2010, pp. 259–268 (2010)
10.
Zurück zum Zitat Lease, M.: An improved Markov random field model for supporting verbose queries. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2009, pp. 476–483 (2009) Lease, M.: An improved Markov random field model for supporting verbose queries. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2009, pp. 476–483 (2009)
11.
Zurück zum Zitat Bonchi, F., Perego, R., Silvestri, F., Vahabi, H., Venturini, R.: Recommendations for the long tail by term-query graph. In: Proceedings of the 20th International Conference Companion on World Wide Web, WWW 2011, pp. 15–16 (2011) Bonchi, F., Perego, R., Silvestri, F., Vahabi, H., Venturini, R.: Recommendations for the long tail by term-query graph. In: Proceedings of the 20th International Conference Companion on World Wide Web, WWW 2011, pp. 15–16 (2011)
12.
Zurück zum Zitat Xue, X., Jeon, J., Croft, W.B.: Retrieval models for question and answer archives. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2008, pp. 475–482 (2008) Xue, X., Jeon, J., Croft, W.B.: Retrieval models for question and answer archives. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2008, pp. 475–482 (2008)
14.
Zurück zum Zitat Bhogal, J., Macfarlane, A., Smith, P.: A review of ontology based query expansion. Inf. Process. Manage. 43, 866–886 (2007)CrossRef Bhogal, J., Macfarlane, A., Smith, P.: A review of ontology based query expansion. Inf. Process. Manage. 43, 866–886 (2007)CrossRef
15.
Zurück zum Zitat Buckley, C.: Automatic query expansion using smart: TREC 3. In: Proceedings of The third Text Retrieval Conference, pp. 69–80 (1994) Buckley, C.: Automatic query expansion using smart: TREC 3. In: Proceedings of The third Text Retrieval Conference, pp. 69–80 (1994)
16.
Zurück zum Zitat Salton, G., Buckley, C.: Readings in Information Retrieval, pp. 355–364. Morgan Kaufmann Publishers Inc., San Francisco (1997) Salton, G., Buckley, C.: Readings in Information Retrieval, pp. 355–364. Morgan Kaufmann Publishers Inc., San Francisco (1997)
17.
Zurück zum Zitat Rocchio, J.J.: Relevance feedback in information retrieval. In: The Smart Retrieval System - Experiments in Automatic Document Processing, pp. 313–323 (1971) Rocchio, J.J.: Relevance feedback in information retrieval. In: The Smart Retrieval System - Experiments in Automatic Document Processing, pp. 313–323 (1971)
18.
Zurück zum Zitat Chen, C., Chunyan, H., Xiaojie, Y.: Relevance feedback fusion via query expansion. In: 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, pp. 122–126 (2012) Chen, C., Chunyan, H., Xiaojie, Y.: Relevance feedback fusion via query expansion. In: 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, pp. 122–126 (2012)
19.
Zurück zum Zitat Wang, Y., Fang, H.: Exploring the query expansion methods for concept based representation. In: The Twenty-Third Text Retrieval Conference Proceedings, TREC 2014 (2014) Wang, Y., Fang, H.: Exploring the query expansion methods for concept based representation. In: The Twenty-Third Text Retrieval Conference Proceedings, TREC 2014 (2014)
20.
Zurück zum Zitat Xu, J., Croft, W.B.: Query expansion using local and global document analysis. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1996, pp. 4–11 (1996) Xu, J., Croft, W.B.: Query expansion using local and global document analysis. In: Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1996, pp. 4–11 (1996)
21.
Zurück zum Zitat Lv, Y., Zhai, C.: Positional relevance model for pseudo-relevance feedback. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2010, pp. 579–586 (2010) Lv, Y., Zhai, C.: Positional relevance model for pseudo-relevance feedback. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2010, pp. 579–586 (2010)
22.
Zurück zum Zitat Ermakova, L., Mothe, J., Nikitina, E.: Proximity relevance model for query expansion. In: Proceedings of the 31st Annual ACM Symposium on Applied Computing, SAC 2016, pp. 1054–1059 (2016) Ermakova, L., Mothe, J., Nikitina, E.: Proximity relevance model for query expansion. In: Proceedings of the 31st Annual ACM Symposium on Applied Computing, SAC 2016, pp. 1054–1059 (2016)
23.
Zurück zum Zitat Zamani, H., Croft, W.B.: Relevance-based word embedding. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2017, pp. 505–514 (2017) Zamani, H., Croft, W.B.: Relevance-based word embedding. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2017, pp. 505–514 (2017)
24.
Zurück zum Zitat Martinez, D., Otegi, A., Soroa, A., Agirre, E.: Improving search over electronic health records using UMLS-based query expansion through random walks. J. Biomed. Inform. 51, 100–106 (2014)CrossRef Martinez, D., Otegi, A., Soroa, A., Agirre, E.: Improving search over electronic health records using UMLS-based query expansion through random walks. J. Biomed. Inform. 51, 100–106 (2014)CrossRef
25.
Zurück zum Zitat Zhu, D., Carterette, B.: Combining multi-level evidence for medical record retrieval. In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing (SHB 2012), pp. 49–56 (2012) Zhu, D., Carterette, B.: Combining multi-level evidence for medical record retrieval. In: Proceedings of the 2012 International Workshop on Smart Health and Wellbeing (SHB 2012), pp. 49–56 (2012)
26.
Zurück zum Zitat Wang, Y., Fang, H.: Exploring the query expansion methods for concept based representation. In: TREC 2014 (2014) Wang, Y., Fang, H.: Exploring the query expansion methods for concept based representation. In: TREC 2014 (2014)
27.
Zurück zum Zitat Limsopatham, N., Macdonald, C., Ounis, I.: Learning to combine representations for medical records search. In: Proceedings of SIGIR 2013 (2013) Limsopatham, N., Macdonald, C., Ounis, I.: Learning to combine representations for medical records search. In: Proceedings of SIGIR 2013 (2013)
28.
Zurück zum Zitat Wang, Y., Liu, X., Fang, H.: A study of concept-based weighting regularization for medical records search. In: ACL 2014(2014) Wang, Y., Liu, X., Fang, H.: A study of concept-based weighting regularization for medical records search. In: ACL 2014(2014)
29.
Zurück zum Zitat Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems, pp. 179–214 (2004)CrossRef Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems, pp. 179–214 (2004)CrossRef
30.
Zurück zum Zitat Lavrenko, V., Croft, W.B.: Relevance based language models. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2001, pp. 120–127 (2001) Lavrenko, V., Croft, W.B.: Relevance based language models. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2001, pp. 120–127 (2001)
31.
Zurück zum Zitat Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. Computing Research Repository (2013) Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. Computing Research Repository (2013)
Metadaten
Titel
Key Terms Guided Expansion for Verbose Queries in Medical Domain
verfasst von
Yue Wang
Hui Fang
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-03520-4_14

Neuer Inhalt