2018 | Original Paper | Book Chapter

Evaluating Different Similarity Measures for Automatic Biomedical Text Summarization

Authors: Mozhgan Nasr Azadani, Nasser Ghadiri

Published in: Intelligent Systems Design and Applications

Publisher: Springer International Publishing

Abstract

Automatic biomedical text summarization is maturing and can help biomedical researchers access the information they need efficiently. Biomedical summarization approaches often rely on a similarity measure to model the source document, mainly when they employ redundancy removal or graph structures. In this paper, we examine the impact of the similarity measure on the performance of summarization methods. We model the document as a weighted graph. Various similarity measures are used to build different graphs based on biomedical concepts, semantic types, and a combination of the two. We then use the graphs to generate and evaluate automatic summaries. The results suggest that the choice of similarity measure has a substantial effect on summary quality (≈37% improvement in the ROUGE-2 metric and ≈29% in ROUGE-SU4). The results also show that exploiting both biomedical concepts and semantic types yields slightly better performance.
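The idea of swapping similarity measures while keeping the graph model fixed can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not name the measures used, so Jaccard and cosine are assumed here as two common choices. The function names, the toy concept labels, and the weighted-degree ranking step are all hypothetical. In practice, the labels would come from a concept extractor that maps sentences to biomedical concepts or semantic types.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def jaccard(a, b):
    """Jaccard similarity between two bags of labels, treated as sets."""
    sa, sb = set(a), set(b)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def cosine(a, b):
    """Cosine similarity between two bags of labels, using raw counts."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[k] * cb[k] for k in ca.keys() & cb.keys())
    norm_a = sqrt(sum(v * v for v in ca.values()))
    norm_b = sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_graph(sentences, similarity):
    """Return {(i, j): weight} for every sentence pair with non-zero similarity."""
    edges = {}
    for i, j in combinations(range(len(sentences)), 2):
        weight = similarity(sentences[i], sentences[j])
        if weight > 0:
            edges[(i, j)] = weight
    return edges


def rank_sentences(n, edges):
    """Order sentence indices by weighted degree (an assumed, simple ranking)."""
    scores = [0.0] * n
    for (i, j), weight in edges.items():
        scores[i] += weight
        scores[j] += weight
    return sorted(range(n), key=lambda i: scores[i], reverse=True)


# Hypothetical sentences, each already mapped to concept labels.
sentences = [
    ["schizophrenia", "gene", "risk"],
    ["gene", "risk", "bipolar"],
    ["therapy", "outcome"],
]

jaccard_graph = build_graph(sentences, jaccard)
cosine_graph = build_graph(sentences, cosine)
ranking = rank_sentences(len(sentences), jaccard_graph)
```

Passing a different `similarity` function to `build_graph` changes the edge weights without touching the rest of the pipeline, which is the kind of controlled comparison the abstract describes. Using semantic-type labels, or concept and type labels together, would only change the contents of `sentences`.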


Metadata
Title
Evaluating Different Similarity Measures for Automatic Biomedical Text Summarization
Authors
Mozhgan Nasr Azadani
Nasser Ghadiri
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-319-76348-4_30