Skip to main content
Top

2017 | OriginalPaper | Chapter

Concept-Based Compound Keyword Extraction Based on Using Sentential Distance, Conceptual Distance and Production Rules: Calculation of the Keyword Importance

Author : Samuel Sangkon Lee

Published in: Advances in Computer Science and Ubiquitous Computing

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Humans can read a document and conceptually organize its contents into few compound keywords that capture the essence of the topic of a document. Based on this information, this study proposes a method for extracting keywords that gives the gist of a document. It uses a set of academic papers as test data to set up a concept-based production rule for forming compound keywords even when author-provided keywords do not appear in the text body of a document. It also proposes a method of calculating the importance of keyword in order to refrain from extracting meaningless keywords. Also the validity of extracted keywords was tested using a data set of thesis paper titles and summaries in the field of natural language processing and speech recognition. Comparison of the author-provided keywords to the keyword results of the developed system showed that the developed system was very useful with an accuracy rate as good as up to 96 %.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Nagata, M. et al.: A newspaper keyword gene- ration method based on key-concept extraction. In: Proceedings of the 37th National Convention Information Processing, pp. 1030–1031 (1988) Nagata, M. et al.: A newspaper keyword gene- ration method based on key-concept extraction. In: Proceedings of the 37th National Convention Information Processing, pp. 1030–1031 (1988)
2.
go back to reference Morohashi, M.: Automatic indexing survey. Mag. IPS Jpn 25, 918–925 (1984) Morohashi, M.: Automatic indexing survey. Mag. IPS Jpn 25, 918–925 (1984)
3.
go back to reference Lee, S.S., Shishibori, M., Sumitomo, T., Aoe, J.-I.: Extraction of field-coherent passages. Intl. J. Inf. Process. Manage. 38, 173–207 (2002) Lee, S.S., Shishibori, M., Sumitomo, T., Aoe, J.-I.: Extraction of field-coherent passages. Intl. J. Inf. Process. Manage. 38, 173–207 (2002)
4.
go back to reference Al-Hashemi, R.: Text summarization extraction system (TSES) using extracted keywords. Intl. Arab J. e-Technol. 1, 164–168 (2010) Al-Hashemi, R.: Text summarization extraction system (TSES) using extracted keywords. Intl. Arab J. e-Technol. 1, 164–168 (2010)
5.
go back to reference Chena, Y.-H., Lub, E.J.-L., Tsaib, M.F.: Finding keywords in blogs: efficient keyword extraction in blog mining via user behaviors. Expert Syst. Appl. 41, 663–670 (2014)CrossRef Chena, Y.-H., Lub, E.J.-L., Tsaib, M.F.: Finding keywords in blogs: efficient keyword extraction in blog mining via user behaviors. Expert Syst. Appl. 41, 663–670 (2014)CrossRef
6.
go back to reference Choi, Y.-L., Jeon, W.-S., Yoon, S.-H.: Improving database system performance by applying NoSQL. J. Inf. Process. Syst. (JIPS) 10, 355–364 (2014)CrossRef Choi, Y.-L., Jeon, W.-S., Yoon, S.-H.: Improving database system performance by applying NoSQL. J. Inf. Process. Syst. (JIPS) 10, 355–364 (2014)CrossRef
Metadata
Title
Concept-Based Compound Keyword Extraction Based on Using Sentential Distance, Conceptual Distance and Production Rules: Calculation of the Keyword Importance
Author
Samuel Sangkon Lee
Copyright Year
2017
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-3023-9_69