2020 | OriginalPaper | Book chapter

Extracting Hierarchical Relations Between the Back-of-the-Book Index Terms

Authors: Ning Li, Meng Tian, Shuqi Lv

Published in: Chinese Lexical Semantics

Publisher: Springer International Publishing

Abstract

A single-level back-of-the-book index cannot fully capture the semantic relations between index terms. To address this, the paper proposes a method for extracting hierarchical relations between index terms that combines lexical-syntactic analysis with text structure features. The method first organizes the index terms according to text structure features and constructs index term pairs with hierarchical relations step by step. It then computes the semantic similarity of each term pair from word vectors in order to eliminate misidentified pairs. Finally, the remaining pairs are optimized in a directed graph to remove redundant and conflicting relations, which yields the hierarchical index system. Compared with other results, the method improves precision and F value by 11.44% and 5.65%, respectively.
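The last two stages described in the abstract (similarity filtering and graph optimization) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the toy two-dimensional vectors, the 0.5 threshold, and the definitions used here are all assumptions made for illustration: a "conflict" is taken to mean a pair that appears in both directions, and a "redundant" relation is taken to mean an edge that is already implied by a longer path.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def filter_pairs(pairs, vectors, threshold=0.5):
    """Keep (parent, child) pairs whose word vectors are similar enough.
    The threshold value is a hypothetical choice, not taken from the paper."""
    return [(p, c) for p, c in pairs
            if cosine(vectors[p], vectors[c]) >= threshold]


def remove_conflicts(pairs):
    """Assumed conflict rule: if both (a, b) and (b, a) occur, keep the first
    one seen. Exact duplicates are dropped too. Longer cycles are not handled."""
    seen, kept = set(), []
    for p, c in pairs:
        if (p, c) in seen or (c, p) in seen:
            continue
        seen.add((p, c))
        kept.append((p, c))
    return kept


def _reachable(graph, start, goal, skip_edge):
    """Depth-first search from start to goal that ignores skip_edge."""
    stack, visited = [start], {start}
    while stack:
        node = stack.pop()
        for nxt in graph.get(node, ()):
            if (node, nxt) == skip_edge:
                continue
            if nxt == goal:
                return True
            if nxt not in visited:
                visited.add(nxt)
                stack.append(nxt)
    return False


def remove_redundant(pairs):
    """Drop an edge (a, c) when another path a -> ... -> c exists.
    Assumes the graph is acyclic after conflict removal."""
    graph = {}
    for p, c in pairs:
        graph.setdefault(p, set()).add(c)
    return [(p, c) for p, c in pairs
            if not _reachable(graph, p, c, (p, c))]


def build_hierarchy(pairs, vectors, threshold=0.5):
    """Filter by similarity, then clean the directed graph of relations."""
    kept = filter_pairs(pairs, vectors, threshold)
    return remove_redundant(remove_conflicts(kept))


# Toy data, invented for illustration only.
vectors = {
    "data structure": [1.0, 0.2],
    "tree": [0.9, 0.3],
    "binary tree": [0.8, 0.4],
    "preface": [0.0, 1.0],
}
candidate_pairs = [
    ("data structure", "tree"),
    ("tree", "binary tree"),
    ("data structure", "binary tree"),  # implied by the two edges above
    ("binary tree", "tree"),            # reverse of an earlier pair
    ("data structure", "preface"),      # low similarity
]
hierarchy = build_hierarchy(candidate_pairs, vectors)
print(hierarchy)
```

On this toy input the low-similarity pair is filtered out, the reversed pair is discarded, and the direct edge from "data structure" to "binary tree" is dropped because the path through "tree" already implies it. In practice the vectors would come from a trained word embedding model, and the conflict rule would need to cover longer cycles as well.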

Metadata
Title
Extracting Hierarchical Relations Between the Back-of-the-Book Index Terms
Authors
Ning Li
Meng Tian
Shuqi Lv
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-38189-9_45