2017 | Original Paper | Book Chapter

Holographic Lexical Chain and Its Application in Chinese Text Summarization

Authors: Shengluan Hou, Yu Huang, Chaoqun Fei, Shuhan Zhang, Ruqian Lu

Published in: Web and Big Data

Publisher: Springer International Publishing

Abstract

Lexical chains have been widely used in many areas of NLP. However, when we used them for Web text summarization, especially domain-specific text summarization, the results had low accuracy. The main reason is that traditional lexical chains take only nouns into consideration, so the information carried by other parts of speech is lost. We therefore introduce separate lexical chains for predicates and for adjectives (adverbs). Together with noun chains, these three types of lexical chains are called holographic lexical chains (HLCs), and they capture most of the information contained in the text. A specifically designed construction method for HLCs is presented. We applied the HLC method to Chinese text summarization, using machine learning methods whose features are adapted to the new method. In a comparative study on Chinese foreign trade texts, we obtained summarization results with an accuracy of 86.88%. Our HLC construction method improved accuracy by 7.02% over the best previously known methods in Chinese text summarization.
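The idea of keeping one set of chains per part-of-speech group can be illustrated with a small sketch. This is not the authors' construction method, which the abstract does not describe. It assumes pre-tagged input, hypothetical coarse tags ("n", "v", "a", "d"), a toy synonym table in place of a real thesaurus, and a simple chain-length score in place of the machine learning features mentioned above. English tokens are used only for readability.

```python
from collections import defaultdict

# Hypothetical coarse POS tags mapped to the three chain types (an assumption).
CHAIN_TYPE = {"n": "noun", "v": "predicate", "a": "modifier", "d": "modifier"}

# Toy synonym groups standing in for a thesaurus lookup (an assumption).
SYNONYMS = [{"price", "cost"}, {"rise", "increase"}]


def related(w1, w2):
    """Two words are related if identical or in the same toy synonym group."""
    return w1 == w2 or any(w1 in g and w2 in g for g in SYNONYMS)


def build_chains(tagged_sentences):
    """Greedily build chains, kept separately for each chain type.

    tagged_sentences: list of sentences, each a list of (word, pos) pairs.
    Returns {chain_type: [chain, ...]}; a chain is a list of (word, sent_idx).
    """
    chains = defaultdict(list)
    for idx, sentence in enumerate(tagged_sentences):
        for word, pos in sentence:
            kind = CHAIN_TYPE.get(pos)
            if kind is None:
                continue  # ignore tags outside the three groups
            for chain in chains[kind]:
                if any(related(word, member) for member, _ in chain):
                    chain.append((word, idx))
                    break
            else:
                # No related chain of this type exists yet: start a new one.
                chains[kind].append([(word, idx)])
    return dict(chains)


def score_sentences(tagged_sentences, chains):
    """Score each sentence by the total length of the chains it takes part in."""
    scores = [0] * len(tagged_sentences)
    for kind_chains in chains.values():
        for chain in kind_chains:
            for sent_idx in {i for _, i in chain}:
                scores[sent_idx] += len(chain)
    return scores


doc = [
    [("price", "n"), ("rise", "v"), ("sharply", "d")],
    [("cost", "n"), ("increase", "v")],
    [("weather", "n"), ("nice", "a")],
]
chains = build_chains(doc)
print(score_sentences(doc, chains))  # [5, 4, 2]
```

In this toy document the first two sentences share both a noun chain and a predicate chain, so they outscore the third. A noun-only chainer would miss the predicate link between "rise" and "increase", which is the kind of information loss the abstract attributes to traditional lexical chains.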

Metadata
Title
Holographic Lexical Chain and Its Application in Chinese Text Summarization
Authors
Shengluan Hou
Yu Huang
Chaoqun Fei
Shuhan Zhang
Ruqian Lu
Copyright year
2017
DOI
https://doi.org/10.1007/978-3-319-63579-8_21