
2020 | OriginalPaper | Book chapter

Research on Extraction of Simple Modifier-Head Chunks Based on Corpus

Written by: Wang Chengwen, Zhang Zheng, Rao Gaoqi, Xun Endong, Miao Jingjing

Published in: Chinese Lexical Semantics

Publisher: Springer International Publishing


Abstract

This study aims to automatically extract a set of simple modifier-head chunks from a large-scale corpus. Based on an analysis of how simple modifier-head chunks are distributed in actual usage, a set of formal extraction rules is formulated and a rule-based automatic extraction algorithm is designed. In a random-sampling experiment, the extraction results reach a precision of 82.63%, which sheds light on knowledge extraction from large-scale corpora.
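The random-sampling evaluation mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the sample size or how correctness was judged, so the function name `sample_precision`, the `is_correct` callback, and the toy data below are all hypothetical; this shows the general idea of estimating precision from a random sample, not the authors' procedure.

```python
import random
from typing import Callable, Sequence


def sample_precision(
    candidates: Sequence[str],
    is_correct: Callable[[str], bool],
    sample_size: int,
    seed: int = 0,
) -> float:
    """Estimate precision from a random sample of extracted candidates.

    `is_correct` stands in for a human judgment (an assumption here).
    """
    if not candidates or sample_size <= 0:
        raise ValueError("need at least one candidate and a positive sample size")
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    sample = rng.sample(list(candidates), min(sample_size, len(candidates)))
    correct = sum(1 for chunk in sample if is_correct(chunk))
    return correct / len(sample)


# Toy data: 10 extracted candidates, 8 of which a judge would accept.
toy_candidates = [f"chunk{i}" for i in range(10)]
toy_gold = set(toy_candidates[:8])
toy_precision = sample_precision(toy_candidates, lambda c: c in toy_gold, sample_size=10)
print(f"precision on sample: {toy_precision:.2%}")
```

Precision here is simply the share of sampled candidates judged correct, so it says nothing about recall, that is, how many valid chunks in the corpus the rules missed.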


Footnotes
1
“Adjective + noun” modifier-head chunks in Chinese mainly consist of collocations of the form “disyllabic adjective + de + disyllabic noun”, such as “[Chinese example]” (magnificent + de + architecture: “a magnificent building”).
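The pattern in this footnote can be sketched as a simple rule over segmented, part-of-speech-tagged text. This is only an illustration under stated assumptions: the tag names `a` (adjective) and `n` (noun), the function `extract_adj_de_noun`, and the example sentence are hypothetical and are not taken from the paper, whose actual rule set is not given on this page. Word length in characters is used as a stand-in for syllable count.

```python
from typing import List, Tuple

Token = Tuple[str, str]  # (word, POS tag); the tag names are assumptions


def extract_adj_de_noun(tokens: List[Token]) -> List[str]:
    """Return chunks matching 'disyllabic adjective + 的 (de) + disyllabic noun'."""
    chunks = []
    for i in range(len(tokens) - 2):
        (w1, p1), (w2, _), (w3, p3) = tokens[i:i + 3]
        is_adj = p1 == "a" and len(w1) == 2   # two characters, roughly two syllables
        is_noun = p3 == "n" and len(w3) == 2
        if is_adj and w2 == "的" and is_noun:
            chunks.append(w1 + w2 + w3)
    return chunks


# Illustrative sentence (not from the paper): "This is a magnificent building."
sentence = [("这", "r"), ("是", "v"), ("宏伟", "a"), ("的", "u"), ("建筑", "n"), ("。", "w")]
chunks = extract_adj_de_noun(sentence)
print(chunks)
```

A monosyllabic adjective such as 大 would not match this rule, which reflects the footnote's restriction to disyllabic words on both sides of de.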
 
Metadata
Title
Research on Extraction of Simple Modifier-Head Chunks Based on Corpus
Authors
Wang Chengwen
Zhang Zheng
Rao Gaoqi
Xun Endong
Miao Jingjing
Copyright year
2020
DOI
https://doi.org/10.1007/978-3-030-38189-9_37