Skip to main content

2003 | OriginalPaper | Buchkapitel

Not as Easy as It Seems: Automating the Construction of Lexical Chains Using Roget’s Thesaurus

verfasst von : Mario Jarmasz, Stan Szpakowicz

Erschienen in: Advances in Artificial Intelligence

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Morris and Hirst [10] present a method of linking significant words that are about the same topic. The resulting lexical chains are a means of identifying cohesive regions in a text, with applications in many natural language processing tasks, including text summarization. The first lexical chains were constructed manually using Roget’s International Thesaurus. Morris and Hirst wrote that automation would be straightforward given an electronic thesaurus. All applications so far have used WordNet to produce lexical chains, perhaps because adequate electronic versions of Roget’s were not available until recently. We discuss the building of lexical chains using an electronic version of Roget’s Thesaurus. We implement a variant of the original algorithm, and explain the necessary design decisions. We include a comparison with other implementations.

Metadaten
Titel
Not as Easy as It Seems: Automating the Construction of Lexical Chains Using Roget’s Thesaurus
verfasst von
Mario Jarmasz
Stan Szpakowicz
Copyright-Jahr
2003
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/3-540-44886-1_48

Neuer Inhalt