1999 | OriginalPaper | Book chapter

Use of a Weighted Topic Hierarchy for Document Classification

Authors: Alexander Gelbukh, Grigori Sidorov, Adolfo Guzman-Arénas

Published in: Text, Speech and Dialogue

Publisher: Springer Berlin Heidelberg

A statistical method of document classification driven by a hierarchical topic dictionary is proposed. The method uses a dictionary with a simple structure and is insensitive to inaccuracies in the dictionary. Two kinds of weights of dictionary entries are discussed: relevance weights and discrimination weights. Relevance weights are associated with the links between words and topics and between the nodes in the tree, while discrimination weights depend on the user's database. A way of assigning these weights to the topics that agrees with common sense is presented. Classifier, a text classification system based on this method, is described.
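
The abstract gives no formulas, so the following is only a minimal sketch of how a weighted topic hierarchy might score a document. It is not the authors' implementation. The toy dictionary, the tree, all weight values, the multiplicative propagation rule, and the names `topic_scores` and `classify` are assumptions made for illustration. In particular, the discrimination weights are hard-coded here, whereas the abstract says they depend on the user's database.

```python
from collections import Counter, defaultdict

# Hypothetical toy dictionary: word -> [(topic, relevance weight)].
WORD_LINKS = {
    "guitar": [("music", 1.0)],
    "violin": [("music", 1.0)],
    "goal": [("soccer", 0.8)],
    "referee": [("soccer", 1.0)],
}

# Hypothetical tree links: child topic -> (parent topic, link weight).
PARENT = {
    "music": ("arts", 0.9),
    "soccer": ("sports", 0.9),
}

# Hypothetical discrimination weights. They are fixed constants here;
# in the abstract they depend on the user's document database.
DISCRIMINATION = {"music": 1.0, "soccer": 1.0, "arts": 0.5, "sports": 0.5}


def topic_scores(text):
    """Score every topic reachable from the words of `text`."""
    counts = Counter(text.lower().split())
    scores = defaultdict(float)
    for word, n in counts.items():
        for topic, relevance in WORD_LINKS.get(word, []):
            contribution = n * relevance
            node = topic
            # Walk up the tree, attenuating by each link weight (assumed rule).
            while True:
                scores[node] += contribution
                if node not in PARENT:
                    break
                node, link_weight = PARENT[node]
                contribution *= link_weight
    # Apply the discrimination weight of each topic (assumed multiplicative).
    return {t: s * DISCRIMINATION.get(t, 1.0) for t, s in scores.items()}


def classify(text):
    """Return the highest-scoring topic, or None if no dictionary word occurs."""
    scores = topic_scores(text)
    return max(scores, key=scores.get) if scores else None
```

Under these assumed weights, `classify("the referee disallowed the goal")` selects `soccer`: the leaf topic collects the relevance weights of both matched words, while its parent `sports` receives an attenuated and then down-weighted share. Words missing from the dictionary are simply ignored, which is one simple way a method like this can tolerate gaps in the dictionary.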

Metadata
Title
Use of a Weighted Topic Hierarchy for Document Classification
Authors
Alexander Gelbukh
Grigori Sidorov
Adolfo Guzman-Arénas
Copyright year
1999
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/3-540-48239-3_24