2011 | OriginalPaper | Buchkapitel
Constrained LDA for Grouping Product Features in Opinion Mining
verfasst von : Zhongwu Zhai, Bing Liu, Hua Xu, Peifa Jia
Erschienen in: Advances in Knowledge Discovery and Data Mining
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
In opinion mining of product reviews, one often wants to produce a summary of opinions based on product features. However, for the same feature, people can express it with different words and phrases. To produce an effective summary, these words and phrases, which are domain synonyms, need to be grouped under the same feature. Topic modeling is a suitable method for the task. However, instead of simply letting topic modeling find groupings freely, we believe it is possible to do better by giving it some pre-existing knowledge in the form of automatically extracted constraints. In this paper, we first extend a popular topic modeling method, called Latent Dirichlet Allocation (LDA), with the ability to process
large
scale constraints. Then, two novel methods are proposed to extract two types of constraints automatically. Finally, the resulting
constrained-LDA
and the extracted constraints are applied to group product features. Experiments show that
constrained-LDA
outperforms the original LDA and the latest
mLSA
by a large margin.