ABSTRACT
Capturing knowledge from free-form evaluative texts about an entity is a challenging task. New techniques of feature extraction, polarity determination and strength evaluation have been proposed. Feature extraction is particularly important to the task as it provides the underpinnings of the extracted knowledge. The work in this paper introduces an improved method for feature extraction that draws on an existing unsupervised method. By including user-specific prior knowledge of the evaluated entity, we turn the task of feature extraction into one of term similarity by mapping crude (learned) features into a user-defined taxonomy of the entity's features. Results show promise both in terms of the accuracy of the mapping as well as the reduction in the semantic redundancy of crude features.
- AAAI. Spring Symposium on Exploring Attitude and Effect in Text: Theories and Apps, 2004.Google Scholar
- R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc. 20th Int'l Conf. VLDB, 1994. Google ScholarDigital Library
- A. Budanitsky and G. Hirst. Semantic distance in wordnet: An experimental, application-oriented evaluation of five measures. In Workshop on WordNet and Other Lexical Resources, NAACL, 2001.Google Scholar
- G. Carenini and J. D. Moore. An empirical study of the influence of user tailoring on evaluative argument effectiveness. In Proc. IJCAI, 2001. Google ScholarDigital Library
- I. Dagan. Contextual Word Similarity, chapter 19, pages 459--476. Marcel Dekker Inc, 2000.Google Scholar
- European Conference on AI. Workshop on ML for Information Extraction, 2000.Google Scholar
- Z. S. Harris. Mathematical Structures of Language. Wiley, 1968.Google Scholar
- M. Hu and B. Liu. Feature based summary of customer reviews dataset. http://www.cs.uic.edu/~liub/FBS/FBS.html, 2004.Google Scholar
- M. Hu and B. Liu. Mining and summarizing customer reviews. In Proc. ACM SIGKDD, 2004. Google ScholarDigital Library
- M. Hu and B. Liu. Mining opinion features in customer reviews. In Proc. AAAI, 2004. Google ScholarDigital Library
- D. Kushal, S. Lawrence, and D. M. Pennock. Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proc. 12th Int'l Conf. on WWW, 2003. Google ScholarDigital Library
- D. Lin. An information-theoretic definition of similarity. In Proc. 2nd Int'l Conf. on ML, 1998. Google ScholarDigital Library
- G. A. Miller. An online lexical database. Int'l Journal of Lexicography, 3(4):235--312, 1990.Google ScholarCross Ref
- S. Morinaga, K. Yamanishi, K. Tateishi, and T. Fukushima. Mining product reputations on the web. In Proc. ACM SIGKDD, 2002. Google ScholarDigital Library
- S. Patwardhan and T. Pedersen. Cpan module wordnet::similarity, 2003.Google Scholar
- P. Resnik. Using information content to evaluate semantic similarity in a taxonomy. In IJCAI, 1995. Google ScholarDigital Library
- J. Weeds, D. Weir, and D. McCarthy. Characterising measures of lexical distributional similarity. In Proc. COLING, 2004. Google ScholarDigital Library
- T. Wilson, J. Wiebe, and R. Hwa. Just how mad are you? finding strong and weak opinion clauses. In Proc. AAAI, 2004. Google ScholarDigital Library
- I. Zukerman, S. George, and Y. Wen. Lexical paraphrasing for document retrieval and node identification. In 2nd Inter. Workshop on Paraphrasing (IWP2003), 2003. Google ScholarDigital Library
Index Terms
- Extracting knowledge from evaluative text
Recommendations
Unsupervised product feature extraction for feature-oriented opinion determination
Proposes an unsupervised method of product feature extraction by using comparative domain corpora.A novel term similarity measure is introduced to evaluate the association of candidate features and domain entities.Proposes feature-oriented opinion ...
Multi-document Hyperedge-based Ranking for Text Summarization
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge ManagementIn a multi-document settings, graph-based extractive summarization approaches build a similarity graph out of sentences in each cluster of documents then use graph centrality approaches to measure the importance of sentences. The similarity is computed ...
Exploring events and distributed representations of text in multi-document summarization
We explore an event detection framework to improve multi-document summarizationWe use distributed representations of text to address different lexical realizationsSummarization is based on the hierarchical combination of single-document summariesWe ...
Comments