ABSTRACT
Surveys are an important part of marketing and customer relationship management, and open answers (i.e., answers to open questions) in particular may contain valuable information and provide an important basis for making business decisions. We have developed a text mining system that provides a new way for analyzing open answers in questionnaire data. The product is able to perform the following two functions: (A) accurate extraction of characteristics for individual analysis targets, (B) accurate extraction of the relationships among characteristics of analysis targets. In this paper, we describe the working of our text mining system. It employs two statistical learning techniques: rule analysis and Correspondence Analysis for performing the two functions. Our text mining system has already been put into use by a number of large corporations in Japan in the performance of text mining on various types of survey data, including open answers about brand images, open answers about company images, complaints about products, comments written on home pages, business reports, and help desk records. In this it has been found to be useful in forming a basis for effective business decisions.
- 1.M.R. Anderberg, Cluster Analysis for Applications, Academic Press, 1973.Google Scholar
- 2.J.P. Benzecri, Correspondence Analysis Handbook. Mercel Dekker, i 992.Google Scholar
- 3.Jochen Dorre and Peter Gerstl and Roland Seiffert, Text mining: finding nuggets in mountains of textual data, Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining, 398-401, 1999. Google ScholarDigital Library
- 4.Ronen Feldman and Ido Dagan, Knowledge discovery in textual databases (KDT), Proceedings of First International Conference on Knowledge Discovery and Data Mining, 1995.Google Scholar
- 5.Fujitsu, Symfoware World http://www.fuiitsu.co.ip/ip/soft/symfoware/index.html, 2001.Google Scholar
- 6.Marko Grobelnik, Dunja Mladenic, and Natasa Milic-Fraling (Ed.) Proceedings of KDD-2000 Workshop on Text Mining, 2000.Google Scholar
- 7.Marti Hearst, Untangling text data mining, Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 3-10, 1999. Google ScholarDigital Library
- 8.Komatsu Soft, Information Mining Tool VextSearch (in Japanese) http://www.komatsusoft.co.jp/develp/vxtsc/index.html, 2001.Google Scholar
- 9.Brian Lent and Rakesh Agrawal and Ramakrishnan Srikant, Discovering trends in text databases, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, 227-230, 1997.Google ScholarDigital Library
- 10.Hang Li and Kenji Yamanishi, Text classification using ESC-based stochastic decision lists, Proceedings of the 8th International Conference on Information and Knowledge Management, 122-130, 1999. Google ScholarDigital Library
- 11.Hang Li and Kenji Yamanishi, Topic analysis using a finite mixture model, Proceedings of 2000 Joint ACL-SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 35-44, 2000. Google ScholarDigital Library
- 12.Jorma Rissanen, Fisher information and stochastic complexity, IEEE Transaction on Information Theory, 42(1):40- 47, 1996. Google ScholarDigital Library
- 13.Russell Swan and James Allan, Extracting significant time varying features from text, Proceedings of the 8th International Conference on Information and Knowledge Management, 45, 1999. Google ScholarDigital Library
- 14.Mark Shewhart and Mark Wasson, Monitoring a newsfeed for hot topics, Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining, 402-404, 1999. Google ScholarDigital Library
- 15.Kenji Yamanishi, A learning criterion for stochastic rules, Machine Learning, 9:165-203, 1992. Google ScholarDigital Library
- 16.Kenji Yamanishi, A decision-theoretic extension of stochastic complexity and its applications to learning, IEEE Transaction on Infortmation Theory.,44(4): 1424-1439, 1998. Google ScholarDigital Library
Index Terms
- Mining from open answers in questionnaire data
Recommendations
Mining Open Answers in Questionnaire Data
Surveys are an important part of marketing and customer relationship management,and open answers (answers to open questions) in particular could contain valuableinformation and provide an important basis for making business decisions. The authors ...
Mining fuzzy association rules from questionnaire data
Association rule mining is one of most popular data analysis methods that can discover associations within data. Association rule mining algorithms have been applied to various datasets, due to their practical usefulness. Little attention has been paid, ...
Mining uncertain data
As an important data mining and knowledge discovery task, association rule mining searches for implicit, previously unknown, and potentially useful pieces of information—in the form of rules revealing associative relationships—that are embedded in the ...
Comments