Article

Mining from open answers in questionnaire data

Authors:
Hang Li

NEC Corporation, 4-1-1 Miyazaki, Miyamae-ku, Kawasaki, Kanagawa, Japan

NEC Corporation, 4-1-1 Miyazaki, Miyamae-ku, Kawasaki, Kanagawa, Japan
View Profile

,
Kenji Yamanishi

NEC Corporation, 4-1-1 Miyazaki, Miyamae-ku, Kawasaki, Kanagawa, Japan

NEC Corporation, 4-1-1 Miyazaki, Miyamae-ku, Kawasaki, Kanagawa, Japan
View Profile

KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data miningAugust 2001Pages 443–449https://doi.org/10.1145/502512.502579

Published:26 August 2001Publication History

KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining

Pages 443–449

ABSTRACT

Surveys are an important part of marketing and customer relationship management, and open answers (i.e., answers to open questions) in particular may contain valuable information and provide an important basis for making business decisions. We have developed a text mining system that provides a new way for analyzing open answers in questionnaire data. The product is able to perform the following two functions: (A) accurate extraction of characteristics for individual analysis targets, (B) accurate extraction of the relationships among characteristics of analysis targets. In this paper, we describe the working of our text mining system. It employs two statistical learning techniques: rule analysis and Correspondence Analysis for performing the two functions. Our text mining system has already been put into use by a number of large corporations in Japan in the performance of text mining on various types of survey data, including open answers about brand images, open answers about company images, complaints about products, comments written on home pages, business reports, and help desk records. In this it has been found to be useful in forming a basis for effective business decisions.

References

1.M.R. Anderberg, Cluster Analysis for Applications, Academic Press, 1973.Google Scholar
2.J.P. Benzecri, Correspondence Analysis Handbook. Mercel Dekker, i 992.Google Scholar
3.Jochen Dorre and Peter Gerstl and Roland Seiffert, Text mining: finding nuggets in mountains of textual data, Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining, 398-401, 1999. Google ScholarDigital Library
4.Ronen Feldman and Ido Dagan, Knowledge discovery in textual databases (KDT), Proceedings of First International Conference on Knowledge Discovery and Data Mining, 1995.Google Scholar
5.Fujitsu, Symfoware World http://www.fuiitsu.co.ip/ip/soft/symfoware/index.html, 2001.Google Scholar
6.Marko Grobelnik, Dunja Mladenic, and Natasa Milic-Fraling (Ed.) Proceedings of KDD-2000 Workshop on Text Mining, 2000.Google Scholar
7.Marti Hearst, Untangling text data mining, Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 3-10, 1999. Google ScholarDigital Library
8.Komatsu Soft, Information Mining Tool VextSearch (in Japanese) http://www.komatsusoft.co.jp/develp/vxtsc/index.html, 2001.Google Scholar
9.Brian Lent and Rakesh Agrawal and Ramakrishnan Srikant, Discovering trends in text databases, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, 227-230, 1997.Google ScholarDigital Library
10.Hang Li and Kenji Yamanishi, Text classification using ESC-based stochastic decision lists, Proceedings of the 8th International Conference on Information and Knowledge Management, 122-130, 1999. Google ScholarDigital Library
11.Hang Li and Kenji Yamanishi, Topic analysis using a finite mixture model, Proceedings of 2000 Joint ACL-SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 35-44, 2000. Google ScholarDigital Library
12.Jorma Rissanen, Fisher information and stochastic complexity, IEEE Transaction on Information Theory, 42(1):40- 47, 1996. Google ScholarDigital Library
13.Russell Swan and James Allan, Extracting significant time varying features from text, Proceedings of the 8th International Conference on Information and Knowledge Management, 45, 1999. Google ScholarDigital Library
14.Mark Shewhart and Mark Wasson, Monitoring a newsfeed for hot topics, Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining, 402-404, 1999. Google ScholarDigital Library
15.Kenji Yamanishi, A learning criterion for stochastic rules, Machine Learning, 9:165-203, 1992. Google ScholarDigital Library
16.Kenji Yamanishi, A decision-theoretic extension of stochastic complexity and its applications to learning, IEEE Transaction on Infortmation Theory.,44(4): 1424-1439, 1998. Google ScholarDigital Library

Index Terms

Mining from open answers in questionnaire data
1. Information systems
  1. Data management systems
    1. Database management system engines
      1. Triggers and rules
  2. Information systems applications
    1. Data mining

Recommendations

Mining Open Answers in Questionnaire Data

Surveys are an important part of marketing and customer relationship management,and open answers (answers to open questions) in particular could contain valuableinformation and provide an important basis for making business decisions. The authors ...
Read More
Mining fuzzy association rules from questionnaire data

Association rule mining is one of most popular data analysis methods that can discover associations within data. Association rule mining algorithms have been applied to various datasets, due to their practical usefulness. Little attention has been paid, ...
Read More
Mining uncertain data

As an important data mining and knowledge discovery task, association rule mining searches for implicit, previously unknown, and potentially useful pieces of information—in the form of rules revealing associative relationships—that are embedded in the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
August 2001
493 pages
ISBN:158113391X
DOI:10.1145/502512
Conference Chair:
Doheon Lee
Chonnam National University, Korea
,
General Chair:
Mario Schkolnick
SGI
,
Program Chairs:
Foster Provost
New York University
,
Ramakrishnan Srikant
IBM Almaden Research Center
Copyright © 2001 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 26 August 2001
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Association Rules
Classification Rules
Correspondence Analysis
Open Question
Questionnaire Data
Survey
Text Mining
Qualifiers
- Article
Conference

Acceptance Rates
KDD '01 Paper Acceptance Rate31of237submissions,13%Overall Acceptance Rate1,133of8,635submissions,13%
More
Upcoming Conference
KDD '24

Sponsor:

sigkdd

sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 22
  Total Citations
  View Citations
- 1,845
  Total Downloads
- Downloads (Last 12 months)15
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Mining from open answers in questionnaire data

KDD '01: Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Mining Open Answers in Questionnaire Data

Mining fuzzy association rules from questionnaire data

Mining uncertain data