Weitere Kapitel dieses Buchs durch Wischen aufrufen
Question answering field has evolved alongside the Natural Language processing. There are several small-scale applications that use the linguistic, semantic, and syntactic interpretations of text and consume it in further processing. In case of a web search, knowing what the query is intended for can save hours of CPU processing and decrease response time tremendously. We have taken a small step in this direction by treating the query as a question and classifying it with the best suited classification algorithm. In this paper, we have tried to find out that when a perfect-informer of the question (knowing what is asked) is provided as an input for classification algorithms like SVN, Naïve Base, and decision tree, we want to observe their accuracy on the same data set of questions. In our experiment, we have used the concept of CRF to find question features that are relevant. CRF is a probabilistic model that treats features as observation sequence and emits all sequence labels with probability values.
Bitte loggen Sie sich ein, um Zugang zu diesem Inhalt zu erhalten
Sie möchten Zugang zu diesem Inhalt erhalten? Dann informieren Sie sich jetzt über unsere Produkte:
Xin Li and D. Roth.: Learning question classifiers: The role of semantic information. Natural Language Engineering. 12(03):229–249 (2006).
Vapnik. V, Golow1ich. S, and Smola. A.J.: Support vector method for function approximation, regression estimation, and signal processing. Advances in Neural Information Processing Systems. MIT Press (1996).
Zhang, D. and Lee. W.: Question classification using support vector machines. SIGIR, pages 26–32, (2003).
Motoda, Hiroshi, and Huan Liu. “Feature selection, extraction and construction.” Communication of IICM (Institute of Information and Computing Machinery, Taiwan) Vol 5, pp: 67–72, (2002).
Lafferty, J., McCallum, A. and Pereira. F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. ICML (2001).
Srivastava, Ashok N., and Mehran Sahami, eds. Text mining: Classification, clustering, and applications. CRC Press, (2009).
Shi, Guoliang, and Yanqing Kong. “Advances in theories and applications of text mining.” Information Science and Engineering (ICISE), 2009 1st International Conference on. IEEE, (2009).
Lowd, D. and Domingos. P.: Naive Bayes Models for Probability Estimation. Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany, pp: 529–536. (2005).
Cohen. S., Rokach. L, and Maimon. O.: Decision-Tree Instance-Space Decomposition with Grouped Gain-Ratio. International Journal of Information Science. Volume 177 Issue 17, pp: 3592–3612. (2007).
- Enhancing Web Search Through Question Classifier
Neha V. Sharma
- Springer Singapore
Neuer Inhalt/© ITandMEDIA, Best Practices für die Mitarbeiter-Partizipation in der Produktentwicklung/© astrosystem | stock.adobe.com