- 1.P. F. Brown, P. V. deSouza, R. L. Mercer, V. J. Della Pietra, and J. C. Lai. Class-based n-gram models of natural language. Computational Linguistics, 18(4):467-479, 1992. Google ScholarDigital Library
- 2.Thomas Cover and Peter Hart. Nearest neighbor pattern classification. IEEE Transactions on Information Theory, 13(1):21-27, 1967.Google ScholarDigital Library
- 3.Thomas M. Cover and Joy A. Thomas. Elements of Information Theory. john Wiley, 1991. Google ScholarDigital Library
- 4.Mark Craven, Daniel DiPasquo, Dayne Freitag, Andrew McCallum, Tom Mitchell, Kamal Nigam, and Sean Slattery. Learning to extract symbolic knowledge from the World Wide Web. In Proceedings of the Fifteenth National Conference on Artificial Intelligence (AAAI-98), 1998. Google ScholarDigital Library
- 5.Ido Dagan, Fernando Pereira, and Lillian Lee. Similarity-based estimation of word cooccurrence probabilities. In Proceedings of the 32rid Annual Meeting of the Association .for Computational Linguistics, 1994. Google ScholarDigital Library
- 6.S. C. Deerwester, S. T. Dumais, T. K. Landaner, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6):391-407, 1990.Google ScholarCross Ref
- 7.P. Domingos and M. Pazzani. Beyond independence: Conditions for the optimality of the simple bayesian classifier. Machine Learnin9, 29:103-130, 1997. Google ScholarDigital Library
- 8.Susan T. Dumais. Using LSI for information filtering: TREC-3 experiments. Technical Report 500- 225, National Institute of Standards and Technology, 1995.Google Scholar
- 9.Jerome H. Friedman. On bias, variance, 0/1 - loss, and the curse-of-dimensionality. Data Mining and Knowledge Discovery, 1:55-77, 1997. Google ScholarDigital Library
- 10.Thorsten Joachims. A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization. In International Conference on Machine Learning (ICML), 1997. Google ScholarDigital Library
- 11.j. D. Jobson. Applied Multivariate Data Analysis - Volume iI: Categorical and Multivariate Methods. Springer Verlag, 1992.Google Scholar
- 12.R. Kerber. Chimerge: Discretization of numeric attributes. In Proceedings of Tenth National Conference on Artificial Intelligence (AAAI-9e), 1992.Google Scholar
- 13.D. Koller and M. Sahami. Toward optimal feature selection. In Proceedings of Thirteenth International Conference on Machine Learning (ICML-96), 1996.Google Scholar
- 14.Ken Lang. Newsweeder: Learning to filter netnews. In International Conference on Machine Learning (ICML), pages 331-339, 1995.Google Scholar
- 15.Lillian Lee. Similarity-Based Approaches to Natural Language Processing. PhD thesis, Harvard University, 1997. (also Technical Report TR-11-97). Google ScholarDigital Library
- 16.David Lewis and Marc Ringuette. A comparison of two learning algorithms for text categorization. In Third Annual Symposium on Document Analysis and Information Retrieval, pages 81-93, 1994.Google Scholar
- 17.David D. Lewis and Kimberly A. Knowles. Threading electronic mail: A preliminary study. Information Processing and Management, 33(2):209-217, 1997. Google ScholarDigital Library
- 18.H. Liu and R. Setiono. Chi2: Feature selection and discretization of numeric attributes. In Proceedings of 7th IEEE Int'l Conference on Tools with Artificial Intelligence, 1995. Google ScholarDigital Library
- 19.Andrew McCallum and Kamal Nigam. A comparison of event models for naive Bayes text classification. In AAAI-98 Workshop on Learning for Text Categorization, 1998. http://www, cs.cmu.edu/-#mccallum.Google Scholar
- 20.Fernando Pereira, Naftali Tishby, and Lillian Lee. Distributional clustering of english words. In Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics, pages 183-90, 1993. Google ScholarDigital Library
- 21.WiseWire. http://www.wisewire.com.Google Scholar
- 22.Yiming Yang. Noise reduction in a statistical approach to text categorization. In Proceedings of the 18th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'95), pages 256-263, 1995. Google ScholarDigital Library
- 23.Yiming Yang and Jan Pederson. Feature selection in statistical learning of text categorization. In ICML- 97, pages 412-420, 1997. Google ScholarDigital Library
Index Terms
- Distributional clustering of words for text classification
Recommendations
Detecting misspelled words in Turkish text using syllable n-gram frequencies
PReMI'07: Proceedings of the 2nd international conference on Pattern recognition and machine intelligenceIn this study, we have designed and implemented a system which decides whether or not a word is misspelled in Turkish text. Firstly, three databases of syllable monogram, bigram and trigram frequencies are constructed using the syllables that are ...
Semantic classification of Chinese unknown words
ACL '03: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2This paper describes a classifier that assigns semantic thesaurus categories to unknown Chinese words (words not already in the CiLin thesaurus and the Chinese Electronic Dictionary, but in the Sinica Corpus). The focus of the paper differs in two ways ...
Multi-prototype Morpheme Embedding for Text Classification
SMA 2020: The 9th International Conference on Smart Media and ApplicationsRepresenting a word into a continuous space, also known as a word vector, has been successful in various NLP tasks. The word-based embedding has two problems; one is the out-of-vocabulary problem and the other is does not take into account the context ...
Comments