skip to main content
10.1145/290941.290970acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article
Free Access

Distributional clustering of words for text classification

Authors Info & Claims
Published:01 August 1998Publication History
First page image

References

  1. 1.P. F. Brown, P. V. deSouza, R. L. Mercer, V. J. Della Pietra, and J. C. Lai. Class-based n-gram models of natural language. Computational Linguistics, 18(4):467-479, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.Thomas Cover and Peter Hart. Nearest neighbor pattern classification. IEEE Transactions on Information Theory, 13(1):21-27, 1967.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.Thomas M. Cover and Joy A. Thomas. Elements of Information Theory. john Wiley, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.Mark Craven, Daniel DiPasquo, Dayne Freitag, Andrew McCallum, Tom Mitchell, Kamal Nigam, and Sean Slattery. Learning to extract symbolic knowledge from the World Wide Web. In Proceedings of the Fifteenth National Conference on Artificial Intelligence (AAAI-98), 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.Ido Dagan, Fernando Pereira, and Lillian Lee. Similarity-based estimation of word cooccurrence probabilities. In Proceedings of the 32rid Annual Meeting of the Association .for Computational Linguistics, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.S. C. Deerwester, S. T. Dumais, T. K. Landaner, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6):391-407, 1990.Google ScholarGoogle ScholarCross RefCross Ref
  7. 7.P. Domingos and M. Pazzani. Beyond independence: Conditions for the optimality of the simple bayesian classifier. Machine Learnin9, 29:103-130, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.Susan T. Dumais. Using LSI for information filtering: TREC-3 experiments. Technical Report 500- 225, National Institute of Standards and Technology, 1995.Google ScholarGoogle Scholar
  9. 9.Jerome H. Friedman. On bias, variance, 0/1 - loss, and the curse-of-dimensionality. Data Mining and Knowledge Discovery, 1:55-77, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.Thorsten Joachims. A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization. In International Conference on Machine Learning (ICML), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.j. D. Jobson. Applied Multivariate Data Analysis - Volume iI: Categorical and Multivariate Methods. Springer Verlag, 1992.Google ScholarGoogle Scholar
  12. 12.R. Kerber. Chimerge: Discretization of numeric attributes. In Proceedings of Tenth National Conference on Artificial Intelligence (AAAI-9e), 1992.Google ScholarGoogle Scholar
  13. 13.D. Koller and M. Sahami. Toward optimal feature selection. In Proceedings of Thirteenth International Conference on Machine Learning (ICML-96), 1996.Google ScholarGoogle Scholar
  14. 14.Ken Lang. Newsweeder: Learning to filter netnews. In International Conference on Machine Learning (ICML), pages 331-339, 1995.Google ScholarGoogle Scholar
  15. 15.Lillian Lee. Similarity-Based Approaches to Natural Language Processing. PhD thesis, Harvard University, 1997. (also Technical Report TR-11-97). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.David Lewis and Marc Ringuette. A comparison of two learning algorithms for text categorization. In Third Annual Symposium on Document Analysis and Information Retrieval, pages 81-93, 1994.Google ScholarGoogle Scholar
  17. 17.David D. Lewis and Kimberly A. Knowles. Threading electronic mail: A preliminary study. Information Processing and Management, 33(2):209-217, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.H. Liu and R. Setiono. Chi2: Feature selection and discretization of numeric attributes. In Proceedings of 7th IEEE Int'l Conference on Tools with Artificial Intelligence, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.Andrew McCallum and Kamal Nigam. A comparison of event models for naive Bayes text classification. In AAAI-98 Workshop on Learning for Text Categorization, 1998. http://www, cs.cmu.edu/-#mccallum.Google ScholarGoogle Scholar
  20. 20.Fernando Pereira, Naftali Tishby, and Lillian Lee. Distributional clustering of english words. In Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics, pages 183-90, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. 21.WiseWire. http://www.wisewire.com.Google ScholarGoogle Scholar
  22. 22.Yiming Yang. Noise reduction in a statistical approach to text categorization. In Proceedings of the 18th Annual International A CM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'95), pages 256-263, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. 23.Yiming Yang and Jan Pederson. Feature selection in statistical learning of text categorization. In ICML- 97, pages 412-420, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Distributional clustering of words for text classification

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                SIGIR '98: Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
                August 1998
                394 pages
                ISBN:1581130155
                DOI:10.1145/290941

                Copyright © 1998 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 1 August 1998

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                Overall Acceptance Rate792of3,983submissions,20%

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader