skip to main content
10.1145/336597.336662acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article
Free Access

Content-based book recommending using learning for text categorization

Authors Info & Claims
Published:01 June 2000Publication History

ABSTRACT

Recommender systems improve access to relevant products and information by making personalized suggestions based on previous examples of a user's likes and dislikes. Most existing recommender systems use collaborative filtering methods that base recommendations on other users' preferences. By contrast,content-based methods use information about an item itself to make suggestions.This approach has the advantage of being able to recommend previously unrated items to users with unique interests and to provide explanations for its recommendations. We describe a content-based book recommending system that utilizes information extraction and a machine-learning algorithm for text categorization. Initial experimental results demonstrate that this approach can produce accurate recommendations.

References

  1. 1.R. B. Allen. User models: Method, theory, and practice. International Journal of Man-Machine Studies, 32:511-543, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.J. Alspector, A. Kolcz, and N. Karunanithi. Comparing feature-based and clique-based user models for movie selection. In Proceedings of the Third ACM Conference on Digital Libraries, pages 11-18, Pittsburgh, PA, June 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.T. Anderson and J. D. Finn. The New Statistical Analysis of Data. Springer Verlag, New York, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  4. 4.S. Baker. Laying a firm foundation: Administrative support for readers' advisory services. Collection Building, 12(3-4):13-18, 1993.Google ScholarGoogle ScholarCross RefCross Ref
  5. 5.M. Balabanovic and Y. Shoham. Fab: Content-based, collaborative recommendation. Communications of the Association for Computing Machinery, 40(3):66-72, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.C. Basu, H. Hirsh, and W. W. Cohen. Recommendation as classification: Using social and content-based information in recommendation. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pages 714-720, Madison, WI, July 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.D. Billsus and M. J. Pazzani. Learning collaborative information filters. In Proceedings of the Fifteenth International Conference on Machine Learning, pages 46- 54, Madison, WI, 1998. Morgan Kaufman. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.M. E. Califf and R. J. Mooney. Relational learning of pattern-match rules for information extraction. In Proceedings of the Sixteenth National Conference on Artificial Intelligence, pages 328-334, Orlando, FL, July 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9.C. Cardie. Empirical methods in information extraction. AI Magazine, 18(4):65-79, 1997.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.W. W. Cohen. Learning trees and rules with set-valued features. In Proceedings of the Thirteenth National Conference on Artificial Intelligence, pages 709-716, Portland, OR, August 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.D. Cohn, L. Atlas, and R. Ladner. Improving generalization with active learning. Machine Learning, 15(2):201-221, 1994. Google ScholarGoogle ScholarCross RefCross Ref
  12. 12.DARPA, editor. Proceedings of the 6th Message Understanding Conference, San Mateo, CA, 1995. Morgan Kaufman.Google ScholarGoogle Scholar
  13. 13.C. Fellbaum. WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA, 1998.Google ScholarGoogle ScholarCross RefCross Ref
  14. 14.J. Furnkranz, T. Mitchell, and E. Riloff. A case study in using linguistic phrases for text categorization on the WWW. In Papers from the AAAI 1998 Workshop on Text Categorization, pages 5-12, Madison, WI, 1998.Google ScholarGoogle Scholar
  15. 15.B. Gelfand, M. Wulfekuler, and W. F. Punch. Automated concept extraction from plain text. In Papers from the AAAI 1998 Workshop on Text Categorization, pages 13-17, Madison, WI, 1998.Google ScholarGoogle Scholar
  16. 16.D. Goldberg, D. Nichols, B. Oki, and D. Terry. Using collaborative filtering to weave an information tapestry. Communications of the Association for Computing Machinery, 35(12):61-70, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.N. Good, J. B. Schafer, J. A. Konstan, A. Borchers, B. Sarwar, J. Herlocker, and J. Riedl. Combining collaborative filtering with personal agents for better recommendations. In Proceedings of the Sixteenth National Conference on Artificial Intelligence, pages 439- 446, Orlando, FL, July 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.T. Joachims. A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization. In Proceedings of the Fourteenth International Conference on Machine Learning, pages 143-151, San Francisco, CA, 1997. Morgan Kaufman. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.H. Kautz, editor. Papers from the AAAI 1998 Workshop on Recommender Systems, Madison, WI, 1998. AAAI Press.Google ScholarGoogle Scholar
  20. 20.A. Kent and et al. Use of Library Materials: The University of Pittsburgh Study. Dekker, New York, 1979.Google ScholarGoogle Scholar
  21. 21.R. Kohavi, B. Becker, and D. Sommerfield. Improving simple Bayes. In Proceedings of the European Conference on Machine Learning, 1997.Google ScholarGoogle Scholar
  22. 22.N. Kushmerick, K. Weld, and R. Doorenbos. Wrapper induction for information extraction. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, pages 729-735, Nagoya, Japan, 1997.Google ScholarGoogle Scholar
  23. 23.K. Lang. NewsWeeder: Learning to filter netnews. In Proceedings of the Twelfth International Conference on Machine Learning, pages 331-339, San Francisco, CA, 1995. Morgan Kaufman.Google ScholarGoogle ScholarCross RefCross Ref
  24. 24.W. Lehnert and B. Sundheim. A performance evaluation of text-analysis technologies. AI Magazine, 12(3):81-94, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. 25.D. D. Lewis and J. Catlett. Heterogeneous uncertainty sampling for supervised learning. In Proceedings of the Eleventh International Conference on Machine Learning, pages 148-156, San Francisco, CA, July 1994. Morgan Kaufman.Google ScholarGoogle ScholarCross RefCross Ref
  26. 26.R. Liere and P. Tadepalli. Active learning with committees for text categorization. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pages 591-596, Providence, RI, July 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. 27.P. Maes. Agents that reduce work and information overload. Communications of the Association for Computing Machinery, 37(7):31-40, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. 28.A. McCallum and K. Nigam. A comparison of event models for naive Bayes text classification. In Papers from the AAAI 1998 Workshop on Text Categorization, pages 41-48, Madison, WI, 1998.Google ScholarGoogle Scholar
  29. 29.K. McCook and G. O. Rolstad, editors. Developing Readers' Advisory Services: Concepts and Committments. Neal-Schuman, New York, 1993.Google ScholarGoogle Scholar
  30. 30.T. Mitchell. Machine Learning. McGraw-Hill, New York, NY, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. 31.M. Mitra, C. Buckley, C. Cardie, and A. Singhal. An analysis of statistical and syntactic phrases. In Proceedings of the 5th RIAO Conference, Computer-Assisted Information Searching on the Internet, pages 200-214, 1997.Google ScholarGoogle Scholar
  32. 32.H. T. Ng, W. B. Goh, and K. L. Low. Feature selection, perceptron learning, and a usability case study for text categorization. In Proceedings of 20th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 67-73, Philadelphia, PA, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. 33.K. Nigam, A. McCallum, S. Thrun, and T. Mitchell. Learning to classify text from labeled and unlabeled documents. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pages 792-799, Madison, WI, July 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. 34.D. Ourston and R. J. Mooney. Theory refinement combining analytical and empirical methods. Artificial Intelligence, 66:311-344, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. 35.M. Pazzani and D. Billsus. Learning and revising user profiles: The identification of interesting web sites. Machine Learning, 27(3):313-331, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. 36.M. Pazzani and D. Kibler. The utility of background knowledge in inductive learning. Machine Learning, 9:57-94, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. 37.M. Pazzani, J. Muramatsu, and D. Billsus. Syskill & Webert: Identifying interesting web sites. In Proceedings of the Thirteenth National Conference on Artificial Intelligence, pages 54-61, Portland, OR, August 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. 38.J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo,CA, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. 39.P. Resnick, N. Iacovou, M. Sushak, P. Bergstrom, and J. Reidl. GroupLens: An open architecture for collaborative filtering of netnews. In Proceedings of the 1994 Computer Supported Cooperative Work Conference, New York. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. 40.P. Resnik and H. R. Varian. Introduction (to the special section on recommender systems). Communications of the Association for Computing Machinery, 40(3):56- 59, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. 41.E. Rich. User modeling via stereotypes. Cognitive Science, 3:329-354, 1979.Google ScholarGoogle ScholarCross RefCross Ref
  42. 42.E. Rich. Users are individuals: Individualizing user models. International Journal of Man-Machine Studies, 18:199-214, 1983.Google ScholarGoogle ScholarCross RefCross Ref
  43. 43.J. Rocchio. Relevance feedback in information retrieval. In G. Salton, editor, The SMART Retrieval System: Experiments in Automatic Document Processing, pages 313-323. Prentice Hall, 1971.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. 44.D. E. Rumelhart, G. E. Hinton, and J. R. Williams. Learning internal representations by error propagation. In D. E. Rumelhart and J. L. McClelland, editors, Parallel Distributed Processing, Vol. I, pages 318-362. MIT Press, Cambridge, MA, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. 45.G. Salton and C. Buckley. Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science, 41:288-297, 1990.Google ScholarGoogle ScholarCross RefCross Ref
  46. 46.I. Soboroff, C. Nicholas, and M. Pazzani, editors. Papers from the SIGIR-99 Recommender Systems Workshop, Berkeley, CA, 1999. ACM SIGIR.Google ScholarGoogle Scholar
  47. 47.G. G. Towell and J. W. Shavlik. Knowledge-based artificial neural networks. Artificial Intelligence, 70:119- 165, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. 48.Y. Yang. An evaluation of statistical approaches to text categorization. Information Retrieval Journal, May 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. 49.Y. Yang and X. Liu. A re-examination of text cateogrization methods. In Proceedings of 22nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. 50.Y. Yang and J. O. Pedersen. A comparative study on feature selection in text categorization. In Proceedings of the Fourteenth International Conference on Machine Learning, pages 412-420, San Francisco, CA, 1997. Morgan Kaufman. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Content-based book recommending using learning for text categorization

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              DL '00: Proceedings of the fifth ACM conference on Digital libraries
              June 2000
              294 pages
              ISBN:158113231X
              DOI:10.1145/336597

              Copyright © 2000 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 1 June 2000

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              DL '00 Paper Acceptance Rate44of132submissions,33%Overall Acceptance Rate95of346submissions,27%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader