ABSTRACT
Recommender systems improve access to relevant products and information by making personalized suggestions based on previous examples of a user's likes and dislikes. Most existing recommender systems use collaborative filtering methods that base recommendations on other users' preferences. By contrast, content-based methods use information about an item itself to make suggestions. This approach has the advantage of being able to recommend previously unrated items to users with unique interests and to provide explanations for its recommendations. We describe a content-based book recommending system that utilizes information extraction and a machine-learning algorithm for text categorization. Initial experimental results demonstrate that this approach can produce accurate recommendations.
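The content-based idea can be illustrated with a minimal sketch. The abstract does not say which text-categorization algorithm is used, so the multinomial naive Bayes classifier below is an assumption chosen for brevity, and every name in it (`train`, `log_odds`, `recommend`, `explain`) and the toy ratings are hypothetical. The sketch learns from one user's rated item descriptions only, scores items that nobody has rated, and lists the words that most support a suggestion.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a deliberate simplification)."""
    return text.lower().split()


def train(rated_items):
    """Build word counts per class from (description, liked) pairs for one user."""
    word_counts = {True: Counter(), False: Counter()}
    class_counts = Counter()
    for text, liked in rated_items:
        class_counts[liked] += 1
        word_counts[liked].update(tokenize(text))
    vocab = set(word_counts[True]) | set(word_counts[False])
    return word_counts, class_counts, vocab


def word_log_ratio(model, word):
    """Laplace-smoothed log P(word | liked) - log P(word | disliked)."""
    word_counts, _, vocab = model
    logp = {}
    for label in (True, False):
        denom = sum(word_counts[label].values()) + len(vocab)
        logp[label] = math.log((word_counts[label][word] + 1) / denom)
    return logp[True] - logp[False]


def log_odds(model, text):
    """Log-odds that the user likes an item; positive means 'recommend'."""
    _, class_counts, vocab = model
    total = sum(class_counts.values())
    score = (math.log((class_counts[True] + 1) / (total + 2))
             - math.log((class_counts[False] + 1) / (total + 2)))
    for word in tokenize(text):
        if word in vocab:  # words never seen in training are ignored
            score += word_log_ratio(model, word)
    return score


def recommend(model, candidates):
    """Rank unrated items (title -> description) by descending log-odds."""
    return sorted(candidates, key=lambda t: log_odds(model, candidates[t]),
                  reverse=True)


def explain(model, text, k=3):
    """Return the k words in the description that most favor 'liked'."""
    _, _, vocab = model
    contrib = {w: word_log_ratio(model, w)
               for w in set(tokenize(text)) if w in vocab}
    return sorted(contrib.items(), key=lambda kv: kv[1], reverse=True)[:k]


# Toy ratings from a single user; no other users' preferences are needed.
rated = [
    ("space robots aliens adventure", True),
    ("galaxy space war robots", True),
    ("romance love wedding", False),
    ("love story heartbreak romance", False),
]
model = train(rated)
unrated = {
    "Book A": "robots in space",
    "Book B": "a love romance",
}
ranking = recommend(model, unrated)
reasons = explain(model, unrated["Book A"], k=2)
```

Because the score depends only on an item's own description, a book with no ratings from any user can still be ranked, and the per-word contributions returned by `explain` give a simple form of explanation.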
Index Terms
- Content-based book recommending using learning for text categorization