Article

Free Access

Content-based book recommending using learning for text categorization

Authors:
Raymond J. Mooney

Department of Computer Sciences, University of Texas, Austin, TX

Department of Computer Sciences, University of Texas, Austin, TX
View Profile

,
Loriene Roy

Graduate School of Library and Information Science, University of Texas, Austin, TX

Graduate School of Library and Information Science, University of Texas, Austin, TX
View Profile

DL '00: Proceedings of the fifth ACM conference on Digital librariesJune 2000Pages 195–204https://doi.org/10.1145/336597.336662

Published:01 June 2000Publication History

DL '00: Proceedings of the fifth ACM conference on Digital libraries

Pages 195–204

ABSTRACT

Recommender systems improve access to relevant products and information by making personalized suggestions based on previous examples of a user's likes and dislikes. Most existing recommender systems use collaborative filtering methods that base recommendations on other users' preferences. By contrast,content-based methods use information about an item itself to make suggestions.This approach has the advantage of being able to recommend previously unrated items to users with unique interests and to provide explanations for its recommendations. We describe a content-based book recommending system that utilizes information extraction and a machine-learning algorithm for text categorization. Initial experimental results demonstrate that this approach can produce accurate recommendations.

References

1.R. B. Allen. User models: Method, theory, and practice. International Journal of Man-Machine Studies, 32:511-543, 1990. Google ScholarDigital Library
2.J. Alspector, A. Kolcz, and N. Karunanithi. Comparing feature-based and clique-based user models for movie selection. In Proceedings of the Third ACM Conference on Digital Libraries, pages 11-18, Pittsburgh, PA, June 1998. Google ScholarDigital Library
3.T. Anderson and J. D. Finn. The New Statistical Analysis of Data. Springer Verlag, New York, 1996.Google ScholarCross Ref
4.S. Baker. Laying a firm foundation: Administrative support for readers' advisory services. Collection Building, 12(3-4):13-18, 1993.Google ScholarCross Ref
5.M. Balabanovic and Y. Shoham. Fab: Content-based, collaborative recommendation. Communications of the Association for Computing Machinery, 40(3):66-72, 1997. Google ScholarDigital Library
6.C. Basu, H. Hirsh, and W. W. Cohen. Recommendation as classification: Using social and content-based information in recommendation. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pages 714-720, Madison, WI, July 1998. Google ScholarDigital Library
7.D. Billsus and M. J. Pazzani. Learning collaborative information filters. In Proceedings of the Fifteenth International Conference on Machine Learning, pages 46- 54, Madison, WI, 1998. Morgan Kaufman. Google ScholarDigital Library
8.M. E. Califf and R. J. Mooney. Relational learning of pattern-match rules for information extraction. In Proceedings of the Sixteenth National Conference on Artificial Intelligence, pages 328-334, Orlando, FL, July 1999. Google ScholarDigital Library
9.C. Cardie. Empirical methods in information extraction. AI Magazine, 18(4):65-79, 1997.Google ScholarDigital Library
10.W. W. Cohen. Learning trees and rules with set-valued features. In Proceedings of the Thirteenth National Conference on Artificial Intelligence, pages 709-716, Portland, OR, August 1996. Google ScholarDigital Library
11.D. Cohn, L. Atlas, and R. Ladner. Improving generalization with active learning. Machine Learning, 15(2):201-221, 1994. Google ScholarCross Ref
12.DARPA, editor. Proceedings of the 6th Message Understanding Conference, San Mateo, CA, 1995. Morgan Kaufman.Google Scholar
13.C. Fellbaum. WordNet: An Electronic Lexical Database. MIT Press, Cambridge, MA, 1998.Google ScholarCross Ref
14.J. Furnkranz, T. Mitchell, and E. Riloff. A case study in using linguistic phrases for text categorization on the WWW. In Papers from the AAAI 1998 Workshop on Text Categorization, pages 5-12, Madison, WI, 1998.Google Scholar
15.B. Gelfand, M. Wulfekuler, and W. F. Punch. Automated concept extraction from plain text. In Papers from the AAAI 1998 Workshop on Text Categorization, pages 13-17, Madison, WI, 1998.Google Scholar
16.D. Goldberg, D. Nichols, B. Oki, and D. Terry. Using collaborative filtering to weave an information tapestry. Communications of the Association for Computing Machinery, 35(12):61-70, 1992. Google ScholarDigital Library
17.N. Good, J. B. Schafer, J. A. Konstan, A. Borchers, B. Sarwar, J. Herlocker, and J. Riedl. Combining collaborative filtering with personal agents for better recommendations. In Proceedings of the Sixteenth National Conference on Artificial Intelligence, pages 439- 446, Orlando, FL, July 1999. Google ScholarDigital Library
18.T. Joachims. A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization. In Proceedings of the Fourteenth International Conference on Machine Learning, pages 143-151, San Francisco, CA, 1997. Morgan Kaufman. Google ScholarDigital Library
19.H. Kautz, editor. Papers from the AAAI 1998 Workshop on Recommender Systems, Madison, WI, 1998. AAAI Press.Google Scholar
20.A. Kent and et al. Use of Library Materials: The University of Pittsburgh Study. Dekker, New York, 1979.Google Scholar
21.R. Kohavi, B. Becker, and D. Sommerfield. Improving simple Bayes. In Proceedings of the European Conference on Machine Learning, 1997.Google Scholar
22.N. Kushmerick, K. Weld, and R. Doorenbos. Wrapper induction for information extraction. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, pages 729-735, Nagoya, Japan, 1997.Google Scholar
23.K. Lang. NewsWeeder: Learning to filter netnews. In Proceedings of the Twelfth International Conference on Machine Learning, pages 331-339, San Francisco, CA, 1995. Morgan Kaufman.Google ScholarCross Ref
24.W. Lehnert and B. Sundheim. A performance evaluation of text-analysis technologies. AI Magazine, 12(3):81-94, 1991. Google ScholarDigital Library
25.D. D. Lewis and J. Catlett. Heterogeneous uncertainty sampling for supervised learning. In Proceedings of the Eleventh International Conference on Machine Learning, pages 148-156, San Francisco, CA, July 1994. Morgan Kaufman.Google ScholarCross Ref
26.R. Liere and P. Tadepalli. Active learning with committees for text categorization. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pages 591-596, Providence, RI, July 1997. Google ScholarDigital Library
27.P. Maes. Agents that reduce work and information overload. Communications of the Association for Computing Machinery, 37(7):31-40, 1994. Google ScholarDigital Library
28.A. McCallum and K. Nigam. A comparison of event models for naive Bayes text classification. In Papers from the AAAI 1998 Workshop on Text Categorization, pages 41-48, Madison, WI, 1998.Google Scholar
29.K. McCook and G. O. Rolstad, editors. Developing Readers' Advisory Services: Concepts and Committments. Neal-Schuman, New York, 1993.Google Scholar
30.T. Mitchell. Machine Learning. McGraw-Hill, New York, NY, 1997. Google ScholarDigital Library
31.M. Mitra, C. Buckley, C. Cardie, and A. Singhal. An analysis of statistical and syntactic phrases. In Proceedings of the 5th RIAO Conference, Computer-Assisted Information Searching on the Internet, pages 200-214, 1997.Google Scholar
32.H. T. Ng, W. B. Goh, and K. L. Low. Feature selection, perceptron learning, and a usability case study for text categorization. In Proceedings of 20th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 67-73, Philadelphia, PA, 1997. Google ScholarDigital Library
33.K. Nigam, A. McCallum, S. Thrun, and T. Mitchell. Learning to classify text from labeled and unlabeled documents. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pages 792-799, Madison, WI, July 1998. Google ScholarDigital Library
34.D. Ourston and R. J. Mooney. Theory refinement combining analytical and empirical methods. Artificial Intelligence, 66:311-344, 1994. Google ScholarDigital Library
35.M. Pazzani and D. Billsus. Learning and revising user profiles: The identification of interesting web sites. Machine Learning, 27(3):313-331, 1997. Google ScholarDigital Library
36.M. Pazzani and D. Kibler. The utility of background knowledge in inductive learning. Machine Learning, 9:57-94, 1992. Google ScholarDigital Library
37.M. Pazzani, J. Muramatsu, and D. Billsus. Syskill & Webert: Identifying interesting web sites. In Proceedings of the Thirteenth National Conference on Artificial Intelligence, pages 54-61, Portland, OR, August 1996. Google ScholarDigital Library
38.J. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo,CA, 1993. Google ScholarDigital Library
39.P. Resnick, N. Iacovou, M. Sushak, P. Bergstrom, and J. Reidl. GroupLens: An open architecture for collaborative filtering of netnews. In Proceedings of the 1994 Computer Supported Cooperative Work Conference, New York. ACM. Google ScholarDigital Library
40.P. Resnik and H. R. Varian. Introduction (to the special section on recommender systems). Communications of the Association for Computing Machinery, 40(3):56- 59, 1997. Google ScholarDigital Library
41.E. Rich. User modeling via stereotypes. Cognitive Science, 3:329-354, 1979.Google ScholarCross Ref
42.E. Rich. Users are individuals: Individualizing user models. International Journal of Man-Machine Studies, 18:199-214, 1983.Google ScholarCross Ref
43.J. Rocchio. Relevance feedback in information retrieval. In G. Salton, editor, The SMART Retrieval System: Experiments in Automatic Document Processing, pages 313-323. Prentice Hall, 1971.Google ScholarDigital Library
44.D. E. Rumelhart, G. E. Hinton, and J. R. Williams. Learning internal representations by error propagation. In D. E. Rumelhart and J. L. McClelland, editors, Parallel Distributed Processing, Vol. I, pages 318-362. MIT Press, Cambridge, MA, 1986. Google ScholarDigital Library
45.G. Salton and C. Buckley. Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science, 41:288-297, 1990.Google ScholarCross Ref
46.I. Soboroff, C. Nicholas, and M. Pazzani, editors. Papers from the SIGIR-99 Recommender Systems Workshop, Berkeley, CA, 1999. ACM SIGIR.Google Scholar
47.G. G. Towell and J. W. Shavlik. Knowledge-based artificial neural networks. Artificial Intelligence, 70:119- 165, 1994. Google ScholarDigital Library
48.Y. Yang. An evaluation of statistical approaches to text categorization. Information Retrieval Journal, May 1999. Google ScholarDigital Library
49.Y. Yang and X. Liu. A re-examination of text cateogrization methods. In Proceedings of 22nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, 1999. Google ScholarDigital Library
50.Y. Yang and J. O. Pedersen. A comparative study on feature selection in text categorization. In Proceedings of the Fourteenth International Conference on Machine Learning, pages 412-420, San Francisco, CA, 1997. Morgan Kaufman. Google ScholarDigital Library

Index Terms

Content-based book recommending using learning for text categorization

Recommendations

Recommending Followees Based on Content Weighted User Interest Homophily
ICIMCS'16: Proceedings of the International Conference on Internet Multimedia Computing and Service

We study the problem of recommending followees to users on content curation social networks (CCSNs). Different from existing friendship-oriented user recommendation approaches, we exploit user interest homophily to recommend users of similar interests, ...
Read More
Blended Recommending: Integrating Interactive Information Filtering and Algorithmic Recommender Techniques
CHI '15: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems

We present a novel approach that integrates algorithmic recommender techniques with interactive faceted filtering methods. We refer to this approach as blended recommending. It allows users to interact with a set of filter facets representing criteria ...
Read More
Getting to know you: learning new user preferences in recommender systems
IUI '02: Proceedings of the 7th international conference on Intelligent user interfaces

Recommender systems have become valuable resources for users seeking intelligent ways to search through the enormous volume of information available to them. One crucial unsolved problem for recommender systems is how best to learn about a new user. In ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
DL '00: Proceedings of the fifth ACM conference on Digital libraries
June 2000
294 pages
ISBN:158113231X
DOI:10.1145/336597
Chairmen:
Peter J. Nürnberg
Aalborg Univ., Esbjerg, Denmark
,
David L. Hicks
Aalborg Univ., Esbjerg, Denmark
,
Richard Furuta
Texas A & M Univ., College Station
Copyright © 2000 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 June 2000
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
information filtering
machine learning
recommender systems
text categorization
Qualifiers
- Article
Conference

Acceptance Rates
DL '00 Paper Acceptance Rate44of132submissions,33%Overall Acceptance Rate95of346submissions,27%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 768
  Total Citations
  View Citations
- 6,089
  Total Downloads
- Downloads (Last 12 months)681
- Downloads (Last 6 weeks)95
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Content-based book recommending using learning for text categorization

DL '00: Proceedings of the fifth ACM conference on Digital libraries

ABSTRACT

References

Cited By

Index Terms

Recommendations

Recommending Followees Based on Content Weighted User Interest Homophily

Blended Recommending: Integrating Interactive Information Filtering and Algorithmic Recommender Techniques

Getting to know you: learning new user preferences in recommender systems

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Content-based book recommending using learning for text categorization

DL '00: Proceedings of the fifth ACM conference on Digital libraries

ABSTRACT

References

Cited By

Index Terms

Recommendations

Recommending Followees Based on Content Weighted User Interest Homophily

Blended Recommending: Integrating Interactive Information Filtering and Algorithmic Recommender Techniques

Getting to know you: learning new user preferences in recommender systems

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media