Article

Personalized web search by mapping user queries to categories

Authors:
Fang Liu

University of Illinois at Chicago, Chicago, IL

University of Illinois at Chicago, Chicago, IL
View Profile

,
Clement Yu

University of Illinois at Chicago, Chicago, IL

University of Illinois at Chicago, Chicago, IL
View Profile

,
Weiyi Meng

SUNY at Binghamton, Binghamton, NY

SUNY at Binghamton, Binghamton, NY
View Profile

CIKM '02: Proceedings of the eleventh international conference on Information and knowledge managementNovember 2002Pages 558–565https://doi.org/10.1145/584792.584884

Published:04 November 2002Publication History

CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management

Pages 558–565

ABSTRACT

Current web search engines are built to serve all users, independent of the needs of any individual user. Personalization of web search is to carry out retrieval for each user incorporating his/her interests. We propose a novel technique to map a user query to a set of categories, which represent the user's search intention. This set of categories can serve as a context to disambiguate the words in the user's query. A user profile and a general profile are learned from the user's search history and a category hierarchy respectively. These two profiles are combined to map a user query into a set of categories. Several learning and combining algorithms are evaluated and found to be effective. Among the algorithms to learn a user profile, we choose the Rocchio-based method for its simplicity, efficiency and its ability to be adaptive. Experimental results indicate that our technique to personalize web search is both effective and efficient.

References

J. Allan. Incremental relevance feedback for information filtering. SIGIR, 1996 Google ScholarDigital Library
M. Balabanovic and Y. Shoham. Learning information retrieval agents: Experiments with automated Web browsing. In On-line Working Notes of the AAAI Spring Symposium Series on Information Gathering from Distributed, Heterogeneous Environments, 1995.Google Scholar
K. Bollacker, S. Lawrence, and C. Lee Giles. A system for automatic personalized tracking of scientific literature on the web. ACM DL, 1999. Google ScholarDigital Library
J. Budzik and J. K. Hammond. Watson: Anticipating and contextualizing information needs. In Proceedings of the Sixty-second Annual Meeting of the American Society for Information Science, 1999Google Scholar
U. Çetintemel, M. J. Franklin, and C. Lee Giles. Self-Adaptive User Profiles for Large-Scale Data Delivery.ICDE, 2000Google Scholar
L. Chen and K. Sycara. WebMate: A Personal Agent for Browsing and Searching. Autonomous Agents and Multi Agent Systems, 1998. Google ScholarDigital Library
S. Deerwester, S. T. Dumais, G. Furnas, T. Landauer, and R. Harshman. Indexing by latent semantic analysis. JASIS, 18(2), 1990.Google Scholar
R. Dolin, D. Agrawal, A. El Abbadi and J. Pearlman. Using Automated Classification for Summarizating and Selecting Heterogeneous Information Sources. D-Lib Magazine, 1998. Google ScholarDigital Library
W. Foltz and S. T. Dumais. Personalized information delivery: An analysis of information filtering methods. CACM,1992. Google ScholarDigital Library
W. Frakes, and R. Baeza-Yates. Information Retrieval: Data Structures and Algorithms. 1992. Google ScholarDigital Library
S. Gauch, G. Wang, M. Gomez. ProFusion: Intelligent Fusion from Multiple, Distributed Search Engines. Journal of Universal Computer Science, 2(9), 1996Google Scholar
E. Glover, G. Flake, S. Lawrence, W. Birmingham, A. Kruger, C. Giles, and D. Pennock. Improving Category Specific Web Search by Learning Query Modifications. SAINT, 2001 Google ScholarDigital Library
G. H. Golub and C. F. Van Loan. Matrix Computations. Third Edition, 1996 Google ScholarDigital Library
L. Gravano, and H. Garcia-Molina. Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies. VLDB, 1995. Google ScholarDigital Library
D. Grossman and O. Frieder. Information Retrieval: Algorithms and Heuristics. 1998. Google ScholarDigital Library
A. E. Howe and D. Dreilinger. SavvySearch: A meta-search engine that learns which search engines to query. AI Magazine, 18(2), 1997.Google Scholar
P. Ipeirotis, L. Gravano, and M. Sahami. Probe, Count, and Classify: Categorizing Hidden Web Databases. ACM SIGMOD, 2001. Google ScholarDigital Library
Joachims, T., Freitag, D., and Mitchell, T. Webwatcher: A tour guide for the World Wide Web. IJCAI, 1997Google Scholar
D. Koller and M. Sahami. Hierarchically classifying documents using very few words. ICML, 1997 Google ScholarDigital Library
Y. Labrou and T. Finin. Yahoo! as an ontology: using Yahoo! categories to describe documents. CIKM, 1999 Google ScholarDigital Library
H. Lieberman. Letizia: An agent that assists Web browsing. IJCAI, 1995.Google Scholar
W. Meng, W. Wang, H. Sun and C. Yu. Concept Hierarchy Based Text Database Categorization. International Journal on Knowledge and Information Systems, March 2002.Google Scholar
T. Mitchell. Machine Learning, 1997. Google ScholarDigital Library
M. Pazzani and D. Billsus. Learning and Revising User Profiles: The identification of interesting web sites. Machine Learning, 1997. Google ScholarDigital Library
A. L. Powell, J. C. French, J. P. Callan and M. Connell. The impact of database selection on distributed searching. SIGIR, 2000. Google ScholarDigital Library
A. Pretschner and S. Gauch. Ontology based personalized search. ICTAI, 1999 Google ScholarDigital Library
J. Rocchio. Relevance feedback in information retrieval. In The smart retrieval system: Experiments in automatic document processing, 1971.Google Scholar
S. Robertson and I. Soboroff. The TREC-10 Filtering Track Final Report. TREC-10, 2001.Google Scholar
G. Salton and M. J. McGill. Introduction to Modern Information Retrieval, 1983. Google ScholarDigital Library
D. H. Widyantoro, T. R. Ioerger and J. Yen. An adaptive algorithm for learning changes in user interests. CIKM, 1999 Google ScholarDigital Library
T. W. Yan and H. Garcia-Molina. SIFT -- A Tool for Wide-Area Information Dissemination. USENIX Technical Conference, 1995. Google ScholarDigital Library
Y. Yang and C. G. Chute. An example-based mapping method for text categorization and retrieval. TOIS, 1994 Google ScholarDigital Library
Y. Yang. Noise Reduction in a Statistical Approach to Text Categorization. SIGIR 1995 Google ScholarDigital Library
Y. Yang and X. Liu, A re-examination of text categorization methods. SIGIR 1999. Google ScholarDigital Library
C. Yu, W. Meng, W. Wu and K. Liu. Efficient and Effective Metasearch for Text Databases Incorporating Linkages among Documents. ACM SIGMOD, 2001. Google ScholarDigital Library

Index Terms

Personalized web search by mapping user queries to categories
1. Information systems
  1. Information retrieval
    1. Users and interactive retrieval
      1. Personalization
  2. World Wide Web
    1. Web searching and information discovery
      1. Personalization

Recommendations

Personalized Web Search For Improving Retrieval Effectiveness

Abstract--Current Web search engines are built to serve all users, independent of the special needs of any individual user. Personalization of Web search is to carry out retrieval for each user incorporating his/her interests. We propose a novel ...
Read More
Deriving Concept-Based User Profiles from Search Engine Logs

User profiling is a fundamental component of any personalization applications. Most existing user profiling strategies are based on objects that users are interested in (i.e., positive preferences), but not the objects that users dislike (i.e., negative ...
Read More
A personalized URL re-ranking methodology using user's browsing behavior
KES-AMSTA'08: Proceedings of the 2nd KES International conference on Agent and multi-agent systems: technologies and applications

This paper proposes a personalized re-ranking of URLs returned by a search engine using user's browsing behaviors. Our personalization method constructs an index of the anchor text retrieved from the web pages that the user has clicked during his/her ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management
November 2002
704 pages
ISBN:1581134924
DOI:10.1145/584792
General Chair:
Charles Nicholas
University of Maryland Baltimore County
,
Program Chairs:
David Grossman
Illinois Institute of Technology
,
Konstantinos Kalpakis
University of Maryland Baltimore County
,
Sajda Qureshi
Erasmus University, Rotterdam
,
Han van Dissel
Erasmus University, Rotterdam
,
Len Seligman
The MITRE Corporation
Copyright © 2002 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 November 2002
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
category hierarchy
information filtering
personalization
search engine
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate1,861of8,427submissions,22%
Upcoming Conference
CIKM '24

Sponsor:

sigir

sigir

The 33rd ACM International Conference on Information and Knowledge Management

October 21 - 25, 2024

Boise , ID , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 128
  Total Citations
  View Citations
- 1,921
  Total Downloads
- Downloads (Last 12 months)15
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Personalized web search by mapping user queries to categories

CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management

ABSTRACT

References

Cited By

Index Terms

Recommendations

Personalized Web Search For Improving Retrieval Effectiveness

Deriving Concept-Based User Profiles from Search Engine Logs

A personalized URL re-ranking methodology using user's browsing behavior