skip to main content
10.1145/1242572.1242652acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
Article

Privacy-enhancing personalized web search

Published:08 May 2007Publication History

ABSTRACT

Personalized web search is a promising way to improve search quality by customizing search results for people with individual information goals. However, users are uncomfortable with exposing private preference information to search engines. On the other hand, privacy is not absolute, and often can be compromised if there is a gain in service or profitability to the user. Thus, a balance must be struck between search quality and privacy protection. This paper presents a scalable way for users to automatically build rich user profiles. These profiles summarize a user.s interests into a hierarchical organization according to specific interests. Two parameters for specifying privacy requirements are proposed to help the user to choose the content and degree of detail of the profile information that is exposed to the search engine. Experiments showed that the user profile improved search quality when compared to standard MSN rankings. More importantly, results verified our hypothesis that a significant improvement on search quality can be achieved by only sharing some higher-level user profile information, which is potentially less sensitive than detailed personal information.

References

  1. J. Pitkow, H. Schuetze, T. Cass, R. Cooley, D. Turnbull, A. Edmonds, E. Adar, and T. Breuel. Personalized search. Communications of the ACM, 45(9):50--55, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Google personalized search: http://www.google.com/psearchGoogle ScholarGoogle Scholar
  3. Yahoo! My Web 2.0: http://myweb2.search.yahoo.com/Google ScholarGoogle Scholar
  4. W. Gasarch. A survey on private information retrieval. The bulletin of the European Association for Theoretical Computer Science (EATCS), 82:72--107, 2004.Google ScholarGoogle Scholar
  5. Glen Jeh, and Jennifer Widom. Scaling personalized web search. In Proc. of the 12th International World Wide Web Conference (WWW), Budapest, Hungary, May 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. T. H. Haveliwala. Topic-sensitive PageRank. In Proc. of the 11th International World Wide Web Conference (WWW), Honolulu, Hawaii, May 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. K. Sugiyama, K. Hatano and M. Yoshikawa. Adaptive Web search based on user profile constructed without any effort from users, In Proc. of the 13th International World Wide Web Conference (WWW), New York, New York, May 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Teevan, S. T. Dumais, and Eric Horvitz. Personalizing search via automated analysis of interests and activities. In the Prof. of 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Salvador, Brazil. August, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Paolo Ferragina, and Antonio Gulli. A personalized search engine based on Web-Snippet hierarchical clustering. In Proc. of the 14th International World Wide Web Conference (WWW), Chiba, Japan, May 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. A. Chirita, W. Nejdl, R. Paiu, and C. Kohlschutter. Using ODP metadata to personalize search. In the Prof. of 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), Salvador, Brazil, August, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. H. R. Kim, and Philip K. Chan. Learning implicit user interest hierarchy for context in personalization. In Proc. of International Conference on Intelligent User Interface (IUI), Miami, Florida, January, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Speretta, and S. Gauch, Personalizing search based on user search history. In Proc. of International Conference of Knowledge Management(CIKM), Washington D.C., 2004.Google ScholarGoogle Scholar
  13. P. Anick. Using terminological feed back for Web search refinement: a log-based study. In Proc. of the 13th International World Wide Web Conference (WWW), New York, New York, May 2004.Google ScholarGoogle Scholar
  14. K. R. McKeown, N. Elhadad, and V. Hatzivassiloglou. Leveraging a common representation for personalized search and summarization in a medical digital library. In Proc. of International Conference on Digital Library, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Kritikopoulos, and M. Sideri. The compass Filter: Search engine result personalization using web communities. In Proc. of Intelligent Techniques in Web Personalization (ITWP), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. B. Fung, K. Wang and M. Ester. Hierarchical document clustering using frequent itemsets. In Proc. Of SIAM International Conference on Data Mining, San Francisco, May 2003.Google ScholarGoogle ScholarCross RefCross Ref
  17. K. Wang, C. Xu, B. Ling, "Clustering transactions using large items", In Proc. of the 8th Conference on Information and Knowledge Management (CIKM), Kansas City, November, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Sun, H. Zeng, H. Liu, Y. Lu, and Z. Chen. CubeSVD: A Novel Approach to Personalized Web Search. In Proc. of the 14th International World Wide Web Conference (WWW), Chiba, Japan, May 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. Agrawal, and R. Skriant. Privacy preserving data mining. In Proc. of the ACM SIGMOD Conference on Management of Data (SIGMOD), Dallas, Texas, May 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Evfimievski, J. Gehrke and R. Srikant. Limiting privacy breaches in privacy preserving data mining. In Proc. of the ACM SIGMOD/PODS(PODS), San Diego, CA, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. L. Sweeney. k-anonymity: a model for protecting privacy. International Journal on Uncertainty, Fuzziness and Knowledge-based Systems, 10 (5), 2002; 557--570. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. R. Baeza-Yates, and B. Ribeiro-Neto, Modern Information Retrieval. Addison Wesley Longman, MA, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. W. Alan. Privacy and Freedom. Atheneum Press, Boston, 1967.Google ScholarGoogle Scholar
  24. J. Carroll and M. Rosson. The paradox of the active user. In J. M. Carroll (Ed.), Interfacing Thought: Cognitive Aspects of Human-Computer interaction, MIT Press, Cambridge, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. Tribus. Thermostatics and Thermodynamics, D. Van Nostrand, New York, NY, 1961.Google ScholarGoogle Scholar
  26. T. M. Cover and J. A. Thomas. Elements of Information Theory, 1st Edition. Wiley-InterScience, New York, NY, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. Han. Data Mining Concepts and Techniques, Morgan Kaufmann Publishers, San Francisco, CA, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. F. Qiu, and J. Cho. Automatic identification of user interest for personalized search. In Proc. of the 12th International World Wide Web Conference (WWW), Edinburgh, Scotland, May 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Privacy-enhancing personalized web search

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          WWW '07: Proceedings of the 16th international conference on World Wide Web
          May 2007
          1382 pages
          ISBN:9781595936547
          DOI:10.1145/1242572

          Copyright © 2007 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 8 May 2007

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate1,899of8,196submissions,23%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader