Skip to main content
Erschienen in: Knowledge and Information Systems 9/2021

27.07.2021 | Regular Paper

Privacy protection of user profiles in online search via semantic randomization

verfasst von: Mercedes Rodriguez-Garcia, Montserrat Batet, David Sánchez, Alexandre Viejo

Erschienen in: Knowledge and Information Systems | Ausgabe 9/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Querying a search engine is one of the most frequent activities performed by Internet users. As queries are submitted, the server collects and aggregates them to build detailed user profiles. While user profiles are used to offer personalized search services, they may also be employed in behavioral targeting or, even worse, be transferred to third parties. Proactive protection of users' privacy in front of search engines has been tackled by submitting fake queries that aim at distorting the users' real profile. However, most approaches submit either random queries (which do not allow controlling the profile distortion) or queries constructed by following deterministic algorithms (which may be detected by aware search engines). In this paper, we propose a semantically grounded method to generate fake queries that (i) is driven by the privacy requirements of the user, (ii) submits the least number of fake queries needed to fulfill the requirements and (iii) creates queries in a non-deterministic way. Unlike related works, we accurately analyze and exploit the semantics underlying to user queries and their influence in the resulting profile. As a result, our approach offers more control—because users can tailor how their profile should be protected—and greater efficiency—because the desired protection is achieved with fewer fake queries. The experimental results on real query logs illustrate the benefits of our approach.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Viejo A, Sánchez D (2014) Profiling social networks to provide useful and privacy-preserving web search. J Am Soc Inf Sci 65(12):2444–2458 Viejo A, Sánchez D (2014) Profiling social networks to provide useful and privacy-preserving web search. J Am Soc Inf Sci 65(12):2444–2458
2.
Zurück zum Zitat Gómez-Boix A, Laperdrix P, Baudry B (2018) Hiding in the crowd: an analysis of the effectiveness of browser fingerprinting at large scale. In: WWW2018—TheWebConf 2018: 27th international world wide web conference. 2018. Lyon, France Gómez-Boix A, Laperdrix P, Baudry B (2018) Hiding in the crowd: an analysis of the effectiveness of browser fingerprinting at large scale. In: WWW2018—TheWebConf 2018: 27th international world wide web conference. 2018. Lyon, France
3.
Zurück zum Zitat Tegegne G, van der Weide TP (2014) Enriching queries with user preferences in healthcare. Inf Process Manag 50(4):599–620CrossRef Tegegne G, van der Weide TP (2014) Enriching queries with user preferences in healthcare. Inf Process Manag 50(4):599–620CrossRef
4.
Zurück zum Zitat Bordogna G et al (2012) Disambiguated query suggestions and personalized content-similarity and novelty ranking of clustered results to optimize web searches. Inf Process Manag 48(3):419–437CrossRef Bordogna G et al (2012) Disambiguated query suggestions and personalized content-similarity and novelty ranking of clustered results to optimize web searches. Inf Process Manag 48(3):419–437CrossRef
5.
Zurück zum Zitat Selvaretnam B, Belkhatir M (2019) Coupled intrinsic and extrinsic human language resource-based query expansion. Knowl Inf Syst 60:1397–1426CrossRef Selvaretnam B, Belkhatir M (2019) Coupled intrinsic and extrinsic human language resource-based query expansion. Knowl Inf Syst 60:1397–1426CrossRef
6.
Zurück zum Zitat Raza MA, Mokhtar R, Ahmad N (2019) A survey of statistical apporaches for query expansion. Knowl Inf Syst 61:1–25CrossRef Raza MA, Mokhtar R, Ahmad N (2019) A survey of statistical apporaches for query expansion. Knowl Inf Syst 61:1–25CrossRef
7.
Zurück zum Zitat Chen J, Stallaert J (2014) An Economic Analysis of Online Advertising Using Behavioral Targeting. MIS Q 38(2):429–449CrossRef Chen J, Stallaert J (2014) An Economic Analysis of Online Advertising Using Behavioral Targeting. MIS Q 38(2):429–449CrossRef
8.
Zurück zum Zitat Ramirez E et al (2014) Data brokers: a call for transparency and accountability, in report. 2014, U.S. Federal Trade Commission Ramirez E et al (2014) Data brokers: a call for transparency and accountability, in report. 2014, U.S. Federal Trade Commission
9.
Zurück zum Zitat Nissenbaum HF, Howe D (2009) Trackmenot: resisting surveillance in web search. In: Kerr I, Lucock C, Steeves V (eds) Lessons from the identity trail: anonymity, privacy, and identity in a networked society. Oxford University Press, Oxford Nissenbaum HF, Howe D (2009) Trackmenot: resisting surveillance in web search. In: Kerr I, Lucock C, Steeves V (eds) Lessons from the identity trail: anonymity, privacy, and identity in a networked society. Oxford University Press, Oxford
10.
Zurück zum Zitat Romero-Tris C, Castellà-Roca J, Viejo A (2011) Multi-party private web search with untrusted partners. In: 7th International ICST conference on security and privacy in communication networks—SecureComm’11. Springer Romero-Tris C, Castellà-Roca J, Viejo A (2011) Multi-party private web search with untrusted partners. In: 7th International ICST conference on security and privacy in communication networks—SecureComm’11. Springer
11.
Zurück zum Zitat Viejo A, Castellà-Roca J (2010) Using social networks to distort users’ profiles generated by web search engines. Comput Netw 54:1343–1357CrossRef Viejo A, Castellà-Roca J (2010) Using social networks to distort users’ profiles generated by web search engines. Comput Netw 54:1343–1357CrossRef
12.
Zurück zum Zitat Castellà-Roca J, Viejo A, Herrera-Joancomarti J (2009) Preserving user’s privacy in web search engines. Comput Commun 32:1541–1551CrossRef Castellà-Roca J, Viejo A, Herrera-Joancomarti J (2009) Preserving user’s privacy in web search engines. Comput Commun 32:1541–1551CrossRef
13.
Zurück zum Zitat Lindell Y, Waisbard E (2010) Private web search with malicious adversaries. In: 10th International conference on privacy enhancing technologies—PETS’10 Lindell Y, Waisbard E (2010) Private web search with malicious adversaries. In: 10th International conference on privacy enhancing technologies—PETS’10
14.
Zurück zum Zitat Romero-Tris C et al (2015) Design of a P2P network that protects users’ privacy in front of Web Search Engines. Comput Commun 57:37–49CrossRef Romero-Tris C et al (2015) Design of a P2P network that protects users’ privacy in front of Web Search Engines. Comput Commun 57:37–49CrossRef
15.
Zurück zum Zitat Kaaniche N et al (2020) Privacy preserving cooperative computation for personalized web search applications. I:n 35th Annual ACM symposium on applied computing. ACM, Brno, Czech Republic Kaaniche N et al (2020) Privacy preserving cooperative computation for personalized web search applications. I:n 35th Annual ACM symposium on applied computing. ACM, Brno, Czech Republic
16.
Zurück zum Zitat Petit A, Cerqueus T, Mokhtar SB, Brunie L (2015) Kosch. PEAS: private, efficient and accurate web search. In: 14th IEEE international conference on trust, security and privacy in computing and communications Petit A, Cerqueus T, Mokhtar SB, Brunie L (2015) Kosch. PEAS: private, efficient and accurate web search. In: 14th IEEE international conference on trust, security and privacy in computing and communications
17.
18.
Zurück zum Zitat Domingo-Ferrer J, Solanas A, Castellà-Roca J (2009) h(k)-Private information retrieval from privacy-uncooperative queryable databases. J Online Inf Rev 33(4):1468–4527 Domingo-Ferrer J, Solanas A, Castellà-Roca J (2009) h(k)-Private information retrieval from privacy-uncooperative queryable databases. J Online Inf Rev 33(4):1468–4527
19.
Zurück zum Zitat Peddinti ST, Saxena N (2010) On the privacy of web search based on query obfuscation: a case study of trackmenot. In: 10th International conference on privacy enhancing technologies—PETS’10 Peddinti ST, Saxena N (2010) On the privacy of web search based on query obfuscation: a case study of trackmenot. In: 10th International conference on privacy enhancing technologies—PETS’10
20.
Zurück zum Zitat Shou L, Bai H, Chen K, Chen G (2012) Supporting privacy protection in personalized web search. IEEE Trans Knowl Data Eng 26(2):453–467CrossRef Shou L, Bai H, Chen K, Chen G (2012) Supporting privacy protection in personalized web search. IEEE Trans Knowl Data Eng 26(2):453–467CrossRef
21.
Zurück zum Zitat Shapira B et al (2005) PRAW—a PRivAcy model for the Web. J Am Soc Inf Sci Technol 56:159–172CrossRef Shapira B et al (2005) PRAW—a PRivAcy model for the Web. J Am Soc Inf Sci Technol 56:159–172CrossRef
22.
Zurück zum Zitat Sánchez D, Castellà-Roca J, Viejo A (2013) Knowledge-based scheme to create privacy-preserving but semantically-related queries for web search engines. Inf Sci 218:17–30CrossRef Sánchez D, Castellà-Roca J, Viejo A (2013) Knowledge-based scheme to create privacy-preserving but semantically-related queries for web search engines. Inf Sci 218:17–30CrossRef
23.
Zurück zum Zitat Ahmad WU, Chang K-W, Wang H (2018) Intent-aware query obfuscation for privacy protection in personalized web search. In: 41st International ACM SIGIR conference on research and development in information retrieval. ACM, Ann Arbor, MI, USA Ahmad WU, Chang K-W, Wang H (2018) Intent-aware query obfuscation for privacy protection in personalized web search. In: 41st International ACM SIGIR conference on research and development in information retrieval. ACM, Ann Arbor, MI, USA
24.
Zurück zum Zitat Rodrigo-Ginés FJ et al (2018) PrivacySearch: an end-user and query generalization tool for privacy enhancement in web search. in international conference on network and system security—NSS 2018 Rodrigo-Ginés FJ et al (2018) PrivacySearch: an end-user and query generalization tool for privacy enhancement in web search. in international conference on network and system security—NSS 2018
25.
Zurück zum Zitat Wu Z et al (2020) A dummy-based user privacy protection approach for text information retrieval. Knowl Based Syst 195:105679CrossRef Wu Z et al (2020) A dummy-based user privacy protection approach for text information retrieval. Knowl Based Syst 195:105679CrossRef
26.
Zurück zum Zitat Guarino N (1998) Formal ontology in information systems. In: 1st International conference on formal ontology in information systems, FOIS 1998. IOS Press, Trento, Italy Guarino N (1998) Formal ontology in information systems. In: 1st International conference on formal ontology in information systems, FOIS 1998. IOS Press, Trento, Italy
28.
Zurück zum Zitat Wu Z, Palmer M (1994) Verbs semantics and lexical selection. In: Proceeding of the annual meeting of the association for computational linguistics. pp 133–139 Wu Z, Palmer M (1994) Verbs semantics and lexical selection. In: Proceeding of the annual meeting of the association for computational linguistics. pp 133–139
29.
Zurück zum Zitat Sánchez D et al (2012) Enabling semantic similarity estimation across multiple ontologies: an evaluation in the biomedical domain. J Biomed Inform 45(1):141–155CrossRef Sánchez D et al (2012) Enabling semantic similarity estimation across multiple ontologies: an evaluation in the biomedical domain. J Biomed Inform 45(1):141–155CrossRef
30.
Zurück zum Zitat Batet M et al (2014) An information theoretic approach to improve semantic similarity assessments across multiple ontologies. Inf Sci 283:197–210CrossRef Batet M et al (2014) An information theoretic approach to improve semantic similarity assessments across multiple ontologies. Inf Sci 283:197–210CrossRef
31.
Zurück zum Zitat Martínez S, Valls A, Sánchez D (2012) Semantically-grounded construction of centroids for datasets with textual attributes. Knowl Based Syst 35:160–172CrossRef Martínez S, Valls A, Sánchez D (2012) Semantically-grounded construction of centroids for datasets with textual attributes. Knowl Based Syst 35:160–172CrossRef
33.
Zurück zum Zitat Viejo A, Sánchez D, Castellà-Roca J (2012) Preventing automatic user profiling in Web 2.0 applications. Knowl Based Syst 36:191–205CrossRef Viejo A, Sánchez D, Castellà-Roca J (2012) Preventing automatic user profiling in Web 2.0 applications. Knowl Based Syst 36:191–205CrossRef
34.
Zurück zum Zitat Fellbaum C (1998) WordNet: an electronic lexical database. MIT Press, CambridgeCrossRef Fellbaum C (1998) WordNet: an electronic lexical database. MIT Press, CambridgeCrossRef
Metadaten
Titel
Privacy protection of user profiles in online search via semantic randomization
verfasst von
Mercedes Rodriguez-Garcia
Montserrat Batet
David Sánchez
Alexandre Viejo
Publikationsdatum
27.07.2021
Verlag
Springer London
Erschienen in
Knowledge and Information Systems / Ausgabe 9/2021
Print ISSN: 0219-1377
Elektronische ISSN: 0219-3116
DOI
https://doi.org/10.1007/s10115-021-01597-x

Weitere Artikel der Ausgabe 9/2021

Knowledge and Information Systems 9/2021 Zur Ausgabe