Skip to main content

2018 | OriginalPaper | Buchkapitel

Qualitative Instead of Quantitative: Towards Practical Data Analysis Under Differential Privacy

verfasst von : Xuanyu Bai, Jianguo Yao, Mingyuan Yuan, Jia Zeng, Haibing Guan

Erschienen in: Database Systems for Advanced Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Differential privacy (DP) has become the de facto standard in the academic and industrial communities. Although DP can provide strong privacy guarantee, it also brings a major of performance loss for data mining systems. Recently there has been a flood of research into the quantitative mining of DP based algorithms, which are designed to improve the performance of data mining systems. However, industrial applications demand accurate quantitative mining results. Results containing noise are actually difficult to use. This paper rethinks to apply DP in industrial big data from another perspective: qualitative analysis, which aims to dig the data about rank, pattern, important set, etc. It does not require accurate results and naturally has a greater ability to accommodate noise. We design a framework about DP data publication based attribute importance rank to support the qualitative analysis of DP, which assists data buyers to perform qualitative analysis tasks and to know the credibility of their results. We show the realization of this framework using two typical qualitative tasks. Experimental results on public data and industrial data show that making use of this framework, qualitative analysis tasks can be completed with a high confidence support even when privacy budget \(\epsilon \) is very small (e.g., 0.05). Our observations suggest that qualitative analysis of DP has the potential ability to realize applying DP in industrial applications.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Blum, A., et al.: Practical privacy: the SuLQ framework. In: PODS (2005) Blum, A., et al.: Practical privacy: the SuLQ framework. In: PODS (2005)
4.
Zurück zum Zitat Chaudhuri, K., et al.: Privacy-preserving logistic regression. In: NIPS (2008) Chaudhuri, K., et al.: Privacy-preserving logistic regression. In: NIPS (2008)
5.
Zurück zum Zitat Chen, R., et al.: Differentially private high-dimensional data publication via sampling-based inference. In: SIGKDD (2015) Chen, R., et al.: Differentially private high-dimensional data publication via sampling-based inference. In: SIGKDD (2015)
6.
Zurück zum Zitat Chen, T., et al.: XGBoost: a scalable tree boosting system. In: SIGKDD (2016) Chen, T., et al.: XGBoost: a scalable tree boosting system. In: SIGKDD (2016)
7.
Zurück zum Zitat Domingoferrer, J., et al.: A critique of k-anonymity and some of its enhancements. In: ARES (2008) Domingoferrer, J., et al.: A critique of k-anonymity and some of its enhancements. In: ARES (2008)
8.
Zurück zum Zitat Dwork, C.: Differential privacy. In: ICALP (2006) Dwork, C.: Differential privacy. In: ICALP (2006)
9.
Zurück zum Zitat Dwork, C.: A firm foundation for private data analysis. In: CACM (2011) Dwork, C.: A firm foundation for private data analysis. In: CACM (2011)
11.
Zurück zum Zitat Friedman, A., et al.: Data mining with differential privacy. In: SIGKDD (2010) Friedman, A., et al.: Data mining with differential privacy. In: SIGKDD (2010)
12.
Zurück zum Zitat To, H., et al.: A framework for protecting worker location privacy in spatial crowdsourcing. Proc. VLDB Endow. 7, 919–930 (2014)CrossRef To, H., et al.: A framework for protecting worker location privacy in spatial crowdsourcing. Proc. VLDB Endow. 7, 919–930 (2014)CrossRef
13.
Zurück zum Zitat Hu, X., et al.: Differential privacy in telco big data platform. In: VLDB (2015) Hu, X., et al.: Differential privacy in telco big data platform. In: VLDB (2015)
14.
Zurück zum Zitat Machanavajjhala, A., et al.: L-diversity: privacy beyond k-anonymity. In: TKDD (2007) Machanavajjhala, A., et al.: L-diversity: privacy beyond k-anonymity. In: TKDD (2007)
15.
Zurück zum Zitat McSherry, F.: Privacy integrated queries: an extensible platform for privacy-preserving data analysis. In: CACM (2010) McSherry, F.: Privacy integrated queries: an extensible platform for privacy-preserving data analysis. In: CACM (2010)
16.
Zurück zum Zitat Bai, X., et al.: Embedding differential privacy in decision tree algorithm with different depths. SCIS 60, 082104 (2017) Bai, X., et al.: Embedding differential privacy in decision tree algorithm with different depths. SCIS 60, 082104 (2017)
17.
Zurück zum Zitat Mohammed, N., et al.: Differentially private data release for data mining. In: SIGKDD (2011) Mohammed, N., et al.: Differentially private data release for data mining. In: SIGKDD (2011)
18.
Zurück zum Zitat Qardaji, W., et al.: Differentially private grids for geospatial data. In: ICDE (2013) Qardaji, W., et al.: Differentially private grids for geospatial data. In: ICDE (2013)
19.
Zurück zum Zitat Xiao, Q., et al.: Differentially private network data release via structural inference. In: SIGKDD (2014) Xiao, Q., et al.: Differentially private network data release via structural inference. In: SIGKDD (2014)
20.
Zurück zum Zitat Zhang, J., et al.: PrivBayes: private data release via Bayesian networks. In: SIGMOD (2014) Zhang, J., et al.: PrivBayes: private data release via Bayesian networks. In: SIGMOD (2014)
Metadaten
Titel
Qualitative Instead of Quantitative: Towards Practical Data Analysis Under Differential Privacy
verfasst von
Xuanyu Bai
Jianguo Yao
Mingyuan Yuan
Jia Zeng
Haibing Guan
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-91458-9_46