Skip to main content

2016 | OriginalPaper | Buchkapitel

On Mutual Information over Non-Euclidean Spaces, Data Mining and Data Privacy Levels

verfasst von : Yoan Miche, Ian Oliver, Silke Holtmanns, Anton Akusok, Amaury Lendasse, Kaj-Mikael Björk

Erschienen in: Proceedings of ELM-2015 Volume 2

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we propose a framework for measuring the impact of data privacy techniques, in information theoretic and in data mining terms. The need for data privacy and anonymization is often hampered by the fact that the privacy functions alter the data in non-measurable amounts and details. We propose here to use Mutual Information over non-Euclidean spaces as a means of measuring this distortion. In addition, and following the same principle, we also propose to use Machine Learning techniques in order to quantify the impact of the data obfuscation in terms of further data mining goals.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Nissenbaum, H.: A contextual approach to privacy online. Daedalus, 140(4):32–48 (Fall 2011) Nissenbaum, H.: A contextual approach to privacy online. Daedalus, 140(4):32–48 (Fall 2011)
2.
Zurück zum Zitat Gürses, S., Troncoso, C.G., Diaz, C.: Engineering privacy by design. In: Computers, Privacy and Data Protection (2011) Gürses, S., Troncoso, C.G., Diaz, C.: Engineering privacy by design. In: Computers, Privacy and Data Protection (2011)
3.
Zurück zum Zitat Oliver, I.: Privacy Engineering: A Data Flow and Ontological Approach. CreateSpace (2014) Oliver, I.: Privacy Engineering: A Data Flow and Ontological Approach. CreateSpace (2014)
4.
Zurück zum Zitat Ciriani, V., Capitani di Vimercati, S., Foresti, S., Samarati, P.: \(\kappa \)-anonymity. In: Ting, Y., Jajodia, S. (eds.) Secure Data Management in Decentralized Systems. Advances in Information Security, vol. 33, pp. 323–353. Springer, New York (2007) Ciriani, V., Capitani di Vimercati, S., Foresti, S., Samarati, P.: \(\kappa \)-anonymity. In: Ting, Y., Jajodia, S. (eds.) Secure Data Management in Decentralized Systems. Advances in Information Security, vol. 33, pp. 323–353. Springer, New York (2007)
5.
Zurück zum Zitat Machanavajjhala, A., Gehrke, J., Kifer, D., Venkitasubramaniam, M.: \(\ell \)-diversity: Privacy beyond \(\kappa \)-anonymity. In: 2013 IEEE 29th International Conference on Data Engineering (ICDE), 0:24 (2006) Machanavajjhala, A., Gehrke, J., Kifer, D., Venkitasubramaniam, M.: \(\ell \)-diversity: Privacy beyond \(\kappa \)-anonymity. In: 2013 IEEE 29th International Conference on Data Engineering (ICDE), 0:24 (2006)
6.
Zurück zum Zitat Dwork, C.: Differential privacy: a survey of results. In: Theory and Applications of Models of Computation. Lecture Notes in Computer Science, vol. 4978, pp. 1–19. Springer, Berlin (2008) Dwork, C.: Differential privacy: a survey of results. In: Theory and Applications of Models of Computation. Lecture Notes in Computer Science, vol. 4978, pp. 1–19. Springer, Berlin (2008)
7.
Zurück zum Zitat The UK Cabinet Office. Security policy framework (April 2013) The UK Cabinet Office. Security policy framework (April 2013)
8.
Zurück zum Zitat Huang, G., Chen, L., Siew, C.-K., Huang, G.-B., Chen, L., Siew, C.-K.: Universal approximation using incremental constructive feedforward neural networks with random hidden nodes. IEEE Trans. Neural Networks 17(4), 879–892 (2006) Huang, G., Chen, L., Siew, C.-K., Huang, G.-B., Chen, L., Siew, C.-K.: Universal approximation using incremental constructive feedforward neural networks with random hidden nodes. IEEE Trans. Neural Networks 17(4), 879–892 (2006)
9.
Zurück zum Zitat Huang, G.-B., Zhu, Q.-Y., Siew, C.-K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006) Huang, G.-B., Zhu, Q.-Y., Siew, C.-K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)
10.
11.
Zurück zum Zitat Miche, Y., Sorjamaa, A., Bas, P., Simula, O., Jutten, C., Lendasse, A.: OP-ELM: optimally-pruned extreme learning machine. IEEE Trans. Neural Networks 21(1), 158–162 (2010) Miche, Y., Sorjamaa, A., Bas, P., Simula, O., Jutten, C., Lendasse, A.: OP-ELM: optimally-pruned extreme learning machine. IEEE Trans. Neural Networks 21(1), 158–162 (2010)
12.
Zurück zum Zitat Miche, Y., van Heeswijk, M., Bas, P., Simula, O., Lendasse, A.: TROP-ELM: a double-regularized ELM using LARS and Tikhonov regularization. Neurocomputing 74(16), 2413–2421 (2011) Miche, Y., van Heeswijk, M., Bas, P., Simula, O., Lendasse, A.: TROP-ELM: a double-regularized ELM using LARS and Tikhonov regularization. Neurocomputing 74(16), 2413–2421 (2011)
13.
Zurück zum Zitat Van Heeswijk, M., Miche, Y., Oja, E., Lendasse, A.: GPU-accelerated and parallelized ELM ensembles for large-scale regression. Neurocomputing 74(16), 2430–2437 (2011) Van Heeswijk, M., Miche, Y., Oja, E., Lendasse, A.: GPU-accelerated and parallelized ELM ensembles for large-scale regression. Neurocomputing 74(16), 2430–2437 (2011)
14.
Zurück zum Zitat Cambria, E., Huang, G.-B., Kasun, L.L.C., Zhou, H., Vong, C.M., Lin, J., Yin, J., Cai, Z., Liu, Q., Li, K., Leung, V.C.M., Feng, L., Ong, Y.-S., Lim, M.-H., Akusok, A., Lendasse, A., Corona, F., Nian, R., Miche, Y., Gastaldo, P., Zunino, R., Decherchi, S., Yang, X., Mao, K., Oh, B.-S., Jeon, J., Toh, K.-A., Teoh, A.B.J., Kim, J., Yu, H., Chen, Y., Liu, J.: Extreme learning machines (trends and controversies). IEEE Intell. Syst. 28(6), 30–59 (2013) Cambria, E., Huang, G.-B., Kasun, L.L.C., Zhou, H., Vong, C.M., Lin, J., Yin, J., Cai, Z., Liu, Q., Li, K., Leung, V.C.M., Feng, L., Ong, Y.-S., Lim, M.-H., Akusok, A., Lendasse, A., Corona, F., Nian, R., Miche, Y., Gastaldo, P., Zunino, R., Decherchi, S., Yang, X., Mao, K., Oh, B.-S., Jeon, J., Toh, K.-A., Teoh, A.B.J., Kim, J., Yu, H., Chen, Y., Liu, J.: Extreme learning machines (trends and controversies). IEEE Intell. Syst. 28(6), 30–59 (2013)
15.
Zurück zum Zitat Radhakrishna Rao, C., Mitra, S.K.: Generalized Inverse of Matrices and Its Applications. Wiley, New York (1972) Radhakrishna Rao, C., Mitra, S.K.: Generalized Inverse of Matrices and Its Applications. Wiley, New York (1972)
16.
Zurück zum Zitat Kraskov, A., Stögbauer, H., Grassberger, P.: Estimating mutual information. Phys. Rev. E 69, 066138 (2004) Kraskov, A., Stögbauer, H., Grassberger, P.: Estimating mutual information. Phys. Rev. E 69, 066138 (2004)
17.
Zurück zum Zitat Pál, D., Póczos, B., Szepesvári, C.: Estimation of Rényi entropy and mutual information based on generalized nearest-neighbor graphs. ArXiv e-prints (2010) Pál, D., Póczos, B., Szepesvári, C.: Estimation of Rényi entropy and mutual information based on generalized nearest-neighbor graphs. ArXiv e-prints (2010)
18.
Zurück zum Zitat Pál, D., Póczos, B., Szepesvári, C.: Estimation of rényi entropy and mutual information based on generalized nearest-neighbor graphs. In: Lafferty, J.D., Williams, C.K.I., Shawe-Taylor, J., Zemel, R.S., Culotta, A. (eds.) Advances in Neural Information Processing Systems 23, pp. 1849–1857. Curran Associates, Inc. (2010) Pál, D., Póczos, B., Szepesvári, C.: Estimation of rényi entropy and mutual information based on generalized nearest-neighbor graphs. In: Lafferty, J.D., Williams, C.K.I., Shawe-Taylor, J., Zemel, R.S., Culotta, A. (eds.) Advances in Neural Information Processing Systems 23, pp. 1849–1857. Curran Associates, Inc. (2010)
20.
Zurück zum Zitat Bogachev, V.I., Kolesnikov, A.V.: The Monge-Kantorovich problem: achievements, connections, and perspectives. Russ. Math. Surv. 67, 785–890 (2012)CrossRefMathSciNetMATH Bogachev, V.I., Kolesnikov, A.V.: The Monge-Kantorovich problem: achievements, connections, and perspectives. Russ. Math. Surv. 67, 785–890 (2012)CrossRefMathSciNetMATH
Metadaten
Titel
On Mutual Information over Non-Euclidean Spaces, Data Mining and Data Privacy Levels
verfasst von
Yoan Miche
Ian Oliver
Silke Holtmanns
Anton Akusok
Amaury Lendasse
Kaj-Mikael Björk
Copyright-Jahr
2016
DOI
https://doi.org/10.1007/978-3-319-28373-9_32