Skip to main content
Erschienen in: Programming and Computer Software 5/2023

01.10.2023

Application of Computer Simulation to the Anonymization of Personal Data: Synthesis-Based Anonymization Model and Algorithm

verfasst von: A. V. Borisov, A. V. Bosov, A. V. Ivanov

Erschienen in: Programming and Computer Software | Ausgabe 5/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

This paper describes the second part of our study devoted to automated anonymization of personal data. The overview and analysis of research prospects is supplemented with a practical result. An anonymization model is proposed, which reduces anonymization of personal data to manipulation of samples of random elements of different types. The key idea of our approach to anonymization of personal data with preservation of their usefulness is the use of the synthesis method, i.e., the complete replacement of all non-anonymized data with synthetic values. In the proposed model, a set of element types is selected, for which corresponding synthesys templates are proposed. The set of templates constitutes a synthesis-based anonymization algorithm. Technically, each template is based on a well-known statistical tool: frequency estimates of probabilities, Parzen–Rosenblatt kernel density estimates, statistical means, and covariances. The proposed approach is illustrated by a simple example from civil aviation.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Borisov, A.V., Bosov, A.V., and Ivanov, A.V., Application of Computer Simulation to the Anonymization of Personal Data: State-of-the-Art and Key Points, Program. Comput. Software, 2023, no. 4, pp. 232–246. Borisov, A.V., Bosov, A.V., and Ivanov, A.V., Application of Computer Simulation to the Anonymization of Personal Data: State-of-the-Art and Key Points, Program. Comput. Software, 2023, no. 4, pp. 232–246.
2.
Zurück zum Zitat Aggarwal, C.C. and Yu, P.S., On privacy-preservation of text and sparse binary data with sketches, Proc. SIAM Conf. Data Mining, 2007. Aggarwal, C.C. and Yu, P.S., On privacy-preservation of text and sparse binary data with sketches, Proc. SIAM Conf. Data Mining, 2007.
3.
Zurück zum Zitat Sweeney, L., K-anonymity: A model for protecting privacy, Int. J. Uncertainty, Fuzziness Knowl.-Based Syst., 2002, vol. 10, no. 5, pp. 557–570.MathSciNetCrossRefMATH Sweeney, L., K-anonymity: A model for protecting privacy, Int. J. Uncertainty, Fuzziness Knowl.-Based Syst., 2002, vol. 10, no. 5, pp. 557–570.MathSciNetCrossRefMATH
4.
Zurück zum Zitat Samarati, P. and Sweeney, L., Generalizing data to provide anonymity when disclosing information (Abstract), Proc. ACM Symp. Principles of Database Systems, 1998, p. 188. Samarati, P. and Sweeney, L., Generalizing data to provide anonymity when disclosing information (Abstract), Proc. ACM Symp. Principles of Database Systems, 1998, p. 188.
5.
Zurück zum Zitat Samarati, P., Protecting respondents' identities in microdata release, IEEE Trans. Knowl. Data Eng., 2001, vol. 13, no. 6, pp. 1010–1027.CrossRef Samarati, P., Protecting respondents' identities in microdata release, IEEE Trans. Knowl. Data Eng., 2001, vol. 13, no. 6, pp. 1010–1027.CrossRef
6.
Zurück zum Zitat Bayardo, R.J. and Agrawal, R., Data privacy through optimal k-anonymization, Proc. ICDE Conf., 2005, pp. 217–228. Bayardo, R.J. and Agrawal, R., Data privacy through optimal k-anonymization, Proc. ICDE Conf., 2005, pp. 217–228.
7.
Zurück zum Zitat Fung, B., Wang, K., and Yu, P., Top-down specialization for information and privacy preservation, Proc. ICDE Conf., 2005. Fung, B., Wang, K., and Yu, P., Top-down specialization for information and privacy preservation, Proc. ICDE Conf., 2005.
8.
Zurück zum Zitat Wang, K., Yu, P., and Chakraborty, S., Bottom-up generalization: A data mining solution to privacy protection, Proc. ICDM Conf., 2004. Wang, K., Yu, P., and Chakraborty, S., Bottom-up generalization: A data mining solution to privacy protection, Proc. ICDM Conf., 2004.
9.
Zurück zum Zitat Domingo-Ferrer, J. and Mateo-Sanz, J., Practical data-oriented micro-aggregation for statistical disclosure control, IEEE TKDE, 2002, vol. 14, no. 1. Domingo-Ferrer, J. and Mateo-Sanz, J., Practical data-oriented micro-aggregation for statistical disclosure control, IEEE TKDE, 2002, vol. 14, no. 1.
10.
Zurück zum Zitat Winkler, W., Using simulated annealing for k-anonymity, Technical Report 7, US Census Bureau, Washington D.C. 20233, 2002. Winkler, W., Using simulated annealing for k-anonymity, Technical Report 7, US Census Bureau, Washington D.C. 20233, 2002.
11.
Zurück zum Zitat Iyengar, V.S., Transforming data to satisfy privacy constraints, Proc. KDD Conference, 2002. Iyengar, V.S., Transforming data to satisfy privacy constraints, Proc. KDD Conference, 2002.
12.
Zurück zum Zitat Lakshmanan, L., Ng, R., and Ramesh, G., To do or not to do: The dilemma of disclosing anonymized data, Proc. ACM SIGMOD Conf., 2005. Lakshmanan, L., Ng, R., and Ramesh, G., To do or not to do: The dilemma of disclosing anonymized data, Proc. ACM SIGMOD Conf., 2005.
13.
Zurück zum Zitat Aggarwal, C.C. and Yu, P.S., On variable constraints in privacy-preserving data mining, Proc. SIAM Conf., 2005. Aggarwal, C.C. and Yu, P.S., On variable constraints in privacy-preserving data mining, Proc. SIAM Conf., 2005.
14.
Zurück zum Zitat Aggarwal, C.C., On k-anonymity and the curse of dimensionality, Proc. VLDB Conf., 2005. Aggarwal, C.C., On k-anonymity and the curse of dimensionality, Proc. VLDB Conf., 2005.
15.
Zurück zum Zitat Iyengar, V.S., Transforming data to satisfy privacy constraints, Proc. KDD Conf., 2002. Iyengar, V.S., Transforming data to satisfy privacy constraints, Proc. KDD Conf., 2002.
16.
Zurück zum Zitat Machanavajjhala, A., Gehrke, J., Kifer, D., and Venkitasubramaniam, M., L-Diversity: Privacy beyond k‑anonymity, Proc. ICDE Conf., 2006. Machanavajjhala, A., Gehrke, J., Kifer, D., and Venkitasubramaniam, M., L-Diversity: Privacy beyond k‑anonymity, Proc. ICDE Conf., 2006.
17.
Zurück zum Zitat Fung, B., Wang, K., and Yu, P., Top-down specialization for information and privacy preservation, Proc. ICDE Conf., 2005. Fung, B., Wang, K., and Yu, P., Top-down specialization for information and privacy preservation, Proc. ICDE Conf., 2005.
18.
Zurück zum Zitat Wang, K., Yu, P., and Chakraborty, S., Bottom-up generalization: A data mining solution to privacy protection, Proc. ICDM Conf., 2004. Wang, K., Yu, P., and Chakraborty, S., Bottom-up generalization: A data mining solution to privacy protection, Proc. ICDM Conf., 2004.
19.
Zurück zum Zitat Rosenblatt, M., Remarks on some nonparametric estimates of a density function, Ann. Math. Stat., 1956, vol. 27, no. 3, pp. 832–837.MathSciNetCrossRefMATH Rosenblatt, M., Remarks on some nonparametric estimates of a density function, Ann. Math. Stat., 1956, vol. 27, no. 3, pp. 832–837.MathSciNetCrossRefMATH
20.
Zurück zum Zitat Parzen, E., On estimation of a probability density function and mode, Ann. Math. Stat., 1962, vol. 33, no. 3, pp. 1065–1076.MathSciNetCrossRefMATH Parzen, E., On estimation of a probability density function and mode, Ann. Math. Stat., 1962, vol. 33, no. 3, pp. 1065–1076.MathSciNetCrossRefMATH
21.
Zurück zum Zitat Silverman, B.W., Density Estimation for Statistics and Data Analysis, London: Chapman & Hall/CRC, 1986.MATH Silverman, B.W., Density Estimation for Statistics and Data Analysis, London: Chapman & Hall/CRC, 1986.MATH
22.
Metadaten
Titel
Application of Computer Simulation to the Anonymization of Personal Data: Synthesis-Based Anonymization Model and Algorithm
verfasst von
A. V. Borisov
A. V. Bosov
A. V. Ivanov
Publikationsdatum
01.10.2023
Verlag
Pleiades Publishing
Erschienen in
Programming and Computer Software / Ausgabe 5/2023
Print ISSN: 0361-7688
Elektronische ISSN: 1608-3261
DOI
https://doi.org/10.1134/S036176882305002X

Weitere Artikel der Ausgabe 5/2023

Programming and Computer Software 5/2023 Zur Ausgabe

Premium Partner