2020 | OriginalPaper | Chapter

Matching Anonymized Individuals with Errors for Service Systems

Abstract

Data privacy is of great importance for the healthy development of service systems. Companies and governments that provide services to people are often reluctant to share their data, so data must be preprocessed (e.g., anonymized) before they can be shared. However, without identifying information, it is difficult to match data from different sources, and thus the data cannot be used together. This paper investigates how the performance of two simple individual matching methods is affected by errors in the similarity scores between individuals. The first is a greedy method (GM) that simply matches individuals based on the maximum similarity scores. The second formulates matching as an optimal assignment problem (AP), which maximizes the total similarity score of the matched individuals. Consistent with the literature, we found that GM outperforms AP in most situations. However, we also discovered that AP could be better in fixing errors.
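
The difference between the two methods can be illustrated with a minimal sketch. It is not the chapter's implementation: the function names, the toy score matrix, and the reading of GM as "repeatedly take the highest remaining score and retire both individuals" are assumptions made for illustration, and the AP is solved by brute force over permutations, which is only feasible for very small inputs.

```python
from itertools import permutations


def greedy_match(scores):
    """Assumed GM: take the highest remaining score, then retire that row and column."""
    n = len(scores)
    # Sort all pairs by descending score; break ties by row, then column.
    pairs = sorted(
        ((scores[i][j], i, j) for i in range(n) for j in range(n)),
        key=lambda t: (-t[0], t[1], t[2]),
    )
    used_rows, used_cols, match = set(), set(), {}
    for _, i, j in pairs:
        if i not in used_rows and j not in used_cols:
            match[i] = j
            used_rows.add(i)
            used_cols.add(j)
    return match


def assignment_match(scores):
    """AP by brute force: the one-to-one matching with the largest total score."""
    n = len(scores)
    best = max(
        permutations(range(n)),
        key=lambda p: sum(scores[i][p[i]] for i in range(n)),
    )
    return dict(enumerate(best))


def total_score(scores, match):
    """Sum of similarity scores over the matched pairs."""
    return sum(scores[i][j] for i, j in match.items())


# Hypothetical similarity scores (in percent) between two individuals in
# source A (rows) and two individuals in source B (columns).
scores = [
    [90, 80],
    [85, 10],
]

gm = greedy_match(scores)      # {0: 0, 1: 1}, total score 100
ap = assignment_match(scores)  # {0: 1, 1: 0}, total score 165
print(gm, total_score(scores, gm))
print(ap, total_score(scores, ap))
```

On this toy matrix the two methods disagree: GM commits to the single best pair and is left with a poor second pair, while AP gives up the best pair to raise the total. Which result is correct depends on the true identities and on where the score errors fall, which is the question the chapter studies.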

Metadata
Title
Matching Anonymized Individuals with Errors for Service Systems
Author
Wai Kin (Victor) Chan
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-30967-1_15