Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 1/2015

01.02.2015 | Original Article

A new privacy-preserving proximal support vector machine for classification of vertically partitioned data

verfasst von: Li Sun, Wei-Song Mu, Biao Qi, Zhi-Jian Zhou

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 1/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

A new privacy-preserving proximal support vector machine (P3SVM) is formulated for classification of vertically partitioned data. Our classifier is based on the concept of global random reduced kernel which is composed of local reduced kernels. Each of them is computed using local reduced matrix with Gaussian perturbation, which is privately generated by only one of the parties, and never made public. This formulation leads to an extremely simple and fast privacy-preserving algorithm, for generating a linear or nonlinear classifier that merely requires the solution of a single system of linear equations. Comprehensive experiments are conducted on multiple publicly available benchmark datasets to evaluate the performance of the proposed algorithms and the results indicate that: (a) Our P3SVM achieves better performance than the recently proposed privacy-preserving SVM via random kernels in terms of both classification accuracy and computational time. (b) A significant improvement of accuracy is attained by our P3SVM when compared to classifiers generated only using each party’s own data. (c) The generated classifier has comparable accuracy to an ordinary PSVM classifier trained on the entire dataset, without releasing any private data.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Agrawal R (1999) Data mining: crossing the chasm. In: Proceedings of the 5th ACM SIGKDD International Conference on knowledge discovery and data mining, San Diego. doi:10.1145/312129.312167 Agrawal R (1999) Data mining: crossing the chasm. In: Proceedings of the 5th ACM SIGKDD International Conference on knowledge discovery and data mining, San Diego. doi:10.​1145/​312129.​312167
3.
Zurück zum Zitat Agrawal D, Aggarwal CC (2001) On the design and quantification of privacy preserving data mining algorithms. In: Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of database systems. ACM, pp 247–255. doi:10.1145/375551.375602 Agrawal D, Aggarwal CC (2001) On the design and quantification of privacy preserving data mining algorithms. In: Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of database systems. ACM, pp 247–255. doi:10.​1145/​375551.​375602
4.
Zurück zum Zitat Vaidya J, Clifton C (2002) Privacy preserving association rule mining in vertically partitioned data. In: Proceedings of the eighth ACM SIGKDD International Conference on Knowledge discovery and data mining. ACM, pp 639–644. doi:10.1145/775047.775142 Vaidya J, Clifton C (2002) Privacy preserving association rule mining in vertically partitioned data. In: Proceedings of the eighth ACM SIGKDD International Conference on Knowledge discovery and data mining. ACM, pp 639–644. doi:10.​1145/​775047.​775142
5.
Zurück zum Zitat Bertino E, Lin D, Jiang W (2008) Privacy-preserving data mining, advances in database systems. In: Aggarwal C, Yu P (eds) A survey of quantification of privacy preserving data mining algorithms, 34th edn. Springer, US, pp 183–205. doi:10.1007/978-0-387-70992-5_8 CrossRef Bertino E, Lin D, Jiang W (2008) Privacy-preserving data mining, advances in database systems. In: Aggarwal C, Yu P (eds) A survey of quantification of privacy preserving data mining algorithms, 34th edn. Springer, US, pp 183–205. doi:10.​1007/​978-0-387-70992-5_​8 CrossRef
6.
Zurück zum Zitat Chen K, Liu L (2005) Privacy preserving data classification with rotation perturbation. In: Proceedings of Fifth IEEE International Conference on Data Mining (ICDM’05), pp 589–592. doi:10.1109/ICDM.2005.121 Chen K, Liu L (2005) Privacy preserving data classification with rotation perturbation. In: Proceedings of Fifth IEEE International Conference on Data Mining (ICDM’05), pp 589–592. doi:10.​1109/​ICDM.​2005.​121
7.
Zurück zum Zitat Xiao MJ, Huang LS, Luo YL, et al. (2005) Privacy preserving ID3 algorithm over horizontally partitioned data. In: Sixth International Conference on parallel and distributed computing applications and technologies (PDCAT’05), pp 239–243. doi:10.1109/PDCAT.2005.191 Xiao MJ, Huang LS, Luo YL, et al. (2005) Privacy preserving ID3 algorithm over horizontally partitioned data. In: Sixth International Conference on parallel and distributed computing applications and technologies (PDCAT’05), pp 239–243. doi:10.​1109/​PDCAT.​2005.​191
9.
Zurück zum Zitat Vaidya J, Clifton C (2005) Data and applications security XIX. In: Jajodia S, Wijesekera D (eds) Privacy-preserving decision trees over vertically partitioned data, 3654th edn., Lecture notes in computer scienceSpringer, Berlin, pp 139–152. doi:10.1007/11535706_11 Vaidya J, Clifton C (2005) Data and applications security XIX. In: Jajodia S, Wijesekera D (eds) Privacy-preserving decision trees over vertically partitioned data, 3654th edn., Lecture notes in computer scienceSpringer, Berlin, pp 139–152. doi:10.​1007/​11535706_​11
10.
Zurück zum Zitat Vapnik VN (1998) Statistical learning theory. Wiley, New YorkMATH Vapnik VN (1998) Statistical learning theory. Wiley, New YorkMATH
12.
Zurück zum Zitat Joachims T (1999) Transductive inference for text classification using support vector machines. In: Proceedings of the Sixteenth International Conference on Machine Learning (ICML’99), pp 200–209 Joachims T (1999) Transductive inference for text classification using support vector machines. In: Proceedings of the Sixteenth International Conference on Machine Learning (ICML’99), pp 200–209
13.
Zurück zum Zitat Wang X-Z, He Q, Chen D-G, Yeung D (2005) A genetic algorithm for solving the inverse problem of support vector machines. Neurocomputing 68:225–238CrossRef Wang X-Z, He Q, Chen D-G, Yeung D (2005) A genetic algorithm for solving the inverse problem of support vector machines. Neurocomputing 68:225–238CrossRef
14.
15.
Zurück zum Zitat Yu H, Jiang X, Vaidya J (2006) Privacy-preserving SVM using nonlinear kernels on horizontally partitioned data. In: Proceedings of the 2006 ACM symposium on Applied computing (SAC’06). ACM, New York, pp 603–610. doi:10.1145/1141277.1141415 Yu H, Jiang X, Vaidya J (2006) Privacy-preserving SVM using nonlinear kernels on horizontally partitioned data. In: Proceedings of the 2006 ACM symposium on Applied computing (SAC’06). ACM, New York, pp 603–610. doi:10.​1145/​1141277.​1141415
16.
Zurück zum Zitat Yu H, Vaidya J, Jiang X (2006) Advances in knowledge discovery and data mining. In: Ng W-K, Kitsuregawa M, Li J, Chang K (eds) Privacy-preserving SVM classification on vertically partitioned data, 3918th edn., Lecture notes in computer scienceSpringer, Berlin, pp 647–656. doi:10.1007/11731139_74 Yu H, Vaidya J, Jiang X (2006) Advances in knowledge discovery and data mining. In: Ng W-K, Kitsuregawa M, Li J, Chang K (eds) Privacy-preserving SVM classification on vertically partitioned data, 3918th edn., Lecture notes in computer scienceSpringer, Berlin, pp 647–656. doi:10.​1007/​11731139_​74
17.
18.
Zurück zum Zitat Mangasarian OL, Wild EW (2008) Privacy-preserving classification of horizontally partitioned data via random kernel. In: Proceedings of the DMIN08, vol 2, pp 473–479 Mangasarian OL, Wild EW (2008) Privacy-preserving classification of horizontally partitioned data via random kernel. In: Proceedings of the DMIN08, vol 2, pp 473–479
19.
Zurück zum Zitat Mangasarian OL, Wild EW (2010) Data mining. In: Stahlbock R, Crone SF, Lessmann S (eds) Privacy-preserving random kernel classification of checkerboard partitioned data, 8th edn., Annals of information systemsSpringer, US, pp 375–387. doi:10.1007/978-1-4419-1280-0_17 Mangasarian OL, Wild EW (2010) Data mining. In: Stahlbock R, Crone SF, Lessmann S (eds) Privacy-preserving random kernel classification of checkerboard partitioned data, 8th edn., Annals of information systemsSpringer, US, pp 375–387. doi:10.​1007/​978-1-4419-1280-0_​17
21.
Zurück zum Zitat Lin KP, Chen MS (2010) Privacy-preserving outsourcing support vector machines with random transformation. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge discovery and data mining. ACM, pp 363–372. doi:10.1145/1835804.1835852 Lin KP, Chen MS (2010) Privacy-preserving outsourcing support vector machines with random transformation. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge discovery and data mining. ACM, pp 363–372. doi:10.​1145/​1835804.​1835852
22.
Zurück zum Zitat Wang X-Z, Lu S-X, Zhai J-H (2008) Fast fuzzy multi-category SVM based on support vector domain description. Int J Pattern Recognit Artif Intell 22(1):109–120CrossRef Wang X-Z, Lu S-X, Zhai J-H (2008) Fast fuzzy multi-category SVM based on support vector domain description. Int J Pattern Recognit Artif Intell 22(1):109–120CrossRef
26.
Zurück zum Zitat Fung G, Mangasarian O L (2001) Proximal support vector machine classifiers. In: Proceedings of the seventh ACM SIGKDD International Conference on Knowledge discovery and data mining (KDD’01). ACM, New York, pp 77–86. doi:10.1145/502512.502527 Fung G, Mangasarian O L (2001) Proximal support vector machine classifiers. In: Proceedings of the seventh ACM SIGKDD International Conference on Knowledge discovery and data mining (KDD’01). ACM, New York, pp 77–86. doi:10.​1145/​502512.​502527
28.
Zurück zum Zitat Horn RA, Johnson CR (2012) Matrix analysis. Cambridge university press, CambridgeCrossRef Horn RA, Johnson CR (2012) Matrix analysis. Cambridge university press, CambridgeCrossRef
30.
Zurück zum Zitat Duda RO, Hart PE, Stork DG (2012) Pattern classification. Wiley, New York Duda RO, Hart PE, Stork DG (2012) Pattern classification. Wiley, New York
31.
Zurück zum Zitat Lee YJ, Mangasarian OL (2001) RSVM: Reduced support vector machines. In: Proceedings of the first SIAM International Conference on data mining. SIAM, Philadelphia, pp 5–7 Lee YJ, Mangasarian OL (2001) RSVM: Reduced support vector machines. In: Proceedings of the first SIAM International Conference on data mining. SIAM, Philadelphia, pp 5–7
32.
Zurück zum Zitat Lee YJ, Huang SY (2007) Reduced support vector machines: a statistical theory. IEEE Trans Neural Netw 18(1):1–13CrossRef Lee YJ, Huang SY (2007) Reduced support vector machines: a statistical theory. IEEE Trans Neural Netw 18(1):1–13CrossRef
33.
Zurück zum Zitat Mitchell TM (1997) Machine learning. McGraw-Hill, BostonMATH Mitchell TM (1997) Machine learning. McGraw-Hill, BostonMATH
Metadaten
Titel
A new privacy-preserving proximal support vector machine for classification of vertically partitioned data
verfasst von
Li Sun
Wei-Song Mu
Biao Qi
Zhi-Jian Zhou
Publikationsdatum
01.02.2015
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 1/2015
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-014-0245-1

Weitere Artikel der Ausgabe 1/2015

International Journal of Machine Learning and Cybernetics 1/2015 Zur Ausgabe

Neuer Inhalt