Skip to main content
Top

2021 | OriginalPaper | Chapter

Collaborative Data Analysis: Non-model Sharing-Type Machine Learning for Distributed Data

Authors : Akira Imakura, Xiucai Ye, Tetsuya Sakurai

Published in: Knowledge Management and Acquisition for Intelligent Systems

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper proposes a novel non-model sharing-type collaborative learning method for distributed data analysis, in which data are partitioned in both samples and features. Analyzing these types of distributed data are essential tasks in many applications, e.g., medical data analysis and manufacturing data analysis due to privacy and confidentiality concerns. By centralizing the intermediate representations which are individually constructed in each party, the proposed method achieves collaborative analysis without revealing the individual data, while the learning models remain distributed over local parties. Numerical experiments indicate that the proposed method achieves higher recognition performance for artificial and real-world problems than individual analysis.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Abadi, M., et al.: Deep learning with differential privacy. In: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, pp. 308–318. ACM (2016) Abadi, M., et al.: Deep learning with differential privacy. In: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, pp. 308–318. ACM (2016)
2.
go back to reference Bishop, C.M.: Pattern Recognition and Machine Learning (Information Science and Statistics). Springer, Heidelberg (2006)MATH Bishop, C.M.: Pattern Recognition and Machine Learning (Information Science and Statistics). Springer, Heidelberg (2006)MATH
3.
go back to reference Bogdanova, A., Nakai, A., Okada, Y., Imakura, A., Sakurai, T.: Federated learning system without model sharing through integration of dimensional reduced data representations. In: International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with IJCAI 2020 (FL-IJCAI 2020) (2020, accepted) Bogdanova, A., Nakai, A., Okada, Y., Imakura, A., Sakurai, T.: Federated learning system without model sharing through integration of dimensional reduced data representations. In: International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with IJCAI 2020 (FL-IJCAI 2020) (2020, accepted)
5.
go back to reference Cho, H., Wu, D.J., Berger, B.: Secure genome-wide association analysis using multiparty computation. Nat. Biotechnol. 36(6), 547 (2018)CrossRef Cho, H., Wu, D.J., Berger, B.: Secure genome-wide association analysis using multiparty computation. Nat. Biotechnol. 36(6), 547 (2018)CrossRef
7.
go back to reference Fisher, R.A.: The use of multiple measurements in taxonomic problems. Ann. Hum. Genet. 7(2), 179–188 (1936) Fisher, R.A.: The use of multiple measurements in taxonomic problems. Ann. Hum. Genet. 7(2), 179–188 (1936)
8.
go back to reference Gentry, C.: Fully homomorphic encryption using ideal lattices. In: Stoc, vol. 9, pp. 169–178 (2009) Gentry, C.: Fully homomorphic encryption using ideal lattices. In: Stoc, vol. 9, pp. 169–178 (2009)
9.
go back to reference Gilad-Bachrach, R., Dowlin, N., Laine, K., Lauter, K., Naehrig, M., Wernsing, J.: CryptoNets: applying neural networks to encrypted data with high throughput and accuracy. In: International Conference on Machine Learning, pp. 201–210 (2016) Gilad-Bachrach, R., Dowlin, N., Laine, K., Lauter, K., Naehrig, M., Wernsing, J.: CryptoNets: applying neural networks to encrypted data with high throughput and accuracy. In: International Conference on Machine Learning, pp. 201–210 (2016)
10.
go back to reference He, X., Niyogi, P.: Locality preserving projections. In: Advances in Neural Information Processing Systems, pp. 153–160 (2004) He, X., Niyogi, P.: Locality preserving projections. In: Advances in Neural Information Processing Systems, pp. 153–160 (2004)
11.
go back to reference Imakura, A., Matsuda, M., Ye, X., Sakurai, T.: Complex moment-based supervised eigenmap for dimensionality reduction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 3910–3918 (2019) Imakura, A., Matsuda, M., Ye, X., Sakurai, T.: Complex moment-based supervised eigenmap for dimensionality reduction. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 3910–3918 (2019)
13.
go back to reference Imakura, A., Sakurai, T.: Data collaboration analysis framework using centralization of individual intermediate representations for distributed data sets. ASCE-ASME J. Risk Uncertain. Eng. Syst. Part A Civil Eng. 6, 04020018 (2020)CrossRef Imakura, A., Sakurai, T.: Data collaboration analysis framework using centralization of individual intermediate representations for distributed data sets. ASCE-ASME J. Risk Uncertain. Eng. Syst. Part A Civil Eng. 6, 04020018 (2020)CrossRef
14.
go back to reference Ito, S., Murota, K.: An algorithm for the generalized eigenvalue problem for nonsquare matrix pencils by minimal perturbation approach. SIAM J. Matrix. Anal. Appl. 37, 409–419 (2016)MathSciNetCrossRef Ito, S., Murota, K.: An algorithm for the generalized eigenvalue problem for nonsquare matrix pencils by minimal perturbation approach. SIAM J. Matrix. Anal. Appl. 37, 409–419 (2016)MathSciNetCrossRef
16.
go back to reference Ji, Z., Lipton, Z.C., Elkan, C.: Differential privacy and machine learning: a survey and review. arXiv preprint arXiv:1412.7584 (2014) Ji, Z., Lipton, Z.C., Elkan, C.: Differential privacy and machine learning: a survey and review. arXiv preprint arXiv:​1412.​7584 (2014)
17.
go back to reference Konečnỳ, J., McMahan, H.B., Ramage, D., Richtarik, P.: Federated optimization: distributed machine learning for on-device intelligence. arXiv preprint arXiv:1610.02527 (2016) Konečnỳ, J., McMahan, H.B., Ramage, D., Richtarik, P.: Federated optimization: distributed machine learning for on-device intelligence. arXiv preprint arXiv:​1610.​02527 (2016)
18.
go back to reference Konečnỳ, J., McMahan, H.B., Yu, F.X., Richtarik, P., Suresh, A.T., Bacon, D.: Federated learning: strategies for improving communication efficiency. In: NIPS Workshop on Private Multi-Party Machine Learning (2016). https://arxiv.org/abs/1610.05492 Konečnỳ, J., McMahan, H.B., Yu, F.X., Richtarik, P., Suresh, A.T., Bacon, D.: Federated learning: strategies for improving communication efficiency. In: NIPS Workshop on Private Multi-Party Machine Learning (2016). https://​arxiv.​org/​abs/​1610.​05492
20.
go back to reference Li, X., Chen, M., Nie, F., Wang, Q.: Locality adaptive discriminant analysis. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 2201–2207. AAAI Press (2017) Li, X., Chen, M., Nie, F., Wang, Q.: Locality adaptive discriminant analysis. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 2201–2207. AAAI Press (2017)
21.
go back to reference van der Maaten, L., Hinton, G., Visualizing data using t-SNE: J. Machine Learn. Res. 9, 2579–2605 (2008) van der Maaten, L., Hinton, G., Visualizing data using t-SNE: J. Machine Learn. Res. 9, 2579–2605 (2008)
22.
go back to reference McMahan, H.B., Moore, E., Ramage, D., Hampson, S., et al.: Communication-efficient learning of deep networks from decentralized data. arXiv preprint arXiv:1602.05629 (2016) McMahan, H.B., Moore, E., Ramage, D., Hampson, S., et al.: Communication-efficient learning of deep networks from decentralized data. arXiv preprint arXiv:​1602.​05629 (2016)
23.
go back to reference Pearson, K.: LIII. On lines and planes of closest fit to systems of points in space. London Edinburgh Dublin Philos. Mag. J. Sci. 2(11), 559–572 (1901)CrossRef Pearson, K.: LIII. On lines and planes of closest fit to systems of points in space. London Edinburgh Dublin Philos. Mag. J. Sci. 2(11), 559–572 (1901)CrossRef
24.
go back to reference Samaria, F., Harter, A.: Parameterisation of a stochastic model for human face identification. In: Proceeding of IEEE Workshop on Applications of Computer Vision (1994) Samaria, F., Harter, A.: Parameterisation of a stochastic model for human face identification. In: Proceeding of IEEE Workshop on Applications of Computer Vision (1994)
25.
go back to reference Saunders, C., Gammerman, A., Vovk, V.: Ridge regression learning algorithm in dual variables (1998) Saunders, C., Gammerman, A., Vovk, V.: Ridge regression learning algorithm in dual variables (1998)
26.
go back to reference Strehl, A., Ghosh, J.: Cluster ensembles-a knowledge reuse framework for combining multiple partitions. J. Mach. Learn. Res. 3, 583–617 (2002)MathSciNetMATH Strehl, A., Ghosh, J.: Cluster ensembles-a knowledge reuse framework for combining multiple partitions. J. Mach. Learn. Res. 3, 583–617 (2002)MathSciNetMATH
27.
go back to reference Sugiyama, M.: Dimensionality reduction of multimodal labeled data by local Fisher discriminant analysis. J. Mach. Learn. Res. 8(May), 1027–1061 (2007)MATH Sugiyama, M.: Dimensionality reduction of multimodal labeled data by local Fisher discriminant analysis. J. Mach. Learn. Res. 8(May), 1027–1061 (2007)MATH
28.
go back to reference Tibshirani, R.: Regression shrinkage and selection via the lasso. J. Royal Stat. Soci. (Series B) 58, 267–288 (1996)MathSciNetMATH Tibshirani, R.: Regression shrinkage and selection via the lasso. J. Royal Stat. Soci. (Series B) 58, 267–288 (1996)MathSciNetMATH
30.
go back to reference Yang, Q., Liu, Y., Chen, T., Tong, Y.: Federated machine learning: concept and applications. ACM Trans. Intell. Syst. Technol. 10(2), Article 12 (2019) Yang, Q., Liu, Y., Chen, T., Tong, Y.: Federated machine learning: concept and applications. ACM Trans. Intell. Syst. Technol. 10(2), Article 12 (2019)
31.
go back to reference Ye, X., Li, H., Imakura, A., Sakurai, T.: Distributed collaborative feature selection based on intermediate representation. In: The 28th International Joint Conference on Artificial Intelligence (IJCAI-19). pp. 4142–4149 (2019) Ye, X., Li, H., Imakura, A., Sakurai, T.: Distributed collaborative feature selection based on intermediate representation. In: The 28th International Joint Conference on Artificial Intelligence (IJCAI-19). pp. 4142–4149 (2019)
Metadata
Title
Collaborative Data Analysis: Non-model Sharing-Type Machine Learning for Distributed Data
Authors
Akira Imakura
Xiucai Ye
Tetsuya Sakurai
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-69886-7_2

Premium Partner