Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 11/2023

27.05.2023 | Original Article

Personalized federated learning based on multi-head attention algorithm

verfasst von: Shanshan Jiang, Meixia Lu, Kai Hu, Jiasheng Wu, Yaogen Li, Liguo Weng, Min Xia, Haifeng Lin

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 11/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Federated Learning (FL) is an algorithm for the encrypted exchange of model parameters while ensuring the independence of participants. Classic federated learning does not take into account the correlation between features, nor does it take into account the data differences caused by the reasonable personalization of each client. Therefore, this paper proposes a personalized federated learning algorithm based on a multi-head attention mechanism. First, in order to improve the personalization of local models, attention mechanism is used to capture the relevance of local features. Then, when aggregating local models, the weight \(\lambda\) is generated for local models based on the differences between models, and finally aggregate them into a new global model. Finally, the multi-head attention is proposed to calculate the importance score of the global model parameters on the current local model, and assign it to the local model as the attention coefficient, so as to realize personalized federated learning. Through experiments on MNIST, SVHN and STL10 datasets, the validity of Personalized Federated Learning is verified, and the rationality of hyperparameter setting is discussed through visualizing results.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
2.
Zurück zum Zitat Xiao Junsheng, Huahu Xu, Gao Honghao, Bian Minjie, Li Yang (2021) A weakly supervised semantic segmentation network by aggregating seed cues: the multi-object proposal generation perspective. ACM Trans Multimed Comput Commun Appl 17:15CrossRef Xiao Junsheng, Huahu Xu, Gao Honghao, Bian Minjie, Li Yang (2021) A weakly supervised semantic segmentation network by aggregating seed cues: the multi-object proposal generation perspective. ACM Trans Multimed Comput Commun Appl 17:15CrossRef
4.
Zurück zum Zitat Brendan Mcmahan H, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Arcas (2016) Communication-efficient learning of deep networks from decentralized data. PMLR 54:1273–1282 Brendan Mcmahan H, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Arcas (2016) Communication-efficient learning of deep networks from decentralized data. PMLR 54:1273–1282
5.
Zurück zum Zitat Mcmahan H Brendan, Moore Eider, Ramage Daniel, Arcas Blaise (2017) Federated learning of deep networks using model averaging. arXiv preprint arXiv:1602.05629 Mcmahan H Brendan, Moore Eider, Ramage Daniel, Arcas Blaise (2017) Federated learning of deep networks using model averaging. arXiv preprint arXiv:​1602.​05629
6.
Zurück zum Zitat Ke Guolin, Meng Qi, Thomas Finley (2017) A highly efficient gradient boosting decision tree. NIPS 30:3149–3157 Ke Guolin, Meng Qi, Thomas Finley (2017) A highly efficient gradient boosting decision tree. NIPS 30:3149–3157
7.
Zurück zum Zitat Chen Lu, Xia Min, Lin Haifeng (2022) Multi-scale strip pooling feature aggregation network for cloud and cloud shadow segmentation. Neural Comput Appl 34:6149–6162CrossRef Chen Lu, Xia Min, Lin Haifeng (2022) Multi-scale strip pooling feature aggregation network for cloud and cloud shadow segmentation. Neural Comput Appl 34:6149–6162CrossRef
8.
Zurück zum Zitat T Xu M (2021) Research on long and short-term neural network recommendation model based on self-attention mechanism T Xu M (2021) Research on long and short-term neural network recommendation model based on self-attention mechanism
9.
Zurück zum Zitat Zhu Qiannan, Zhou Xiaofei, Song Zeliang, Tan Jianlong, Guo Li (2019) Dan: deep attention neural network for news recommendation. AAAI 33:5973–5980CrossRef Zhu Qiannan, Zhou Xiaofei, Song Zeliang, Tan Jianlong, Guo Li (2019) Dan: deep attention neural network for news recommendation. AAAI 33:5973–5980CrossRef
10.
Zurück zum Zitat An Mingxiao Wu, Chuhan Fangzhao, Wu, Kun Zhang, Zheng Liu, Xing Xie (2019) Neural news recommendation with long-and short-term user representations. ACL 57:336–345 An Mingxiao Wu, Chuhan Fangzhao, Wu, Kun Zhang, Zheng Liu, Xing Xie (2019) Neural news recommendation with long-and short-term user representations. ACL 57:336–345
11.
Zurück zum Zitat Smith Virginia, Chiang Chaokai, Sanjabi Maziar, Talwalkar Ameet (2017) Federated multi-task learning. NIPS:4427–4437 Smith Virginia, Chiang Chaokai, Sanjabi Maziar, Talwalkar Ameet (2017) Federated multi-task learning. NIPS:4427–4437
13.
Zurück zum Zitat Kewei Cheng, Tao Fan, Yilun Jin, Yang Liu, Tianjian Chen, Qiang Yang (2019) SecureBoost: A lossless federated learning framework. IEEE Intell Sys 36:7–98 Kewei Cheng, Tao Fan, Yilun Jin, Yang Liu, Tianjian Chen, Qiang Yang (2019) SecureBoost: A lossless federated learning framework. IEEE Intell Sys 36:7–98
14.
Zurück zum Zitat Deng Yuyang, Kamani, Mohammad Mahdi, Mahdavi Mehrdad (2020) Adaptive personalized federated learning. arXiv preprint arXiv:2003.13461 Deng Yuyang, Kamani, Mohammad Mahdi, Mahdavi Mehrdad (2020) Adaptive personalized federated learning. arXiv preprint arXiv:​2003.​13461
15.
Zurück zum Zitat Yang Qiang, Liu Yang, Cheng Yong (2019) Federated machine learning: concept and applications. ACM 10:1–19 Yang Qiang, Liu Yang, Cheng Yong (2019) Federated machine learning: concept and applications. ACM 10:1–19
16.
Zurück zum Zitat Zhuo Hankz Hankui, Feng Wenfeng, Lin Yufeng, Xu Qian, Yang Qiang (2019) Federated deep reinforcement learning. arXiv preprintarXiv:1812.03337 Zhuo Hankz Hankui, Feng Wenfeng, Lin Yufeng, Xu Qian, Yang Qiang (2019) Federated deep reinforcement learning. arXiv preprintarXiv:​1812.​03337
17.
Zurück zum Zitat Arivazhagan Manoj Ghuhan, Aggarwal Vinay, Singh Aaditya Kumar, Choudhary Sunav (2019) Federated learning with personalization layers. arXiv preprint arXiv:1901.08277v1 Arivazhagan Manoj Ghuhan, Aggarwal Vinay, Singh Aaditya Kumar, Choudhary Sunav (2019) Federated learning with personalization layers. arXiv preprint arXiv:​1901.​08277v1
18.
Zurück zum Zitat Jiang Yihan, Konečný Jakub, Rush Keith, Kannan Sreeram (2019) Improving federated learning personalization via model agnostic meta learning. arXiv preprint arXiv:1909.12488 Jiang Yihan, Konečný Jakub, Rush Keith, Kannan Sreeram (2019) Improving federated learning personalization via model agnostic meta learning. arXiv preprint arXiv:​1909.​12488
19.
Zurück zum Zitat Dinh Canh T, TranNguyen H, Nguyen Tuan Dung (2020) Personalized federated learning with moreau envelopes. arXiv preprintarXiv:2006.08848 Dinh Canh T, TranNguyen H, Nguyen Tuan Dung (2020) Personalized federated learning with moreau envelopes. arXiv preprintarXiv:​2006.​08848
20.
Zurück zum Zitat Fallah Alireza, Mokhtari Aryan, Ozdaglar Asuman (2020) Personalized federated learning: a meta-learning approach. arXiv preprint arXiv:2002.07948 Fallah Alireza, Mokhtari Aryan, Ozdaglar Asuman (2020) Personalized federated learning: a meta-learning approach. arXiv preprint arXiv:​2002.​07948
21.
22.
Zurück zum Zitat Wang Jialei, Kolar Mladen, Srerbo Nathan (2016) Distributed multi-task learning. Artif Intell Stat 51:751–760 Wang Jialei, Kolar Mladen, Srerbo Nathan (2016) Distributed multi-task learning. Artif Intell Stat 51:751–760
23.
24.
Zurück zum Zitat Liu Yang, Yang Qiang, Chen Tianjian (2019) Tutorial on federated learning and transfer learning for privacy, security and confidentiality.AAAI’19 Liu Yang, Yang Qiang, Chen Tianjian (2019) Tutorial on federated learning and transfer learning for privacy, security and confidentiality.AAAI’19
25.
Zurück zum Zitat Kairouz Peter, McMahan H. Brendan, Avent Brendan, Bellet Aurélien, Bennis Mehdi, Bhagoji Arjun Nitin, Bonawitz Kallista, et al Charles Zachary (2019) Advances and open problems in federated learning Kairouz Peter, McMahan H. Brendan, Avent Brendan, Bellet Aurélien, Bennis Mehdi, Bhagoji Arjun Nitin, Bonawitz Kallista, et al Charles Zachary (2019) Advances and open problems in federated learning
26.
Zurück zum Zitat Lecun Yann, Bottou Leon (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324CrossRef Lecun Yann, Bottou Leon (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324CrossRef
27.
Zurück zum Zitat Song Lei, Xia Min, Weng Liguo, Lin Haifeng, Qian Min, Chen Bingyu (2023) Axial cross attention meets cnn: bibranch fusion network for change detection. IEEE J Sel Top Appl Earth Observ Remote Sens 16:32–43CrossRef Song Lei, Xia Min, Weng Liguo, Lin Haifeng, Qian Min, Chen Bingyu (2023) Axial cross attention meets cnn: bibranch fusion network for change detection. IEEE J Sel Top Appl Earth Observ Remote Sens 16:32–43CrossRef
28.
Zurück zum Zitat Hao Yu, Yang Sen (2018) and Shenghuo Zhu. Demystifying why model averaging works for deep learning, parallel restarted sgd with faster convergence and less communication Hao Yu, Yang Sen (2018) and Shenghuo Zhu. Demystifying why model averaging works for deep learning, parallel restarted sgd with faster convergence and less communication
29.
Zurück zum Zitat Cho Kyunghyun, van Merrienboer Bart, Bahdanau Dzmitry, Yoshua Bengio (2014) Encoder-decoder approaches on the properties of neural machine translation. arXiv preprint arXiv:1409.1259 Cho Kyunghyun, van Merrienboer Bart, Bahdanau Dzmitry, Yoshua Bengio (2014) Encoder-decoder approaches on the properties of neural machine translation. arXiv preprint arXiv:​1409.​1259
30.
Zurück zum Zitat Krizhevsky Alex (2009) Learning multiple layers of features from tiny images. Tech Report. 1–60 Krizhevsky Alex (2009) Learning multiple layers of features from tiny images. Tech Report. 1–60
31.
Zurück zum Zitat Miao Shoukuan, Xia Min, Qian Ming, Zhang Yonghong, Liu Jia, Lin Haifeng (2022) Cloud/shadow segmentation based on multi-level feature enhanced network for remote sensing imagery. Int J Remote Sens 43(15–16):5940–5960CrossRef Miao Shoukuan, Xia Min, Qian Ming, Zhang Yonghong, Liu Jia, Lin Haifeng (2022) Cloud/shadow segmentation based on multi-level feature enhanced network for remote sensing imagery. Int J Remote Sens 43(15–16):5940–5960CrossRef
32.
Zurück zum Zitat Yi Qu, Xia Min, Zhang Yonghong (2021) Strip pooling channel spatial attention network for the segmentation of cloud and cloud shadow. Comput Geosci 157:104940CrossRef Yi Qu, Xia Min, Zhang Yonghong (2021) Strip pooling channel spatial attention network for the segmentation of cloud and cloud shadow. Comput Geosci 157:104940CrossRef
33.
Zurück zum Zitat Chen Bingyu, Xia Min, Qian Ming, Huang Junqing (2022) Manet: a multi-level aggregation network for semantic segmentation of high-resolution remote sensing images. Int J Remote Sens 43(15–16):5874–5894CrossRef Chen Bingyu, Xia Min, Qian Ming, Huang Junqing (2022) Manet: a multi-level aggregation network for semantic segmentation of high-resolution remote sensing images. Int J Remote Sens 43(15–16):5874–5894CrossRef
34.
Zurück zum Zitat Min Xia Xu, Zhang Wan’an Liu, Weng Liguo, Yiqing Xu (2020) Multi-stage feature constraints learning for age estimation. IEEE Trans Inf Forensics Secur 15:2417–2428CrossRef Min Xia Xu, Zhang Wan’an Liu, Weng Liguo, Yiqing Xu (2020) Multi-stage feature constraints learning for age estimation. IEEE Trans Inf Forensics Secur 15:2417–2428CrossRef
35.
Zurück zum Zitat Wang Zhiwei, Xia Min, Min Lu, Pan Lingling, Liu Jun (2022) Parameter identification in power transmission systems based on graph convolution network. IEEE Trans Power Deliv 37(4):3155–3163CrossRef Wang Zhiwei, Xia Min, Min Lu, Pan Lingling, Liu Jun (2022) Parameter identification in power transmission systems based on graph convolution network. IEEE Trans Power Deliv 37(4):3155–3163CrossRef
36.
Zurück zum Zitat Liu Jingjing, Liu Yefeng, Zhang Qichun (2022) A weight initialization method based on neural network with asymmetric activation function. Neurocomputing 483:171–182CrossRef Liu Jingjing, Liu Yefeng, Zhang Qichun (2022) A weight initialization method based on neural network with asymmetric activation function. Neurocomputing 483:171–182CrossRef
37.
Zurück zum Zitat Gao Jiahong, Weng Liguo, Xia Min, Lin Haifeng (2022) MLNet: multichannel feature fusion lozenge network for land segmentation. J Appl Remote Sens 16(1):1–19CrossRef Gao Jiahong, Weng Liguo, Xia Min, Lin Haifeng (2022) MLNet: multichannel feature fusion lozenge network for land segmentation. J Appl Remote Sens 16(1):1–19CrossRef
38.
Zurück zum Zitat Gao Liang, Fu Huazhu, Li Li, Chen Yingwen, Xu Ming, Xu Cheng-Zhong (2022) Feddc: Federated learning with non-iid data via local drift decoupling and correction. IEEE/CVF Conference on Computer Vision and Pattern Recognition, page 22094283 Gao Liang, Fu Huazhu, Li Li, Chen Yingwen, Xu Ming, Xu Cheng-Zhong (2022) Feddc: Federated learning with non-iid data via local drift decoupling and correction. IEEE/CVF Conference on Computer Vision and Pattern Recognition, page 22094283
39.
Zurück zum Zitat Reddi Sashank, Charles Zachary, Zaheer Manzil, Garrett Zachary, Rush Keith, Konecny Jakub, Kumar Sanjiv, McMahan H Brendan (2021) Adaptive federated optimization. International Conference on Representation Learning, page 13 Reddi Sashank, Charles Zachary, Zaheer Manzil, Garrett Zachary, Rush Keith, Konecny Jakub, Kumar Sanjiv, McMahan H Brendan (2021) Adaptive federated optimization. International Conference on Representation Learning, page 13
40.
Zurück zum Zitat Wei Zhao, Benyou Wang, Min Yang, Jianbo Ye, Zhou Zhao, Xiaojun Chen (2019) Leveraging long and short-term information in content-aware movie recommendation via adversarial training. IEEE Trans Cybern 50:4680–4693 Wei Zhao, Benyou Wang, Min Yang, Jianbo Ye, Zhou Zhao, Xiaojun Chen (2019) Leveraging long and short-term information in content-aware movie recommendation via adversarial training. IEEE Trans Cybern 50:4680–4693
41.
Zurück zum Zitat Robin C (2017) Geyer, Tassilo Klein, and Moin Nabi. A client level perspective. NIPS, differentially private federated learning Robin C (2017) Geyer, Tassilo Klein, and Moin Nabi. A client level perspective. NIPS, differentially private federated learning
42.
Zurück zum Zitat Mukund Deshpande, George Karypis (2004) Item-based top-n recommendation algorithms. ACM Trans Inf Syst 22:143–147CrossRef Mukund Deshpande, George Karypis (2004) Item-based top-n recommendation algorithms. ACM Trans Inf Syst 22:143–147CrossRef
Metadaten
Titel
Personalized federated learning based on multi-head attention algorithm
verfasst von
Shanshan Jiang
Meixia Lu
Kai Hu
Jiasheng Wu
Yaogen Li
Liguo Weng
Min Xia
Haifeng Lin
Publikationsdatum
27.05.2023
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 11/2023
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-023-01864-z

Weitere Artikel der Ausgabe 11/2023

International Journal of Machine Learning and Cybernetics 11/2023 Zur Ausgabe

Neuer Inhalt