Introduction
-
A novel heterogeneous federated learning method called Semi-HFL is proposed, which introduces a multi-branch model to solve the system heterogeneity problem in FL. There are two main innovations. First, a splitting method of multi-branching models is designed to split the global model into submodels of different complexities. Second, we give a novel aggregation method for aggregating the split submodels into a global one in the aggregation step of FL.
-
The semi-supervised learning method under the above FL framework is innovated. For heterogeneous federated settings, a “multi-teacher to multi-student” semi-supervised learning mode is formed using a modest quantity of labeled data on the server. It breaks the limitation that the traditional semi-supervised FL method applies only to the single-exit model.
-
The convergence of Semi-HFL is analyzed, and its feasibility is verified through image and text classification experiments on different data distributions. The effectiveness of Semi-HFL is proved theoretically and practically.
Related work
Heterogeneous federated learning
Fast inference
Semi-supervised learning
The proposed method: Semi-HFL
Heterogeneous federated learning
The training process
The inference process
Semi-supervised learning
Supervised learning on the server
Unsupervised learning on clients
Federated aggregation
Model fine-tuning
Algorithm
Hetergeneous FL
Semi-HFL
Convergence analysis
Experimental verification
Data set | Indicator | 1 + 2 | 1 + 3 | 2 + 3 | 1 + 2 + 3 | Avg2 | Avg3 | Prox2 | Prox3 | Proto-2 | Proto-3 |
---|---|---|---|---|---|---|---|---|---|---|---|
MNIST | Size | 195K | 1210K | 1230K | 851K | 280K | 2450K | 280K | 2450K | 4192K | 4203K |
FLOPS | 2281K | 2563K | 3675K | 2817K | 3689K | 4229K | 3689K | 4229K | 5067K | 5127K | |
Param | 45K | 303K | 308K | 212K | 66K | 617K | 66K | 617K | 197K | 197K | |
iid(%) | 98.67 | 98.55 | 98.72 | 98.15 | 97.32 | 97.00 | 97.37 | 97.57 | 98.61 | 98.72 | |
noniid(%) | 98.48 | 98.59 | 98.64 | 98.37 | 97.09 | 97.25 | 97.34 | 97.26 | 98.25 | 97.78 | |
Cifar10 | Size | 63 M | 206 M | 244 M | 174 M | 107 M | 427 M | 107 M | 427 M | 427 M | 427 M |
FLOPS | 3320 M | 3900 M | 4560 M | 3874 M | 4224 M | 5567 M | 4224 M | 5567 M | 374 M | 384 M | |
Param | 16 M | 54 M | 64 M | 45 M | 28 M | 112 M | 28 M | 112 M | 1180K | 1180K | |
iid(%) | 79.18 | 78.06 | 75.96 | 75.04 | 66.05 | 65.51 | 67.22 | 64.68 | 61.44 | 59.17 | |
noniid(%) | 78.52 | 75.55 | 72.58 | 74.89 | 66.37 | 65.11 | 65.21 | 62.91 | 30.41 | 29.61 | |
Size | 34.07 M | 34.36 M | 37.4 M | 33.82 M | 37.10 M | 42.20 M | 37.10 M | 42.20 M | 34.14 M | 36.71 M | |
Shake- | FLOPS | 601.32 M | 598.22 M | 605.36 M | 598.08 M | 643.97 M | 645.3 M | 643.97 M | 645.3 M | 643.20 M | 643.87 M |
speare | Param | 8.83 M | 8.91 M | 9.59 M | 8.72 M | 9.49 M | 11.05 M | 9.49 M | 11.05 M | 1867K | 1867K |
iid(%) | – | – | – | – | – | – | – | – | – | – | |
noniid(%) | 36.39 | 36.78 | 38.47 | 35.36 | 32.57 | 31.27 | 26.97 | 26.54 | 38.88 | 39.65 | |
MR | Size | 93 M | – | – | – | 102 M | – | 102 M | – | 101 M | – |
FLOPS | 82 M | – | – | – | 89 M | – | 89 M | – | 705 M | – | |
Param | 1926K | – | – | – | 2692K | – | 2692K | – | 22K | – | |
iid(%) | 61.82 | – | – | – | 59.75 | – | 59.56 | – | 59.87 | – | |
noniid(%) | 61.07 | – | – | – | 57.97 | – | 59.19 | – | 58.78 | – |