Device Sampling for Heterogeneous Federated Learning: Theory, Algorithms, and Implementation

The conventional federated learning (FedL) architecture distributes machine learning (ML) across worker devices by having them train local models that are periodically aggregated by a server. FedL ignores two important characteristics of contemporary wireless networks, however: (i) the network may c...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Annual Joint Conference of the IEEE Computer and Communications Societies s. 1 - 10
Hlavní autoři:	Wang, Su, Lee, Mengyuan, Hosseinalipour, Seyyedali, Morabito, Roberto, Chiang, Mung, Brinton, Christopher G.
Médium:	Konferenční příspěvek
Jazyk:	angličtina
Vydáno:	IEEE 10.05.2021
Témata:	Conferences Device-to-device communication Machine learning Network topology Servers Training Wireless networks
ISSN:	2641-9874
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	The conventional federated learning (FedL) architecture distributes machine learning (ML) across worker devices by having them train local models that are periodically aggregated by a server. FedL ignores two important characteristics of contemporary wireless networks, however: (i) the network may contain heterogeneous communication/computation resources, while (ii) there may be significant overlaps in devices' local data distributions. In this work, we develop a novel optimization methodology that jointly accounts for these factors via intelligent device sampling complemented by device-to-device (D2D) offloading. Our optimization aims to select the best combination of sampled nodes and data offloading configuration to maximize FedL training accuracy subject to realistic constraints on the network topology and device capabilities. Theoretical analysis of the D2D offloading subproblem leads to new FedL convergence bounds and an efficient sequential convex optimizer. Using this result, we develop a sampling methodology based on graph convolutional networks (GCNs) which learns the relationship between network attributes, sampled nodes, and resulting offloading that maximizes FedL accuracy. Through evaluation on real-world datasets and network measurements from our IoT testbed, we find that our methodology while sampling less than 5% of all devices outperforms conventional FedL substantially both in terms of trained model accuracy and required resource utilization.
ISSN:	2641-9874
DOI:	10.1109/INFOCOM42981.2021.9488906