Progressive Distribution Matching for Federated Semi-Supervised Learning

Dongping Liao; Xitong Gao; Yabo Xu; Cheng-Zhong Xu

doi:10.1609/aaai.v39i5.32551

Authors

Dongping Liao University of Macau
Xitong Gao Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences Shenzhen University of Advanced Technology
Yabo Xu DataStory Information Technology Co., Ltd
Cheng-Zhong Xu University of Macau

DOI:

https://doi.org/10.1609/aaai.v39i5.32551

Abstract

Federated Learning (FL) enables collaborative learning from distributed data while preserving the privacy of participating clients. While supervised federated learning with labeled data has made notable strides and achieved success, federated semi-supervised learning (FSSL) lags in its progress. Existing works for FSSL heavily rely on fully-labeled clients, while ignoring the distribution of pseudo-labels generated from skewed unlabeled data. In this work, we offer empirical and theoretical insights into the challenges encountered when applying conventional semi-supervised algorithms in the federated regime. Specifically, we highlight how the inherent data heterogeneity in FSSL can exacerbate issues within the pseudo-labeling process. Motivated by these observations, we propose federated learning with progressive distribution matching (FedPDM) to regularize the distribution of pseudo-labels, aiming to progressively reshape it to align with the ground-truth distribution. The matching problem could be formulated as an optimal transport (OT) problem and efficiently solved by Sinkhorn-Knopp iteration. Through extensive experiments, we demonstrate the superiority of FedPDM on a variety of models and datasets compared with prior arts for FSSL.

Progressive Distribution Matching for Federated Semi-Supervised Learning

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information