Asynchronous Federated Clustering with Unknown Number of Clusters

Authors

  • Yunfan Zhang School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, China
  • Yiqun Zhang School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, China Department of Computer Science, Hong Kong Baptist University, Hong Kong SAR, China
  • Yang Lu Fujian Key Laboratory of Sensing and Computing for Smart City, School of Informatics, Xiamen University, Xiamen, China
  • Mengke Li College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China
  • Xi Chen Department of Computer Science, Hong Kong Baptist University, Hong Kong SAR, China
  • Yiu-ming Cheung Department of Computer Science, Hong Kong Baptist University, Hong Kong SAR, China

DOI:

https://doi.org/10.1609/aaai.v39i21.34429

Abstract

Federated Clustering (FC) is crucial to mining knowledge from unlabeled non-Independent Identically Distributed (non-IID) data provided by multiple clients while preserving their privacy. Most existing attempts learn cluster distributions at local clients, then securely pass the desensitized information to the server for aggregation. However, some tricky but common FC problems are still relatively unexplored, including the heterogeneity in terms of clients' communication capacity and the unknown number of proper clusters. To further bridge the gap between FC and real application scenarios, this paper first shows that the clients' communication asynchrony and unknown proper cluster numbers are complex coupling problems, and then proposes an Asynchronous Federated Cluster Learning (AFCL) method accordingly. It spreads the excessive number of seed points to clients as a learning medium and coordinates them across clients to form a consensus. To alleviate the distribution imbalance cumulated due to the unforeseen asynchronous uploading from the heterogeneous clients, we also design a balancing mechanism for seeds updating. As a result, the seeds gradually adapt to each other to reveal a proper number of clusters. Extensive experiments demonstrate the efficacy of AFCL.

Published

2025-04-11

How to Cite

Zhang, Y., Zhang, Y., Lu, Y., Li, M., Chen, X., & Cheung, Y.- ming. (2025). Asynchronous Federated Clustering with Unknown Number of Clusters. Proceedings of the AAAI Conference on Artificial Intelligence, 39(21), 22695–22703. https://doi.org/10.1609/aaai.v39i21.34429

Issue

Section

AAAI Technical Track on Machine Learning VII