Convex Clustering Redefined: Robust Learning with the Median of Means Estimator

Authors

  • Koustav Chowdhury Indian Statistical Institute, Kolkata
  • Bibhabasu Mandal Indian Statistical Institute, Kolkata
  • Sourav De Indian Statistical Institute, Kolkata
  • Sagar Ghosh The University of Texas at Austin
  • Swagatam Das Indian Statistical Institute
  • Debolina Paul University of Oxford
  • Saptarshi Chakraborty University of Michigan

DOI:

https://doi.org/10.1609/aaai.v40i25.39197

Abstract

Clustering approaches that utilize convex loss functions have recently attracted growing interest in the formation of compact data clusters. Although classical methods like kmeans and its wide family of variants are still widely used, all of them require the number of clusters (k) to be supplied as input, and many are notably sensitive to initialization. Convex clustering provides a more stable alternative by formulating the clustering task as a convex optimization problem, ensuring a unique global solution. However, it faces challenges in handling high dimensional data, especially in the presence of noise and outliers. Additionally, strong fusion regularization, controlled by the tuning parameter, can hinder effective cluster formation within a convex clustering framework. To overcome these challenges, we introduce a robust approach that integrates convex clustering with the Median of Means (MoM) estimator, thus developing an outlier resistant and efficient clustering framework that does not necessitate a prior knowledge of the number of clusters. By leveraging the robustness of MoM alongside the stability of convex clustering, our method enhances both performance and efficiency, especially on large scale datasets. Theoretical analysis demonstrates weak consistency under specific conditions, while experiments on synthetic and real world datasets validate the method’s superior performance compared to existing approaches.

Published

2026-03-14

How to Cite

Chowdhury, K., Mandal, B., De, S., Ghosh, S., Das, S., Paul, D., & Chakraborty, S. (2026). Convex Clustering Redefined: Robust Learning with the Median of Means Estimator. Proceedings of the AAAI Conference on Artificial Intelligence, 40(25), 20606–20614. https://doi.org/10.1609/aaai.v40i25.39197

Issue

Section

AAAI Technical Track on Machine Learning II