Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High Dimensions

Authors

  • Kangke Cheng University of Science and Technology of China, Hefei, China
  • Shihong Song School of Informatics, University of Edinburgh, Edinburgh, UK
  • Guanlin Mo University of Science and Technology of China, Hefei, China
  • Hu Ding University of Science and Technology of China, Hefei, China

DOI:

https://doi.org/10.1609/aaai.v40i25.39186

Abstract

In this paper, we investigate the learning-augmented k-median clustering problem, which aims to improve the performance of traditional clustering algorithms by preprocessing the point set with a predictor of error rate α ∈ [0,1). This preprocessing step assigns potential labels to the points before clustering. We introduce an algorithm for this problem based on a simple yet effective sampling method, which substantially improves upon the time complexities of existing algorithms. Moreover, we mitigate their exponential dependency on the dimensionality of the Euclidean space. Lastly, we conduct experiments to compare our method with several state-of-the-art learning-augmented k-median clustering methods. The experimental results suggest that our proposed approach can significantly reduce the computational complexity in practice, while achieving a lower clustering cost.

Downloads

Published

2026-03-14

How to Cite

Cheng, K., Song, S., Mo, G., & Ding, H. (2026). Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High Dimensions. Proceedings of the AAAI Conference on Artificial Intelligence, 40(25), 20509–20517. https://doi.org/10.1609/aaai.v40i25.39186

Issue

Section

AAAI Technical Track on Machine Learning II