A More Efficient Reduction from Outlier-Aware to Outlier-Free k-Median

Authors

  • Zhen Zhang Hunan University of Technology and Business Xiangjiang Laboratory
  • Han Peng Hunan University of Technology and Business Xiangjiang Laboratory
  • Limei Liu Hunan University of Technology and Business Xiangjiang Laboratory
  • Junyu Huang Central South University
  • Xiaolong Li Hunan University of Technology and Business Xiangjiang Laboratory
  • Qilong Feng Central South University

DOI:

https://doi.org/10.1609/aaai.v40i34.40090

Abstract

Given a non-negative integer \ell, the k-median with outliers problem extends the standard k-median problem by allowing the removal of up to \ell points and minimizing the clustering cost over the remaining ones. Algorithmic development in this setting remains an active area of research due to its relevance in processing noisy data. In this paper, we present a sampling-based reduction from the k-median with outliers problem to its outlier-free counterpart. The reduction incurs a multiplicative overhead of (kℓ⁻¹ + ε⁻¹)^O(ℓ) in the running time: it yields (kℓ⁻¹ + ε⁻¹)^O(ℓ) outlier-free instances, a solution to one of which can be directly transformed into a solution to the original instance with an arbitrarily small loss in the approximation ratio. This improves upon the previously known reduction with an overhead of ((k + ℓ)ε⁻¹)^O(ℓ). As applications, we obtain faster fixed-parameter tractable (FPT) algorithms with tight approximation guarantees for the k-median with outliers problem under various metric spaces. Furthermore, our approach naturally generalizes to constrained variants of the problem where additional constraints are imposed on the cluster sizes, and yields similar improvements in their FPT approximations.

Published

2026-03-14

How to Cite

Zhang, Z., Peng, H., Liu, L., Huang, J., Li, X., & Feng, Q. (2026). A More Efficient Reduction from Outlier-Aware to Outlier-Free k-Median. Proceedings of the AAAI Conference on Artificial Intelligence, 40(34), 28591–28599. https://doi.org/10.1609/aaai.v40i34.40090

Issue

Section

AAAI Technical Track on Machine Learning XI