An Optimal Transport View for Subspace Clustering and Spectral Clustering

Authors

  • Yuguang Yan School of Computer Science, Guangdong University of Technology, Guangzhou, China
  • Zhihao Xu School of Computer Science, Guangdong University of Technology, Guangzhou, China
  • Canlin Yang School of Computer Science, Guangdong University of Technology, Guangzhou, China
  • Jie Zhang Department of Mathematics, The University of Hong Kong, Hong Kong, China
  • Ruichu Cai School of Computer Science, Guangdong University of Technology, Guangzhou, China Peng Cheng Laboratory, Shenzhen, China
  • Michael Kwok-Po Ng Department of Mathematics, Hong Kong Baptist University, Hong Kong, China

DOI:

https://doi.org/10.1609/aaai.v38i15.29563

Keywords:

ML: Clustering

Abstract

Clustering is one of the most fundamental problems in machine learning and data mining, and many algorithms have been proposed in the past decades. Among them, subspace clustering and spectral clustering are the most famous approaches. In this paper, we provide an explanation for subspace clustering and spectral clustering from the perspective of optimal transport. Optimal transport studies how to move samples from one distribution to another distribution with minimal transport cost, and has shown a powerful ability to extract geometric information. By considering a self optimal transport model with only one group of samples, we observe that both subspace clustering and spectral clustering can be explained in the framework of optimal transport, and the optimal transport matrix bridges the spaces of features and spectral embeddings. Inspired by this connection, we propose a spectral optimal transport barycenter model, which learns spectral embeddings by solving a barycenter problem equipped with an optimal transport discrepancy and guidance of data. Based on our proposed model, we take advantage of optimal transport to exploit both feature and metric information involved in data for learning coupled spectral embeddings and affinity matrix in a unified model. We develop an alternating optimization algorithm to solve the resultant problems, and conduct experiments in different settings to evaluate the performance of our proposed methods.

Downloads

Published

2024-03-24

How to Cite

Yan, Y., Xu, Z., Yang, C., Zhang, J., Cai, R., & Ng, M. K.-P. (2024). An Optimal Transport View for Subspace Clustering and Spectral Clustering. Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), 16281-16289. https://doi.org/10.1609/aaai.v38i15.29563

Issue

Section

AAAI Technical Track on Machine Learning VI