An Optimal Transport View for Subspace Clustering and Spectral Clustering
DOI:
https://doi.org/10.1609/aaai.v38i15.29563Keywords:
ML: ClusteringAbstract
Clustering is one of the most fundamental problems in machine learning and data mining, and many algorithms have been proposed in the past decades. Among them, subspace clustering and spectral clustering are the most famous approaches. In this paper, we provide an explanation for subspace clustering and spectral clustering from the perspective of optimal transport. Optimal transport studies how to move samples from one distribution to another distribution with minimal transport cost, and has shown a powerful ability to extract geometric information. By considering a self optimal transport model with only one group of samples, we observe that both subspace clustering and spectral clustering can be explained in the framework of optimal transport, and the optimal transport matrix bridges the spaces of features and spectral embeddings. Inspired by this connection, we propose a spectral optimal transport barycenter model, which learns spectral embeddings by solving a barycenter problem equipped with an optimal transport discrepancy and guidance of data. Based on our proposed model, we take advantage of optimal transport to exploit both feature and metric information involved in data for learning coupled spectral embeddings and affinity matrix in a unified model. We develop an alternating optimization algorithm to solve the resultant problems, and conduct experiments in different settings to evaluate the performance of our proposed methods.Downloads
Published
2024-03-24
How to Cite
Yan, Y., Xu, Z., Yang, C., Zhang, J., Cai, R., & Ng, M. K.-P. (2024). An Optimal Transport View for Subspace Clustering and Spectral Clustering. Proceedings of the AAAI Conference on Artificial Intelligence, 38(15), 16281-16289. https://doi.org/10.1609/aaai.v38i15.29563
Issue
Section
AAAI Technical Track on Machine Learning VI