Equivalence between Graph Spectral Clustering and Column Subset Selection (Student Abstract)
DOI:
https://doi.org/10.1609/aaai.v38i21.30521Keywords:
Column Subset Selection, NCut, Graph Spectral Clustering, RatioCut, Optimality Of Spectral Clustering, NP-hardness Of Column Subset SelectionAbstract
The common criteria for evaluating spectral clustering are NCut and RatioCut. The seemingly unrelated column subset selection (CSS) problem aims to compute a column subset that linearly approximates the entire matrix. A common criterion is the approximation error in the Frobenius norm (ApproxErr). We show that any algorithm for CSS can be viewed as a clustering algorithm that minimizes NCut by applying it to a matrix formed from graph edges. Conversely, any clustering algorithm can be seen as identifying a column subset from that matrix. In both cases, ApproxErr and NCut have the same value. Analogous results hold for RatioCut with a slightly different matrix. Therefore, established results for CSS can be mapped to spectral clustering. We use this to obtain new clustering algorithms, including an optimal one that is similar to A*. This is the first nontrivial clustering algorithm with such an optimality guarantee. A variant of the weighted A* runs much faster and provides bounds on the accuracy. Finally, we use the results from spectral clustering to prove the NP-hardness of CSS from sparse matrices.Downloads
Published
2024-03-24
How to Cite
Wan, G., Mao, W., Semenov, Y. R., & Schweitzer, H. (2024). Equivalence between Graph Spectral Clustering and Column Subset Selection (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23673-23675. https://doi.org/10.1609/aaai.v38i21.30521
Issue
Section
AAAI Student Abstract and Poster Program