Multi-Subspace Matrix Recovery from Permuted Data

Authors

  • Liangqi Xie School of Data Science, The Chinese University of HongKong, Shenzhen
  • Jicong Fan School of Data Science, The Chinese University of Hong Kong, Shenzhen

DOI:

https://doi.org/10.1609/aaai.v39i20.35471

Abstract

This paper aims to recover a multi-subspace matrix from permuted data: given a matrix, in which the columns are drawn from a union of low-dimensional subspaces and some columns are corrupted by permutations on their entries, recover the original matrix. The task has numerous practical applications such as data cleaning, integration, and de-anonymization, but it remains challenging and cannot be well addressed by existing techniques such as robust principal component analysis because of the presence of multiple subspaces and the permutations on the elements of vectors. To solve the challenge, we develop a novel four-stage algorithm pipeline including outlier identification, subspace reconstruction, outlier classification, and unsupervised sensing for permuted vector recovery. Particularly, we provide theoretical guarantees for the outlier classification step, ensuring reliable multi-subspace matrix recovery. Our pipeline is compared with state-of-the-art competitors on multiple benchmarks and shows superior performance.

Downloads

Published

2025-04-11

How to Cite

Xie, L., & Fan, J. (2025). Multi-Subspace Matrix Recovery from Permuted Data. Proceedings of the AAAI Conference on Artificial Intelligence, 39(20), 21670–21678. https://doi.org/10.1609/aaai.v39i20.35471

Issue

Section

AAAI Technical Track on Machine Learning VI