Disentangling Tabular Data Towards Better One-Class Anomaly Detection

Authors

  • Jianan Ye, Xi'an Jiaotong-Liverpool University; University of Liverpool
  • Zhaorui Tan, Xi'an Jiaotong-Liverpool University; University of Liverpool
  • Yijie Hu, Xi'an Jiaotong-Liverpool University; University of Liverpool
  • Xi Yang, Xi'an Jiaotong-Liverpool University
  • Guangliang Cheng, University of Liverpool
  • Kaizhu Huang, Duke Kunshan University

DOI:

https://doi.org/10.1609/aaai.v39i12.33425

Abstract

Tabular anomaly detection under the one-class classification setting is challenging because the concept of "normal" must be learned exclusively from samples of a single category and then used to distinguish anomalies from ordinary variation in normal data. Capturing the intrinsic correlation among attributes within normal samples is one promising way to learn this concept. The most recent effort in this direction relies on a learnable mask strategy with a reconstruction task. However, this approach risks producing uniform masks, i.e., masks under which essentially nothing is masked, which leads to less effective correlation learning. To address this issue, we presume that attributes related to others in normal samples can be divided into two non-overlapping and correlated subsets, defined as CorrSets, to capture the intrinsic correlation effectively. Accordingly, we introduce an innovative method that disentangles CorrSets from normal tabular data. To our knowledge, this is a pioneering effort to apply the concept of disentanglement to one-class anomaly detection on tabular data. Extensive experiments on 20 tabular datasets show that our method substantially outperforms state-of-the-art methods, with an average performance improvement of 6.1% on AUC-PR and 2.1% on AUC-ROC.
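
The abstract reports results as AUC-ROC and AUC-PR. As a reading aid, the following is a minimal pure-Python sketch of how these two metrics can be computed from anomaly scores. It is not the authors' evaluation code: the function names, the toy labels, and the toy scores are all hypothetical, and average precision is used here as one common estimator of AUC-PR, which may differ from the estimator used in the paper.

```python
def auc_roc(labels, scores):
    """AUC-ROC as the probability that a randomly chosen anomaly (label 1)
    receives a higher score than a randomly chosen normal sample (label 0).
    Tied scores count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one anomaly and one normal sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Average precision: the mean of the precision values measured at the
    rank of each true anomaly, with samples sorted by descending score.
    Tied scores are ordered arbitrarily in this simple version."""
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one anomaly")
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits = 0
    total = 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank  # precision at this rank
    return total / n_pos


# Hypothetical toy data: 1 marks an anomaly, 0 a normal sample.
labels = [0, 0, 1, 0, 1, 0, 0]
scores = [0.10, 0.40, 0.35, 0.20, 0.80, 0.70, 0.05]

print(auc_roc(labels, scores))            # 0.8
print(average_precision(labels, scores))  # 0.75
```

In the toy data, one anomaly is ranked below two normal samples, which lowers both metrics. AUC-PR is typically the more sensitive of the two when anomalies are rare, which is one reason both are commonly reported together.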

Published

2025-04-11

How to Cite

Ye, J., Tan, Z., Hu, Y., Yang, X., Cheng, G., & Huang, K. (2025). Disentangling Tabular Data Towards Better One-Class Anomaly Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 39(12), 13061–13068. https://doi.org/10.1609/aaai.v39i12.33425

Section

AAAI Technical Track on Data Mining & Knowledge Management II