Interpretable and Robust Behavior Abstraction via Environment-Disentangled Heterogeneous Graph

Authors

  • Zhibin Ni School of Software, Tsinghua University
  • Hai Wan School of Software, Tsinghua University Hunan Sanyou Environmental Technology Co., Ltd.
  • Xibin Zhao School of Software, Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v40i2.37056

Abstract

To identify the root causes of attacks, behavior abstraction (BA) converts audit logs into multiple behavior graphs and finds similar ones, which has proven effective in bridging the semantic gap and reducing manual workload. Existing works fail to achieve both interpretability and generalization, while also exhibiting limited robustness when facing adversarial attacks. In this paper, we give the first attempt at interpretable and robust behavior abstraction and propose a novel method called Environment-Disentangled Heterogeneous Graph Neural Network (EDHGNN). Motivated by Information Bottleneck (IB) principle, we propose a Heterogeneous Subgraph Disentanglement (HSD) module to disentangle label-relevant and environmental subgraphs through single optimization. We also introduce an Adapted Graph-Level Attention (AGLA) module to extract minimal sufficient representations from label-relevant subgraphs, a Label-Guided Graph Reconstructor (LGGR) to maximize environmental information coverage via reconstruction, and a Relevance Discriminator (RD) to enhance disentanglement quality. Additionally, we construct a new dataset contains ground-truth explanations and 4,160 behavior graphs. Extensive experiments demonstrate that EDHGNN outperforms the state-of-the-art methods in terms of interpretability and robustness against adversarial attacks.

Published

2026-03-14

How to Cite

Ni, Z., Wan, H., & Zhao, X. (2026). Interpretable and Robust Behavior Abstraction via Environment-Disentangled Heterogeneous Graph. Proceedings of the AAAI Conference on Artificial Intelligence, 40(2), 881-889. https://doi.org/10.1609/aaai.v40i2.37056

Issue

Section

AAAI Technical Track on Application Domains II