Contrastive Adversarial Agents for Intentional Drift Induction in Network Intrusion Detection Systems
DOI:
https://doi.org/10.1609/aaaiss.v9i1.42946Abstract
Retraining and drift-adaptation methods for Network Intrusion Detection Systems (NIDS) are effective when adversarial behavior evolves gradually or remains predictable. However, these defenses often degrade when faced with coordinated or intentionally diverse attackers that induce abrupt and adversarial concept drift; a particularly critical vulnerability in client-server NIDS deployments for high-stakes domains such as healthcare. In this work, we investigate a stronger threat model in which an ensemble of reinforcement learning attackers collaborates to strategically disrupt adaptive defenses. We introduce a contrastive learning framework that trains multiple adversarial agents to learn distinct and non-redundant packet perturbation strategies. By encouraging behavioral diversity while maintaining attack effectiveness, the ensemble generates heterogeneous evasion patterns that are deployed sequentially or adaptively to destabilize retraining mechanisms. This approach enables systematic study of intentional, multi-modal adversarial drift and exposes vulnerabilities in standard NIDS adaptation pipelines. Our results demonstrate that contrastively trained attacker ensembles significantly reduce detection accuracy, highlighting the need for more robust and diversity-aware defensive strategies.Downloads
Published
2026-06-23
How to Cite
Rivas, E., & Piplai, A. (2026). Contrastive Adversarial Agents for Intentional Drift Induction in Network Intrusion Detection Systems. Proceedings of the AAAI Symposium Series, 9(1), 309–314. https://doi.org/10.1609/aaaiss.v9i1.42946
Issue
Section
Human-Aware AI Agents for the Cyber Battlefield: From Human Models to Autonomous Defense (Full Papers)