TriFusion-IDS: A Multimodal Graph-Tabular-Text Contrastive Framework for Cross-Dataset Intrusion Detection

Authors

  • Qinxin Zhao, National Key Lab for Novel Software Technology, Nanjing University
  • Sheng Zhong, National Key Lab for Novel Software Technology, Nanjing University

DOI:

https://doi.org/10.1609/aaai.v40i19.38682

Abstract

Traditional Intrusion Detection Systems (IDS) are typically trained in a specific network environment, and their performance often degrades significantly when they are deployed in new environments with different attack categories. To address this challenge, we define the task of cross-dataset intrusion detection and design a novel multimodal contrastive learning framework named TriFusion-IDS. The framework represents network traffic through three complementary views: a graph view that captures structural communication patterns, a tabular view that models statistical features, and a textual view that describes the semantics of attacks. TriFusion-IDS fuses the graph and tabular representations and aligns them with the textual descriptions in a shared embedding space using a CLIP-style contrastive loss. This semantics-based alignment enables the model to recognize attack categories that were not seen during training (zero-shot categories) and thus to generalize to new network environments. Extensive experiments on several mainstream datasets demonstrate that the method significantly outperforms existing baselines in cross-dataset intrusion detection scenarios.
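
The alignment step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the fusion by concatenation plus a linear projection, the function names (`fuse`, `clip_style_loss`, `zero_shot_predict`), and the temperature value 0.07 are all assumptions made for illustration. The sketch only shows the general shape of a symmetric CLIP-style contrastive loss between fused traffic embeddings and text embeddings, and how nearest-text matching could label a category that was never seen in training.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def fuse(graph_emb, tab_emb, proj):
    """Hypothetical fusion: concatenate the two views, then project linearly.

    The abstract does not specify the fusion operator; this is an assumption.
    """
    return np.concatenate([graph_emb, tab_emb], axis=-1) @ proj


def clip_style_loss(traffic_emb, text_emb, temperature=0.07):
    """Symmetric InfoNCE loss over a batch of (traffic, text) pairs.

    Row i of traffic_emb is assumed to be paired with row i of text_emb.
    The temperature is an assumed value, not one taken from the paper.
    """
    t = l2_normalize(traffic_emb)
    s = l2_normalize(text_emb)
    logits = t @ s.T / temperature          # (batch, batch) similarity matrix
    idx = np.arange(logits.shape[0])
    loss_traffic_to_text = -log_softmax(logits, axis=1)[idx, idx].mean()
    loss_text_to_traffic = -log_softmax(logits, axis=0)[idx, idx].mean()
    return 0.5 * (loss_traffic_to_text + loss_text_to_traffic)


def zero_shot_predict(traffic_emb, class_text_emb):
    """Assign each traffic sample to the class whose description is most similar."""
    sims = l2_normalize(traffic_emb) @ l2_normalize(class_text_emb).T
    return sims.argmax(axis=1)
```

Under these assumptions, training would minimize `clip_style_loss` on source-dataset pairs, and inference on a new dataset would call `zero_shot_predict` with text embeddings of that dataset's attack descriptions, so no classifier head tied to the training label set is needed.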

Published

2026-03-14

How to Cite

Zhao, Q., & Zhong, S. (2026). TriFusion-IDS: A Multimodal Graph-Tabular-Text Contrastive Framework for Cross-Dataset Intrusion Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 40(19), 16433–16440. https://doi.org/10.1609/aaai.v40i19.38682

Section

AAAI Technical Track on Data Mining & Knowledge Management III