Toward Human-Aware Active Directory Defense with Fine-Tuned LLMs

Hung X. Nguyen; Tu Vu

doi:10.1609/aaaiss.v9i1.42949

Authors

Hung X. Nguyen Adelaide University
Tu Vu Adelaide University

DOI:

https://doi.org/10.1609/aaaiss.v9i1.42949

Abstract

Autonomous cyber defense agents increasingly need to query complex enterprise attack graphs under time pressure, yet today’s Active Directory (AD) tools still assume expert-authored Cypher queries. This paper studies natural-language-to-Cypher generation as a core tool-use action for human-aware AD defense agents operating over BloodHound-style graphs. We curate 346 executable English–Cypher pairs from practitioner queries, augment them to 2,768 samples via constrained paraphrasing, and fine-tune an open-weight Mixtral‑8×7B model using QLoRA. Across ten representative BloodHound-equivalency tasks on five synthetic AD graphs, fine-tuning raises parse success from 0.80 to 0.94 and correct-answer rate from 0.34 to 0.42. These gains should be interpreted as evidence that domain tuning can improve syntactic robustness in a controlled setting, rather than as evidence of production-ready semantic reliability or real-enterprise scalability. Error analysis reveals safety-critical semantic gaps in domain admin identification, unsupported-OS detection, and temporal constraints. We argue that, for cyber defense agents, schema prompting and fine-tuning alone are insufficient: safe autonomy over AD graphs requires execution-grounded correction, explicit domain guardrails, and interaction designs that calibrate human trust in LLM-generated queries.

Toward Human-Aware Active Directory Defense with Fine-Tuned LLMs

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information