AgentSeer: Visualizing and Evaluating Temporal Actions in Agentic AI Systems
DOI:
https://doi.org/10.1609/aaai.v40i48.42392Abstract
We present AgentSeer, an interactive observability framework for agentic AI systems. Unlike conventional tracing tools that expose raw spans or model-centric metrics, AgentSeer introduces a dual graph decomposition constructed through a deterministic rule-based parser: a temporal action graph, where each prompt or tool invocation is represented as a distinct action, and a component graph capturing architectural relations among agents, tools, and memory modules. Beyond visualization, AgentSeer enables action-level red teaming, where jailbreak payloads are systematically attached to every action node (including agent messages, tool calls, and memory retrievals) to uncover vulnerabilities invisible to model-level testing. Our demonstration features a six-agent hierarchical testbed with interactive visualization and deployment-oriented safety evaluation applied directly on the same prompts and contexts, systematically revealing high-risk interactions, context-dependent vulnerabilities, and emergent behaviors. By combining structured decomposition, automated red teaming, and rule-based reliability, AgentSeer establishes a safety-first methodology for observability in multi-agent AI.Published
2026-03-14
How to Cite
Wicaksono, I., Wu, Z., Patel, R., King, T., Koshiyama, A., & Treleaven, P. C. (2026). AgentSeer: Visualizing and Evaluating Temporal Actions in Agentic AI Systems. Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41718–41720. https://doi.org/10.1609/aaai.v40i48.42392