AgentSeer: Visualizing and Evaluating Temporal Actions in Agentic AI Systems

Ilham Wicaksono; Zekun Wu; Rahul Patel; Theo King; Adriano Koshiyama; Philip Colin Treleaven

doi:10.1609/aaai.v40i48.42392

Authors

Ilham Wicaksono (Holistic AI; Centre for Artificial Intelligence, University College London)
Zekun Wu (Holistic AI; Centre for Artificial Intelligence, University College London)
Rahul Patel (Holistic AI)
Theo King (Holistic AI)
Adriano Koshiyama (Holistic AI; Centre for Artificial Intelligence, University College London)
Philip Colin Treleaven (Centre for Artificial Intelligence, University College London)

DOI: https://doi.org/10.1609/aaai.v40i48.42392

Abstract

We present AgentSeer, an interactive observability framework for agentic AI systems. Unlike conventional tracing tools that expose raw spans or model-centric metrics, AgentSeer introduces a dual graph decomposition constructed through a deterministic rule-based parser: a temporal action graph, where each prompt or tool invocation is represented as a distinct action, and a component graph capturing architectural relations among agents, tools, and memory modules. Beyond visualization, AgentSeer enables action-level red teaming, where jailbreak payloads are systematically attached to every action node (including agent messages, tool calls, and memory retrievals) to uncover vulnerabilities invisible to model-level testing. Our demonstration features a six-agent hierarchical testbed with interactive visualization and deployment-oriented safety evaluation applied directly on the same prompts and contexts, systematically revealing high-risk interactions, context-dependent vulnerabilities, and emergent behaviors. By combining structured decomposition, automated red teaming, and rule-based reliability, AgentSeer establishes a safety-first methodology for observability in multi-agent AI.
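
The dual graph decomposition and action-level red teaming described in the abstract can be illustrated with a minimal sketch. This is not AgentSeer's parser or API: the Event schema, the function names build_graphs and attach_payload, the component names, and the placeholder payload are all hypothetical, and linking actions in simple trace order is an assumed simplification of the temporal action graph.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """One trace record. Hypothetical schema, not AgentSeer's own."""
    source: str   # component that emitted the event, e.g. an agent
    target: str   # component that received it, e.g. a tool or memory module
    kind: str     # "message", "tool_call", or "memory_retrieval"
    content: str


def build_graphs(trace):
    """Derive both graphs from an ordered trace using fixed rules only."""
    actions = []             # temporal action graph: one node per event
    action_edges = []        # (earlier, later) index pairs in trace order
    component_edges = set()  # (source, target, kind) architectural relations
    for i, ev in enumerate(trace):
        actions.append(ev)
        if i > 0:
            action_edges.append((i - 1, i))
        component_edges.add((ev.source, ev.target, ev.kind))
    return actions, action_edges, component_edges


def attach_payload(actions, payload):
    """Yield one test case per action node, with the payload appended."""
    for i, ev in enumerate(actions):
        yield i, f"{ev.content}\n{payload}"


# Hypothetical three-step trace covering each action kind from the abstract.
trace = [
    Event("planner", "researcher", "message", "Find recent papers"),
    Event("researcher", "search_tool", "tool_call", "query=agent safety"),
    Event("researcher", "memory", "memory_retrieval", "load notes"),
]
actions, action_edges, component_edges = build_graphs(trace)
cases = list(attach_payload(actions, "<payload placeholder>"))
```

Because the parser in this sketch has no randomness or model calls, the same trace always yields the same two graphs, which is the property the abstract attributes to a deterministic rule-based parser. Generating one test case per action node, rather than one per model, is what distinguishes action-level testing from model-level testing here.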
