KnowThyself: An Agentic Assistant for LLM Interpretability

Authors

  • Suraj Prasai Wake Forest University
  • Mengnan Du New Jersey Institute of Technology
  • Ying Zhang Wake Forest University
  • Fan Yang Wake Forest University

DOI:

https://doi.org/10.1609/aaai.v40i48.42373

Abstract

We develop KnowThyself, an agentic assistant that advances large language model (LLM) interpretability. Existing tools provide useful insights but remain fragmented and code-intensive. KnowThyself consolidates these capabilities into a chat-based interface, where users can upload models, pose natural language questions, and obtain interactive visualizations with guided explanations. At its core, an orchestrator LLM first reformulates user queries, an agent router further directs them to specialized modules, and the outputs are finally contextualized into coherent explanations. This design lowers technical barriers and provides an extensible platform for LLM inspection. By embedding the whole process into a conversational workflow, KnowThyself offers a robust foundation for accessible LLM interpretability.

Published

2026-03-14

How to Cite

Prasai, S., Du, M., Zhang, Y., & Yang, F. (2026). KnowThyself: An Agentic Assistant for LLM Interpretability. Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41661–41663. https://doi.org/10.1609/aaai.v40i48.42373