CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems
DOI:
https://doi.org/10.1609/aaai.v38i13.29372Keywords:
ML: Transparent, Interpretable, Explainable ML, ML: Reinforcement Learning, APP: Web, APP: Other ApplicationsAbstract
We present CrystalBox, a novel, model-agnostic, posthoc explainability framework for Deep Reinforcement Learning (DRL) controllers in the large family of input-driven environments which includes computer systems. We combine the natural decomposability of reward functions in input-driven environments with the explanatory power of decomposed returns. We propose an efficient algorithm to generate future-based explanations across both discrete and continuous control environments. Using applications such as adaptive bitrate streaming and congestion control, we demonstrate CrystalBox's capability to generate high-fidelity explanations. We further illustrate its higher utility across three practical use cases: contrastive explanations, network observability, and guided reward design, as opposed to prior explainability techniques that identify salient features.Downloads
Published
2024-03-24
How to Cite
Patel, S., Abdu Jyothi, S., & Narodytska, N. (2024). CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems. Proceedings of the AAAI Conference on Artificial Intelligence, 38(13), 14563-14571. https://doi.org/10.1609/aaai.v38i13.29372
Issue
Section
AAAI Technical Track on Machine Learning IV