CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems

Abstract

We present CrystalBox, a novel, model-agnostic, posthoc explainability framework for Deep Reinforcement Learning (DRL) controllers in the large family of input-driven environments which includes computer systems. We combine the natural decomposability of reward functions in input-driven environments with the explanatory power of decomposed returns. We propose an efficient algorithm to generate future-based explanations across both discrete and continuous control environments. Using applications such as adaptive bitrate streaming and congestion control, we demonstrate CrystalBox's capability to generate high-fidelity explanations. We further illustrate its higher utility across three practical use cases: contrastive explanations, network observability, and guided reward design, as opposed to prior explainability techniques that identify salient features.

Cite

Text

Patel et al. "CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I13.29372

Markdown

[Patel et al. "CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/patel2024aaai-crystalbox/) doi:10.1609/AAAI.V38I13.29372

BibTeX

@inproceedings{patel2024aaai-crystalbox,
  title     = {{CrystalBox: Future-Based Explanations for Input-Driven Deep RL Systems}},
  author    = {Patel, Sagar and Jyothi, Sangeetha Abdu and Narodytska, Nina},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {14563-14571},
  doi       = {10.1609/AAAI.V38I13.29372},
  url       = {https://mlanthology.org/aaai/2024/patel2024aaai-crystalbox/}
}