Did I Do That? Blame as a Means to Identify Controlled Effects in Reinforcement Learning
Abstract
Affordance learning is a crucial ability of intelligent agents. This ability relies on understanding the different ways the environment can be controlled. Approaches encouraging RL agents to model controllable aspects of their environment have repeatedly achieved state-of-the-art results. Despite their success, these approaches have only been studied using generic tasks as a proxy but have not yet been evaluated in isolation. In this work, we study the problem of identifying controlled effects from a causal perspective. Humans compare counterfactual outcomes to assign a degree of blame to their actions. Following this idea, we propose Controlled Effect Network (CEN), a self-supervised method based on the causal concept of blame. CEN is evaluated in a wide range of environments against two state-of-the-art models, showing that it precisely identifies controlled effects.
Cite
Text
Corcoll et al. "Did I Do That? Blame as a Means to Identify Controlled Effects in Reinforcement Learning." Transactions on Machine Learning Research, 2022.Markdown
[Corcoll et al. "Did I Do That? Blame as a Means to Identify Controlled Effects in Reinforcement Learning." Transactions on Machine Learning Research, 2022.](https://mlanthology.org/tmlr/2022/corcoll2022tmlr-blame/)BibTeX
@article{corcoll2022tmlr-blame,
title = {{Did I Do That? Blame as a Means to Identify Controlled Effects in Reinforcement Learning}},
author = {Corcoll, Oriol and Mohamed, Youssef Sherif Mansour and Vicente, Raul},
journal = {Transactions on Machine Learning Research},
year = {2022},
url = {https://mlanthology.org/tmlr/2022/corcoll2022tmlr-blame/}
}