Counterfactual Strategies for Markov Decision Processes
Abstract
Counterfactuals are widely used in AI to explain how minimal changes to a model’s input can lead to a different output. However, established methods for computing counterfactuals typically focus on one-step decision-making, and are not directly applicable to sequential decision-making tasks. This paper fills this gap by introducing counterfactual strategies for Markov Decision Processes (MDPs). During MDP execution, a strategy decides which of the enabled actions (with known probabilistic effects) to execute next. Given an initial strategy that reaches an undesired outcome with a probability above some limit, we identify minimal changes to the initial strategy to reduce that probability below the limit. We encode such counterfactual strategies as solutions to non-linear optimization problems, and further extend our encoding to synthesize diverse counterfactual strategies. We evaluate our approach on four real-world datasets and demonstrate its practical viability in sophisticated sequential decision-making tasks.
Cite
Text
Kobialka et al. "Counterfactual Strategies for Markov Decision Processes." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/47Markdown
[Kobialka et al. "Counterfactual Strategies for Markov Decision Processes." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/kobialka2025ijcai-counterfactual/) doi:10.24963/IJCAI.2025/47BibTeX
@inproceedings{kobialka2025ijcai-counterfactual,
title = {{Counterfactual Strategies for Markov Decision Processes}},
author = {Kobialka, Paul and Gerlach, Lina and Leofante, Francesco and Ábrahám, Erika and Tarifa, Silvia Lizeth Tapia and Johnsen, Einar Broch},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2025},
pages = {412-420},
doi = {10.24963/IJCAI.2025/47},
url = {https://mlanthology.org/ijcai/2025/kobialka2025ijcai-counterfactual/}
}