Very Fast, Approximate Counterfactual Explanations for Decision Forests
Abstract
We consider finding a counterfactual explanation for a classification or regression forest, such as a random forest. This requires solving an optimization problem to find the closest input instance to a given instance for which the forest outputs a desired value. Finding an exact solution has a cost that is exponential on the number of leaves in the forest. We propose a simple but very effective approach: we constrain the optimization to input space regions populated by actual data points. The problem reduces to a form of nearest-neighbor search using a certain distance on a certain dataset. This has two advantages: first, the solution can be found very quickly, scaling to large forests and high-dimensional data, and enabling interactive use. Second, the solution found is more likely to be realistic in that it is guided towards high-density areas of input space.
Cite
Text
Carreira-Perpiñán and Hada. "Very Fast, Approximate Counterfactual Explanations for Decision Forests." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I6.25848Markdown
[Carreira-Perpiñán and Hada. "Very Fast, Approximate Counterfactual Explanations for Decision Forests." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/carreiraperpinan2023aaai-very/) doi:10.1609/AAAI.V37I6.25848BibTeX
@inproceedings{carreiraperpinan2023aaai-very,
title = {{Very Fast, Approximate Counterfactual Explanations for Decision Forests}},
author = {Carreira-Perpiñán, Miguel Á. and Hada, Suryabhan Singh},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2023},
pages = {6935-6943},
doi = {10.1609/AAAI.V37I6.25848},
url = {https://mlanthology.org/aaai/2023/carreiraperpinan2023aaai-very/}
}