Counterfactual Fairness by Combining Factual and Counterfactual Predictions
Abstract
In high-stakes domains such as healthcare and hiring, the role of machine learning (ML) in decision-making raises significant fairness concerns. This work focuses on Counterfactual Fairness (CF), which posits that an ML model's outcome on any individual should remain unchanged if they had belonged to a different demographic group.Previous works have proposed methods that guarantee CF. Notwithstanding, their effects on the model's predictive performance remain largely unclear.To fill this gap, we provide a theoretical study on the inherent trade-off between CF and predictive performance in a model-agnostic manner. We first propose a simple but effective method to cast an optimal but potentially unfair predictor into a fair one with a minimal loss of performance.By analyzing the excess risk incurred by perfect CF, we quantify this inherent trade-off. Further analysis on our method's performance with access to only incomplete causal knowledge is also conducted. Built upon this, we propose a practical algorithm that can be applied in such scenarios. Experiments on both synthetic and semi-synthetic datasets demonstrate the validity of our analysis and methods.
Cite
Text
Zhou et al. "Counterfactual Fairness by Combining Factual and Counterfactual Predictions." Neural Information Processing Systems, 2024. doi:10.52202/079017-1517Markdown
[Zhou et al. "Counterfactual Fairness by Combining Factual and Counterfactual Predictions." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/zhou2024neurips-counterfactual/) doi:10.52202/079017-1517BibTeX
@inproceedings{zhou2024neurips-counterfactual,
title = {{Counterfactual Fairness by Combining Factual and Counterfactual Predictions}},
author = {Zhou, Zeyu and Liu, Tianci and Bai, Ruqi and Gao, Jing and Kocaoglu, Murat and Inouye, David I.},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-1517},
url = {https://mlanthology.org/neurips/2024/zhou2024neurips-counterfactual/}
}