Causal Logistic Bandits with Counterfactual Fairness Constraints
Abstract
Artificial intelligence will play a significant role in decision making in numerous aspects of society. Numerous fairness criteria have been proposed in the machine learning community, but there remains limited investigation into fairness as defined through specified attributes in a sequential decision-making framework. In this paper, we focus on causal logistic bandit problems where the learner seeks to make fair decisions, under a notion of fairness that accounts for counterfactual reasoning. We propose and analyze an algorithm by leveraging primal-dual optimization for constrained causal logistic bandits where the non-linear constraints are a priori unknown and must be learned in time. We obtain sub-linear regret guarantees with leading term similar to that for unconstrained logistic bandits (Lee et al., 2024) while guaranteeing sub-linear constraint violations. We show how to achieve zero cumulative constraint violations with a small increase in the regret bound.
Cite
Text
Chen et al. "Causal Logistic Bandits with Counterfactual Fairness Constraints." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Chen et al. "Causal Logistic Bandits with Counterfactual Fairness Constraints." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/chen2025icml-causal/)BibTeX
@inproceedings{chen2025icml-causal,
title = {{Causal Logistic Bandits with Counterfactual Fairness Constraints}},
author = {Chen, Jiajun and Tian, Jin and Quinn, Christopher John},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {9146-9175},
volume = {267},
url = {https://mlanthology.org/icml/2025/chen2025icml-causal/}
}