Causal Bandits with Propagating Inference

Abstract

Bandit is a framework for designing sequential experiments, where a learner selects an arm $A \in \mathcal{A}$ and obtains an observation corresponding to $A$ in each experiment. Theoretically, the tight regret lower-bound for the general bandit is polynomial with respect to the number of arms $|\mathcal{A}|$, and thus, to overcome this bound, the bandit problem with side-information is often considered. Recently, a bandit framework over a causal graph was introduced, where the structure of the causal graph is available as side-information and the arms are identified with interventions on the causal graph. Existing algorithms for causal bandit overcame the $\Omega(\sqrt{|\mathcal{A}|/T})$ simple-regret lower-bound; however, their algorithms work only when the interventions $\mathcal{A}$ are localized around a single node (i.e., an intervention propagates only to its neighbors). We then propose a novel causal bandit algorithm for an arbitrary set of interventions, which can propagate throughout the causal graph. We also show that it achieves $O(\sqrt{ \gamma^*\log(|\mathcal{A}|T) / T})$ regret bound, where $\gamma^*$ is determined by using a causal graph structure. In particular, if the maximum in-degree of the causal graph is a constant, then $\gamma^* = O(N^2)$, where $N$ is the number of nodes.

Cite

Text

Yabe et al. "Causal Bandits with Propagating Inference." International Conference on Machine Learning, 2018.

Markdown

[Yabe et al. "Causal Bandits with Propagating Inference." International Conference on Machine Learning, 2018.](https://mlanthology.org/icml/2018/yabe2018icml-causal/)

BibTeX

@inproceedings{yabe2018icml-causal,
  title     = {{Causal Bandits with Propagating Inference}},
  author    = {Yabe, Akihiro and Hatano, Daisuke and Sumita, Hanna and Ito, Shinji and Kakimura, Naonori and Fukunaga, Takuro and Kawarabayashi, Ken-ichi},
  booktitle = {International Conference on Machine Learning},
  year      = {2018},
  pages     = {5512-5520},
  volume    = {80},
  url       = {https://mlanthology.org/icml/2018/yabe2018icml-causal/}
}