Bandit Learning in Matching Markets Robust to Adversarial Corruptions

Abstract

This paper investigates the problem of bandit learning in two-sided decentralized matching markets with adversarial corruptions. In matching markets, players on one side aim to learn their unknown preferences over arms on the other side through iterative online learning, with the goal of identifying the optimal stable match. However, in real-world applications, stochastic rewards observed by players may be corrupted by malicious adversaries, potentially misleading the learning process and causing convergence to a sub-optimal match. We study this problem under two settings: one where the corruption level $C$ (defined as the sum of the largest adversarial alterations to the feedback across rounds) is known, and another where it is unknown. For the known corruption setting, we develop a robust variant of the classical Explore-Then-Gale-Shapley (ETGS) algorithm by incorporating widened confidence intervals. For the unknown corruption case, we propose a Multi-layer ETGS race method that adaptively mitigates adversarial effects without prior corruption knowledge. We provide theoretical guarantees for both algorithms by establishing upper bounds on their optimal stable regret, and further derive the lower bound to demonstrate their optimality.

Cite

Text

Wu et al. "Bandit Learning in Matching Markets Robust to Adversarial Corruptions." International Conference on Learning Representations, 2026.

Markdown

[Wu et al. "Bandit Learning in Matching Markets Robust to Adversarial Corruptions." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wu2026iclr-bandit/)

BibTeX

@inproceedings{wu2026iclr-bandit,
  title     = {{Bandit Learning in Matching Markets Robust to Adversarial Corruptions}},
  author    = {Wu, Zheshun and Zuo, Jinhang and Xu, Zenglin and Kong, Fang},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wu2026iclr-bandit/}
}