Robust Budget Pacing with a Single Sample

Abstract

Major Internet advertising platforms offer budget pacing tools as a standard service for advertisers to manage their ad campaigns. Given the inherent non-stationarity in an advertiser’s value and also competing advertisers’ values over time, a commonly used approach is to learn a target expenditure plan that specifies a target spend as a function of time, and then run a controller that tracks this plan. This raises the question: how many historical samples are required to learn a good expenditure plan? We study this question by considering an advertiser repeatedly participating in $T$ second-price auctions, where the tuple of her value and the highest competing bid is drawn from an unknown time-varying distribution. The advertiser seeks to maximize her total utility subject to her budget constraint. Prior work has shown the sufficiency of $T\log T$ samples per distribution to achieve the optimal $O(\sqrt{T})$-regret. We dramatically improve this state-of-the-art and show that just one sample per distribution is enough to achieve the near-optimal $\tilde O(\sqrt{T})$-regret, while still being robust to noise in the sampling distributions.

Cite

Text

Balseiro et al. "Robust Budget Pacing with a Single Sample." International Conference on Machine Learning, 2023.

Markdown

[Balseiro et al. "Robust Budget Pacing with a Single Sample." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/balseiro2023icml-robust/)

BibTeX

@inproceedings{balseiro2023icml-robust,
  title     = {{Robust Budget Pacing with a Single Sample}},
  author    = {Balseiro, Santiago R. and Kumar, Rachitesh and Mirrokni, Vahab and Sivan, Balasubramanian and Wang, Di},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {1636-1659},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/balseiro2023icml-robust/}
}