Improved Learning Rates in Multi-Unit Uniform Price Auctions
Abstract
Motivated by the strategic participation of electricity producers in electricity day-ahead market, we study the problem of online learning in repeated multi-unit uniform price auctions focusing on the adversarial opposing bid setting. The main contribution of this paper is the introduction of a new modeling of the bid space. Indeed, we prove that a learning algorithm leveraging the structure of this problem achieves a regret of $\tilde{O}(K^{4/3}T^{2/3})$ under bandit feedback, improving over the bound of $\tilde{O}(K^{7/4}T^{3/4})$ previously obtained in the literature. This improved regret rate is tight up to logarithmic terms. %by deducing a lower bound of $\Omega (T^{2/3})$ from the dynamic pricing literature, proving the optimality in $T$ of our algorithm up to log factors. Inspired by electricity reserve markets, we further introduce a different feedback model under which all winning bids are revealed. This feedback interpolates between the full-information and bandit scenarios depending on the auctions' results. We prove that, under this feedback, the algorithm that we propose achieves regret $\tilde{O}(K^{5/2}\sqrt{T})$.
Cite
Text
Potfer et al. "Improved Learning Rates in Multi-Unit Uniform Price Auctions." Neural Information Processing Systems, 2024. doi:10.52202/079017-4138Markdown
[Potfer et al. "Improved Learning Rates in Multi-Unit Uniform Price Auctions." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/potfer2024neurips-improved/) doi:10.52202/079017-4138BibTeX
@inproceedings{potfer2024neurips-improved,
title = {{Improved Learning Rates in Multi-Unit Uniform Price Auctions}},
author = {Potfer, Marius and Baudry, Dorian and Richard, Hugo and Perchet, Vianney and Wan, Cheng},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-4138},
url = {https://mlanthology.org/neurips/2024/potfer2024neurips-improved/}
}