Tracking Adversarial Targets
Abstract
We study linear control problems with quadratic losses and adversarially chosen tracking targets. We present an efficient algorithm for this problem and show that, under standard conditions on the linear system, its regret with respect to an optimal linear policy grows as O(\log^2 T), where T is the number of rounds of the game. We also study a problem with adversarially chosen transition dynamics; we present an exponentially-weighted average algorithm for this problem, and we give regret bounds that grow as O(\sqrt T).
Cite
Text
Abbasi-Yadkori et al. "Tracking Adversarial Targets." International Conference on Machine Learning, 2014.Markdown
[Abbasi-Yadkori et al. "Tracking Adversarial Targets." International Conference on Machine Learning, 2014.](https://mlanthology.org/icml/2014/abbasiyadkori2014icml-tracking/)BibTeX
@inproceedings{abbasiyadkori2014icml-tracking,
title = {{Tracking Adversarial Targets}},
author = {Abbasi-Yadkori, Yasin and Bartlett, Peter and Kanade, Varun},
booktitle = {International Conference on Machine Learning},
year = {2014},
pages = {369-377},
volume = {32},
url = {https://mlanthology.org/icml/2014/abbasiyadkori2014icml-tracking/}
}