Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator
Abstract
We consider adaptive control of the Linear Quadratic Regulator (LQR), where an unknown linear system is controlled subject to quadratic costs. Leveraging recent developments in the estimation of linear systems and in robust controller synthesis, we present the first provably polynomial time algorithm that achieves sub-linear regret on this problem. We further study the interplay between regret minimization and parameter estimation by proving a lower bound on the expected regret in terms of the exploration schedule used by any algorithm. Finally, we conduct a numerical study comparing our robust adaptive algorithm to other methods from the adaptive LQR literature, and demonstrate the flexibility of our proposed method by extending it to a demand forecasting problem subject to state constraints.
Cite
Text
Dean et al. "Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator." Neural Information Processing Systems, 2018.Markdown
[Dean et al. "Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator." Neural Information Processing Systems, 2018.](https://mlanthology.org/neurips/2018/dean2018neurips-regret/)BibTeX
@inproceedings{dean2018neurips-regret,
title = {{Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator}},
author = {Dean, Sarah and Mania, Horia and Matni, Nikolai and Recht, Benjamin and Tu, Stephen},
booktitle = {Neural Information Processing Systems},
year = {2018},
pages = {4188-4197},
url = {https://mlanthology.org/neurips/2018/dean2018neurips-regret/}
}