Learning Stabilizing Controllers for Unstable Linear Quadratic Regulators from a Single Trajectory

Abstract

The principal task to control dynamical systems is to ensure their stability. When the system is unknown, robust approaches are promising since they aim to stabilize a large set of plausible systems simultaneously. We study linear controllers under quadratic costs model also known as linear quadratic regulators (LQR). We present two different semi-definite programs (SDP) which results in a controller that stabilizes all systems within an ellipsoid uncertainty set. We further show that the feasibility conditions of the proposed SDPs are \emph{equivalent}. Using the derived robust controller syntheses, we propose an efficient data dependent algorithm – \textsc{eXploration} – that with high probability quickly identifies a stabilizing controller. Our approach can be used to initialize existing algorithms that require a stabilizing controller as an input while adding constant to the regret. We further propose different heuristics which empirically reduce the number of steps taken by \textsc{eXploration} and reduce the suffered cost while searching for a stabilizing controller.

Cite

Text

Treven et al. "Learning Stabilizing Controllers for Unstable Linear Quadratic Regulators from a Single Trajectory." Proceedings of the 3rd Conference on Learning for Dynamics and Control, 2021.

Markdown

[Treven et al. "Learning Stabilizing Controllers for Unstable Linear Quadratic Regulators from a Single Trajectory." Proceedings of the 3rd Conference on Learning for Dynamics and Control, 2021.](https://mlanthology.org/l4dc/2021/treven2021l4dc-learning/)

BibTeX

@inproceedings{treven2021l4dc-learning,
  title     = {{Learning Stabilizing Controllers for Unstable Linear Quadratic Regulators from a Single Trajectory}},
  author    = {Treven, Lenart and Curi, Sebastian and Mutný, Mojmír and Krause, Andreas},
  booktitle = {Proceedings of the 3rd Conference on Learning for Dynamics and Control},
  year      = {2021},
  pages     = {664-676},
  volume    = {144},
  url       = {https://mlanthology.org/l4dc/2021/treven2021l4dc-learning/}
}