Perturbation-Based Regret Analysis of Predictive Control in Linear Time Varying Systems
Abstract
We study predictive control in a setting where the dynamics are time-varying and linear, and the costs are time-varying and well-conditioned. At each time step, the controller receives the exact predictions of costs, dynamics, and disturbances for the future $k$ time steps. We show that when the prediction window $k$ is sufficiently large, predictive control is input-to-state stable and achieves a dynamic regret of $O(\lambda^k T)$, where $\lambda < 1$ is a positive constant. This is the first dynamic regret bound on the predictive control of linear time-varying systems. We also show a variation of predictive control obtains the first competitive bound for the control of linear time-varying systems: $1 + O(\lambda^k)$. Our results are derived using a novel proof framework based on a perturbation bound that characterizes how a small change to the system parameters impacts the optimal trajectory.
Cite
Text
Lin et al. "Perturbation-Based Regret Analysis of Predictive Control in Linear Time Varying Systems." Neural Information Processing Systems, 2021.Markdown
[Lin et al. "Perturbation-Based Regret Analysis of Predictive Control in Linear Time Varying Systems." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/lin2021neurips-perturbationbased/)BibTeX
@inproceedings{lin2021neurips-perturbationbased,
title = {{Perturbation-Based Regret Analysis of Predictive Control in Linear Time Varying Systems}},
author = {Lin, Yiheng and Hu, Yang and Shi, Guanya and Sun, Haoyuan and Qu, Guannan and Wierman, Adam},
booktitle = {Neural Information Processing Systems},
year = {2021},
url = {https://mlanthology.org/neurips/2021/lin2021neurips-perturbationbased/}
}