Multiple Mean-Payoff Optimization Under Local Stability Constraints
Abstract
The long-run average payoff per transition (mean payoff) is the main tool for specifying the performance and dependability properties of discrete systems. The problem of constructing a controller (strategy) simultaneously optimizing several mean payoffs has been deeply studied for stochastic and game-theoretic models. One common issue of the constructed controllers is the instability of the mean payoffs, measured by the deviations of the average rewards per transition computed in a finite "window" sliding along a run. Unfortunately, the problem of simultaneously optimizing the mean payoffs under local stability constraints is computationally hard, and the existing works do not provide a practically usable algorithm even for non-stochastic models such as two-player games. In this paper, we design and evaluate the first efficient and scalable solution to this problem applicable to Markov decision processes.
Cite
Text
Klaska et al. "Multiple Mean-Payoff Optimization Under Local Stability Constraints." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I25.34856Markdown
[Klaska et al. "Multiple Mean-Payoff Optimization Under Local Stability Constraints." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/klaska2025aaai-multiple/) doi:10.1609/AAAI.V39I25.34856BibTeX
@inproceedings{klaska2025aaai-multiple,
title = {{Multiple Mean-Payoff Optimization Under Local Stability Constraints}},
author = {Klaska, David and Kucera, Antonín and Kur, Vojtech and Musil, Vít and Rehák, Vojtech},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {26551-26558},
doi = {10.1609/AAAI.V39I25.34856},
url = {https://mlanthology.org/aaai/2025/klaska2025aaai-multiple/}
}