Multiple Mean-Payoff Optimization Under Local Stability Constraints

Abstract

The long-run average payoff per transition (mean payoff) is the main tool for specifying the performance and dependability properties of discrete systems. The problem of constructing a controller (strategy) simultaneously optimizing several mean payoffs has been deeply studied for stochastic and game-theoretic models. One common issue of the constructed controllers is the instability of the mean payoffs, measured by the deviations of the average rewards per transition computed in a finite "window" sliding along a run. Unfortunately, the problem of simultaneously optimizing the mean payoffs under local stability constraints is computationally hard, and the existing works do not provide a practically usable algorithm even for non-stochastic models such as two-player games. In this paper, we design and evaluate the first efficient and scalable solution to this problem applicable to Markov decision processes.

Cite

Text

Klaska et al. "Multiple Mean-Payoff Optimization Under Local Stability Constraints." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I25.34856

Markdown

[Klaska et al. "Multiple Mean-Payoff Optimization Under Local Stability Constraints." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/klaska2025aaai-multiple/) doi:10.1609/AAAI.V39I25.34856

BibTeX

@inproceedings{klaska2025aaai-multiple,
  title     = {{Multiple Mean-Payoff Optimization Under Local Stability Constraints}},
  author    = {Klaska, David and Kucera, Antonín and Kur, Vojtech and Musil, Vít and Rehák, Vojtech},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {26551-26558},
  doi       = {10.1609/AAAI.V39I25.34856},
  url       = {https://mlanthology.org/aaai/2025/klaska2025aaai-multiple/}
}