ConstrainedZero: Chance-Constrained POMDP Planning Using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints

Abstract

In many real applications, the data attributes are incremental and the samples are stored with accumulated feature spaces gradually. Although there are several elegant approaches to tackling this problem, the theoretical analysis is still limited. There exist at least two challenges and fundamental questions. 1) How to derive the generalization bounds of these approaches? 2) Under what conditions do these approaches have a strong generalization guarantee? To solve these crucial but rarely studied problems, we provide a comprehensive theoretical analysis in this paper. We begin by summarizing and refining four strategies for addressing feature incremental data. Subsequently, we derive their generalization bounds, providing rigorous and quantitative insights. The theoretical findings highlight the key factors influencing the generalization abilities of different strategies. In tackling the above two fundamental problems, we also provide valuable guidance for exploring other learning challenges in dynamic environments. Finally, the comprehensive experimental and theoretical results mutually validate each other, underscoring the reliability of our conclusions.

Cite

Text

Moss et al. "ConstrainedZero: Chance-Constrained POMDP Planning Using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/746

Markdown

[Moss et al. "ConstrainedZero: Chance-Constrained POMDP Planning Using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/moss2024ijcai-constrainedzero/) doi:10.24963/ijcai.2024/746

BibTeX

@inproceedings{moss2024ijcai-constrainedzero,
  title     = {{ConstrainedZero: Chance-Constrained POMDP Planning Using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints}},
  author    = {Moss, Robert J. and Jamgochian, Arec L. and Fischer, Johannes and Corso, Anthony and Kochenderfer, Mykel J.},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {6752-6760},
  doi       = {10.24963/ijcai.2024/746},
  url       = {https://mlanthology.org/ijcai/2024/moss2024ijcai-constrainedzero/}
}