ConstrainedZero: Chance-Constrained POMDP Planning Using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints
Abstract
In many real applications, the data attributes are incremental and the samples are stored with accumulated feature spaces gradually. Although there are several elegant approaches to tackling this problem, the theoretical analysis is still limited. There exist at least two challenges and fundamental questions. 1) How to derive the generalization bounds of these approaches? 2) Under what conditions do these approaches have a strong generalization guarantee? To solve these crucial but rarely studied problems, we provide a comprehensive theoretical analysis in this paper. We begin by summarizing and refining four strategies for addressing feature incremental data. Subsequently, we derive their generalization bounds, providing rigorous and quantitative insights. The theoretical findings highlight the key factors influencing the generalization abilities of different strategies. In tackling the above two fundamental problems, we also provide valuable guidance for exploring other learning challenges in dynamic environments. Finally, the comprehensive experimental and theoretical results mutually validate each other, underscoring the reliability of our conclusions.
Cite
Text
Moss et al. "ConstrainedZero: Chance-Constrained POMDP Planning Using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/746Markdown
[Moss et al. "ConstrainedZero: Chance-Constrained POMDP Planning Using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/moss2024ijcai-constrainedzero/) doi:10.24963/ijcai.2024/746BibTeX
@inproceedings{moss2024ijcai-constrainedzero,
title = {{ConstrainedZero: Chance-Constrained POMDP Planning Using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints}},
author = {Moss, Robert J. and Jamgochian, Arec L. and Fischer, Johannes and Corso, Anthony and Kochenderfer, Mykel J.},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2024},
pages = {6752-6760},
doi = {10.24963/ijcai.2024/746},
url = {https://mlanthology.org/ijcai/2024/moss2024ijcai-constrainedzero/}
}