V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions
Abstract
Ensuring safety in autonomous systems requires controllers that aim to satisfy state-wise constraints without relying on online interaction. While existing Safe Offline RL methods typically enforce soft expected-cost constraints, they struggle to ensure strict state-wise safety. Conversely, Control Barrier Functions (CBFs) offer a principled mechanism to enforce forward invariance, but often rely on expert-designed barrier functions or knowledge of the system dynamics. We introduce Value-Guided Offline Control Barrier Functions (V-OCBF), a framework that learns a neural CBF entirely from offline demonstrations. Unlike prior approaches, V-OCBF does not assume access to the dynamics model; instead, it derives a recursive finite-difference barrier update, enabling model-free learning of a barrier that propagates safety information over time. Moreover, V-OCBF incorporates an expectile-based objective that avoids querying the barrier on out-of-distribution actions and restricts updates to the dataset-supported action set. The learned barrier is then used with a Quadratic Program (QP) formulation to synthesize real-time safe control. Across multiple case studies, V-OCBF yields substantially fewer safety violations than baseline methods while maintaining strong task performance, highlighting its scalability for offline synthesis of safety-critical controllers without online interaction or hand-engineered barriers.
Cite
Text
Tayal et al. "V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions." Transactions on Machine Learning Research, 2026.Markdown
[Tayal et al. "V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/tayal2026tmlr-vocbf/)BibTeX
@article{tayal2026tmlr-vocbf,
title = {{V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions}},
author = {Tayal, Mumuksh and Tayal, Manan and Singh, Aditya and Kolathaya, Shishir and Prakash, Ravi},
journal = {Transactions on Machine Learning Research},
year = {2026},
url = {https://mlanthology.org/tmlr/2026/tayal2026tmlr-vocbf/}
}