V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions

Tayal, Mumuksh; Tayal, Manan; Singh, Aditya; Kolathaya, Shishir; Prakash, Ravi

V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions

Mumuksh Tayal, Manan Tayal, Aditya Singh, Shishir Kolathaya, Ravi Prakash

TMLR 2026

/tmlr/2026/tayal2026tmlr-vocbf/

Abstract

Ensuring safety in autonomous systems requires controllers that aim to satisfy state-wise constraints without relying on online interaction. While existing Safe Offline RL methods typically enforce soft expected-cost constraints, they struggle to ensure strict state-wise safety. Conversely, Control Barrier Functions (CBFs) offer a principled mechanism to enforce forward invariance, but often rely on expert-designed barrier functions or knowledge of the system dynamics. We introduce Value-Guided Offline Control Barrier Functions (V-OCBF), a framework that learns a neural CBF entirely from offline demonstrations. Unlike prior approaches, V-OCBF does not assume access to the dynamics model; instead, it derives a recursive finite-difference barrier update, enabling model-free learning of a barrier that propagates safety information over time. Moreover, V-OCBF incorporates an expectile-based objective that avoids querying the barrier on out-of-distribution actions and restricts updates to the dataset-supported action set. The learned barrier is then used with a Quadratic Program (QP) formulation to synthesize real-time safe control. Across multiple case studies, V-OCBF yields substantially fewer safety violations than baseline methods while maintaining strong task performance, highlighting its scalability for offline synthesis of safety-critical controllers without online interaction or hand-engineered barriers.

PDF TMLR OpenReview Code Semantic Scholar

Cite

Text

Tayal et al. "V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions." Transactions on Machine Learning Research, 2026.

Markdown

[Tayal et al. "V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/tayal2026tmlr-vocbf/)

BibTeX

@article{tayal2026tmlr-vocbf,
  title     = {{V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions}},
  author    = {Tayal, Mumuksh and Tayal, Manan and Singh, Aditya and Kolathaya, Shishir and Prakash, Ravi},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/tayal2026tmlr-vocbf/}
}