[2510.14959] CBF-RL: Safety Filtering Reinforcement Learning in Training with Control Barrier Functions
Computer Science > Robotics
arXiv:2510.14959 (cs)
[Submitted on 16 Oct 2025 (v1), last revised 5 Mar 2026 (this version, v3)]
Title: CBF-RL: Safety Filtering Reinforcement Learning in Training with Control Barrier Functions
Authors: Lizhi Yang, Blake Werner, Massimiliano de Sa, Aaron D. Ames
Abstract: Reinforcement learning (RL), while powerful and expressive, often prioritizes performance at the expense of safety. Yet safety violations can lead to catastrophic outcomes in real-world deployments. Control Barrier Functions (CBFs) offer a principled method to enforce dynamic safety -- traditionally deployed online via safety filters. While the result is safe behavior, the RL policy's lack of knowledge of the CBF can lead to conservative behaviors. This paper proposes CBF-RL, a framework for generating safe behaviors with RL by enforcing CBFs during training. CBF-RL has two key attributes: (1) minimally modifying a nominal RL policy to encode safety constraints via a CBF term, and (2) safety filtering of the policy rollouts during training. Theoretically, we prove that continuous-time safety filters can be deployed via closed-form expressions on discrete-time rollouts. Practically, we demonstrate that CBF-RL internalizes the safety constraints in the learned policy -- both enforcing...
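To make the "minimally modifying a nominal RL policy via a CBF term" idea concrete, the following is a minimal sketch of a closed-form CBF safety filter for a single barrier constraint under assumed control-affine dynamics. The function name `cbf_safety_filter` and its arguments (`h`, the barrier value; `Lf_h`, `Lg_h`, its Lie derivatives; `alpha`, the class-K gain) are illustrative assumptions, not the paper's exact formulation or API.

```python
import numpy as np

def cbf_safety_filter(u_nom, h, Lf_h, Lg_h, alpha=1.0):
    """Closed-form min-norm CBF safety filter (illustrative sketch).

    Minimally modifies the nominal action u_nom so that the CBF
    condition  Lf_h + Lg_h @ u + alpha * h >= 0  holds, assuming
    control-affine dynamics and a single barrier constraint.
    """
    u_nom = np.asarray(u_nom, dtype=float)
    Lg_h = np.asarray(Lg_h, dtype=float)
    # Margin of the CBF inequality under the nominal action.
    psi = Lf_h + Lg_h @ u_nom + alpha * h
    if psi >= 0:
        # Nominal action already satisfies the barrier condition.
        return u_nom
    # Otherwise, project u_nom onto the constraint boundary:
    # the minimum-norm correction along Lg_h.
    return u_nom - psi * Lg_h / (Lg_h @ Lg_h)
```

During training, such a filter would wrap each action sampled from the policy before it is applied to the environment, so the rollouts the agent learns from are already safe.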