[2603.04790] Diffusion Policy through Conditional Proximal Policy Optimization
Computer Science > Machine Learning
arXiv:2603.04790 (cs)
[Submitted on 5 Mar 2026]

Title: Diffusion Policy through Conditional Proximal Policy Optimization
Authors: Ben Liu, Shunpeng Yang, Hua Chen

Abstract: Reinforcement learning (RL) has been extensively employed in a wide range of decision-making problems, such as games and robotics. Recently, diffusion policies have shown strong potential in modeling multi-modal behaviors, enabling more diverse and flexible action generation than the conventional Gaussian policy. Despite various attempts to combine RL with diffusion, a key challenge is the difficulty of computing action log-likelihood under the diffusion model. This greatly hinders the direct application of diffusion policies in on-policy reinforcement learning. Most existing methods calculate or approximate the log-likelihood through the entire denoising process of the diffusion model, which can be memory- and computationally inefficient. To overcome this challenge, we propose a novel and efficient method to train a diffusion policy in an on-policy setting that requires only evaluating a simple Gaussian probability. This is achieved by aligning the policy iteration with the diffusion process, which is a distinct paradigm compared to previous work. Moreover, our formulation can naturally handle ent...
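To make the abstract's key claim concrete, the sketch below contrasts the two log-likelihood regimes it describes: each denoising step of a diffusion policy is conditionally Gaussian, so evaluating the probability of a single step is a closed-form diagonal-Gaussian density, whereas the marginal likelihood of the final action would require accounting for the whole denoising chain. This is only an illustrative sketch, not the paper's actual algorithm: `denoise_mean`, the fixed `sigma`, and the old/new mean offsets are hypothetical stand-ins for a learned denoiser and policy snapshots.

```python
import numpy as np

def gaussian_log_prob(x, mean, sigma):
    """Log-density of a diagonal Gaussian N(mean, sigma^2 I), summed over dims."""
    return np.sum(
        -0.5 * ((x - mean) / sigma) ** 2
        - np.log(sigma)
        - 0.5 * np.log(2.0 * np.pi),
        axis=-1,
    )

def denoise_mean(state, noisy_action):
    """Hypothetical stand-in for a learned denoiser network's predicted mean."""
    return 0.9 * noisy_action + 0.1 * state

rng = np.random.default_rng(0)
sigma = 0.2                      # fixed per-step noise scale (assumption)
state = np.array([0.5, -0.3])
noisy = np.array([0.1, 0.4])     # sample entering one denoising step

# One denoising step: the next iterate is a simple Gaussian sample,
# so its log-likelihood is available in closed form.
mean_new = denoise_mean(state, noisy)
action = mean_new + sigma * rng.standard_normal(2)
logp_new = gaussian_log_prob(action, mean_new, sigma)

# PPO-style clipped ratio built from that single Gaussian density
# (mean_new + 0.05 stands in for the pre-update policy's mean).
logp_old = gaussian_log_prob(action, mean_new + 0.05, sigma)
ratio = np.exp(logp_new - logp_old)
clipped_ratio = np.clip(ratio, 1.0 - 0.2, 1.0 + 0.2)
```

The point of the contrast: computing the density of the *final* action under the full diffusion model would require marginalizing over every intermediate denoising sample, while a per-step Gaussian evaluation like `logp_new` is a single closed-form expression, which is what makes an on-policy PPO-style ratio cheap to form in this regime.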