[2602.09580] SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows

[2602.09580] SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2602.09580: SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows

Computer Science > Robotics arXiv:2602.09580 (cs) [Submitted on 10 Feb 2026 (v1), last revised 5 Apr 2026 (this version, v3)] Title:SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows Authors:Chenyu Yang, Denis Tarasov, Davide Liconti, Hehui Zheng, Robert K. Katzschmann View a PDF of the paper titled SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows, by Chenyu Yang and 4 other authors View PDF HTML (experimental) Abstract:Real-world fine-tuning of dexterous manipulation policies remains challenging due to limited real-world interaction budgets and highly multimodal action distributions. Diffusion-based policies, while expressive, do not permit conservative likelihood-based updates during fine-tuning because action probabilities are intractable. In contrast, conventional Gaussian policies collapse under multimodality, particularly when actions are executed in chunks, and standard per-step critics fail to align with chunked execution, leading to poor credit assignment. We present SERFN, a sample-efficient off-policy fine-tuning framework with normalizing flow (NF) to address these challenges. The normalizing flow policy yields exact likelihoods for multimodal action chunks, allowing conservative, stable policy updates through likelihood regularization and thereby improving sample efficiency. An action-chunked critic evaluates entire action sequences, al...

Originally published on April 07, 2026. Curated by AI News.

Related Articles

The Download: DeepSeek’s latest AI breakthrough, and the race to build world models | MIT Technology Review
Machine Learning

The Download: DeepSeek’s latest AI breakthrough, and the race to build world models | MIT Technology Review

China has blocked Meta’s $2 billion acquisition of AI startup Manus.

MIT Technology Review · 6 min ·
Machine Learning

Maths vs machine learning publishing venues [D]

I am a research mathematician that has recently written a (in my opinion) pretty neat paper in theoretical computer science that is proba...

Reddit - Machine Learning · 1 min ·
The AI-designed car is taking shape | The Verge
Machine Learning

The AI-designed car is taking shape | The Verge

Automakers like GM are using AI tools to speed up the design process so they can get cars developed quicker. But will it lead to job losses?

The Verge - AI · 8 min ·
Llms

I tested the same prompt across multiple AI models… the differences surprised me

I’ve been experimenting with different AI models lately (ChatGPT, Claude, etc.), and I tried something simple: Using the exact same promp...

Reddit - Artificial Intelligence · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime