[2602.09580] SERNF: Sample-Efficient Real-World Dexterous Policy

[2602.09580] SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows

arXiv - Machine Learning April 07, 2026 4 min read

About this article

Abstract page for arXiv paper 2602.09580: SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows

Computer Science > Robotics arXiv:2602.09580 (cs) [Submitted on 10 Feb 2026 (v1), last revised 5 Apr 2026 (this version, v3)] Title:SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows Authors:Chenyu Yang, Denis Tarasov, Davide Liconti, Hehui Zheng, Robert K. Katzschmann View a PDF of the paper titled SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows, by Chenyu Yang and 4 other authors View PDF HTML (experimental) Abstract:Real-world fine-tuning of dexterous manipulation policies remains challenging due to limited real-world interaction budgets and highly multimodal action distributions. Diffusion-based policies, while expressive, do not permit conservative likelihood-based updates during fine-tuning because action probabilities are intractable. In contrast, conventional Gaussian policies collapse under multimodality, particularly when actions are executed in chunks, and standard per-step critics fail to align with chunked execution, leading to poor credit assignment. We present SERFN, a sample-efficient off-policy fine-tuning framework with normalizing flow (NF) to address these challenges. The normalizing flow policy yields exact likelihoods for multimodal action chunks, allowing conservative, stable policy updates through likelihood regularization and thereby improving sample efficiency. An action-chunked critic evaluates entire action sequences, al...

Originally published on April 07, 2026. Curated by AI News.

Machine Learning

The Download: DeepSeek’s latest AI breakthrough, and the race to build world models | MIT Technology Review

China has blocked Meta’s $2 billion acquisition of AI startup Manus.

MIT Technology Review · 6 min · about 1 hour ago

Machine Learning

Maths vs machine learning publishing venues [D]

I am a research mathematician that has recently written a (in my opinion) pretty neat paper in theoretical computer science that is proba...

Reddit - Machine Learning · 1 min · about 2 hours ago

Machine Learning

The AI-designed car is taking shape | The Verge

Automakers like GM are using AI tools to speed up the design process so they can get cars developed quicker. But will it lead to job losses?

The Verge - AI · 8 min · about 2 hours ago

Llms

I tested the same prompt across multiple AI models… the differences surprised me

I’ve been experimenting with different AI models lately (ChatGPT, Claude, etc.), and I tried something simple: Using the exact same promp...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

[2602.09580] SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows

About this article

Related Articles

The Download: DeepSeek’s latest AI breakthrough, and the race to build world models | MIT Technology Review

Maths vs machine learning publishing venues [D]

The AI-designed car is taking shape | The Verge

I tested the same prompt across multiple AI models… the differences surprised me

No comments

Stay updated with AI News