[2510.13358] Adversarial Fine-tuning in Offline-to-Online Reinforcement Learning for Robust Robot Control
Computer Science > Robotics
arXiv:2510.13358 (cs)
[Submitted on 15 Oct 2025 (v1), last revised 27 Feb 2026 (this version, v2)]

Title: Adversarial Fine-tuning in Offline-to-Online Reinforcement Learning for Robust Robot Control
Authors: Shingo Ayabe, Hiroshi Kera, Kazuhiko Kawamoto

Abstract: Offline reinforcement learning enables sample-efficient policy acquisition without risky online interaction, yet policies trained on static datasets remain brittle under action-space perturbations such as actuator faults. This study introduces an offline-to-online framework that trains policies on clean data and then performs adversarial fine-tuning, where perturbations are injected into executed actions to induce compensatory behavior and improve resilience. A performance-aware curriculum further adjusts the perturbation probability during training via an exponential-moving-average signal, balancing robustness and stability throughout the learning process. Experiments on continuous-control locomotion tasks demonstrate that the proposed method consistently improves robustness over offline-only baselines and converges faster than training from scratch. Matching the fine-tuning and evaluation conditions yields the strongest robustness to action-space perturbations, while the adaptive curriculum strategy m...
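The abstract describes two mechanisms: perturbations injected into executed actions during online fine-tuning, and a curriculum that adapts the perturbation probability from an exponential-moving-average (EMA) of performance. The sketch below illustrates how such a loop might look, assuming a Gym-style environment and an off-the-shelf policy object with `act`/`update` methods; the perturbation model (random actuator dropout), the curriculum update rule, and all names such as `perturb_action` and `ema_beta` are illustrative assumptions, not the authors' implementation.

import random
import numpy as np

def perturb_action(action, fault_prob=0.5):
    """Hypothetical action-space perturbation: zero out random
    action dimensions to mimic actuator faults."""
    mask = np.random.rand(*action.shape) > fault_prob
    return action * mask

def adversarial_finetune(env, policy, episodes=1000,
                         p_init=0.1, p_min=0.0, p_max=0.5,
                         ema_beta=0.95, step_size=0.01):
    """Sketch of adversarial fine-tuning with an EMA-driven
    perturbation curriculum (assumed interfaces throughout)."""
    p = p_init           # current perturbation probability
    ema_return = None    # EMA of episode returns (curriculum signal)
    for _ in range(episodes):
        obs, ep_return, done = env.reset(), 0.0, False
        while not done:
            action = policy.act(obs)
            # Inject an action-space perturbation with probability p
            if random.random() < p:
                action = perturb_action(action)
            obs, reward, done, _ = env.step(action)
            policy.update(obs, action, reward, done)  # online RL update
            ep_return += reward
        # Performance-aware curriculum: track returns with an EMA and
        # raise p while performance holds, lower it when it degrades.
        if ema_return is None:
            ema_return = ep_return
        prev = ema_return
        ema_return = ema_beta * ema_return + (1 - ema_beta) * ep_return
        p += step_size if ema_return >= prev else -step_size
        p = min(max(p, p_min), p_max)
    return policy

Keeping the perturbation probability bounded and adjusting it only through the smoothed EMA signal, rather than raw episode returns, is one plausible way to realize the stability-robustness balance the abstract attributes to the curriculum.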