[2602.20057] AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation
Summary
The paper presents AdaWorldPolicy, a novel framework for robotic manipulation that utilizes world models and online adaptive learning to enhance performance in dynamic environments.
Why It Matters
As robotics continues to evolve, the ability to adapt to real-world conditions with minimal human intervention is crucial. This research introduces a framework that not only improves robotic manipulation but also sets a precedent for future developments in adaptive learning and AI-driven robotics.
Key Takeaways
- AdaWorldPolicy integrates world models with adaptive learning for improved robotic manipulation.
- The framework employs interconnected Flow Matching Diffusion Transformers for enhanced feature exchange.
- It introduces an Online Adaptive Learning strategy that dynamically adjusts to environmental changes.
- The approach demonstrates state-of-the-art performance in both simulated and real-robot benchmarks.
- This research highlights the importance of minimal human involvement in robotic systems.
Computer Science > Robotics arXiv:2602.20057 (cs) [Submitted on 23 Feb 2026] Title:AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation Authors:Ge Yuan, Qiyuan Qiao, Jing Zhang, Dong Xu View a PDF of the paper titled AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation, by Ge Yuan and 3 other authors View PDF HTML (experimental) Abstract:Effective robotic manipulation requires policies that can anticipate physical outcomes and adapt to real-world environments. Effective robotic manipulation requires policies that can anticipate physical outcomes and adapt to real-world environments. In this work, we introduce a unified framework, World-Model-Driven Diffusion Policy with Online Adaptive Learning (AdaWorldPolicy) to enhance robotic manipulation under dynamic conditions with minimal human involvement. Our core insight is that world models provide strong supervision signals, enabling online adaptive learning in dynamic environments, which can be complemented by force-torque feedback to mitigate dynamic force shifts. Our AdaWorldPolicy integrates a world model, an action expert, and a force predictor-all implemented as interconnected Flow Matching Diffusion Transformers (DiT). They are interconnected via the multi-modal self-attention layers, enabling deep feature exchange for joint learning while preserving their distinct modularity characteristics. We further propose a ...