[2603.28716] Dynamic Dual-Granularity Skill Bank for Agentic RL

[2603.28716] Dynamic Dual-Granularity Skill Bank for Agentic RL

arXiv - AI 3 min read

About this article

Abstract page for arXiv paper 2603.28716: Dynamic Dual-Granularity Skill Bank for Agentic RL

Computer Science > Artificial Intelligence arXiv:2603.28716 (cs) [Submitted on 30 Mar 2026] Title:Dynamic Dual-Granularity Skill Bank for Agentic RL Authors:Songjun Tu, Chengdong Xu, Qichao Zhang, Yaocheng Zhang, Xiangyuan Lan, Linjing Li, Dongbin Zhao View a PDF of the paper titled Dynamic Dual-Granularity Skill Bank for Agentic RL, by Songjun Tu and 6 other authors View PDF HTML (experimental) Abstract:Agentic reinforcement learning (RL) can benefit substantially from reusable experience, yet existing skill-based methods mainly extract trajectory-level guidance and often lack principled mechanisms for maintaining an evolving skill memory. We propose D2Skill, a dynamic dual-granularity skill bank for agentic RL that organizes reusable experience into task skills for high-level guidance and step skills for fine-grained decision support and error correction. D2Skill jointly trains the policy and skill bank through paired baseline and skill-injected rollouts under the same policy, using their performance gap to derive hindsight utility signals for both skill updating and policy optimization. Built entirely from training-time experience, the skill bank is continuously expanded through reflection and maintained with utility-aware retrieval and pruning. Experiments on ALFWorld and WebShop with Qwen2.5-7B-Instruct and Qwen3-4B-Instruct-2507 show that D2Skill consistently improves success rates over skill-free baselines by 10-20 points. Further ablations and analyses show that bo...

Originally published on March 31, 2026. Curated by AI News.

Related Articles

Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money
Nlp

Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money

AI Tools & Products · 4 min ·
Open Source Ai

we just hit 555 stars on our open source AI agent config tool and i'm honestly still in shock

so a while back me and a few folks started working on Caliber, an open source tool for managing AI agent configs and syncing them with yo...

Reddit - Artificial Intelligence · 1 min ·
Robotics

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. S...

Reddit - Artificial Intelligence · 1 min ·
Robotics

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. S...

Reddit - Machine Learning · 1 min ·
More in Ai Agents: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime