[2603.28716] Dynamic Dual-Granularity Skill Bank for Agentic RL
About this article
Abstract page for arXiv paper 2603.28716: Dynamic Dual-Granularity Skill Bank for Agentic RL
Computer Science > Artificial Intelligence arXiv:2603.28716 (cs) [Submitted on 30 Mar 2026] Title:Dynamic Dual-Granularity Skill Bank for Agentic RL Authors:Songjun Tu, Chengdong Xu, Qichao Zhang, Yaocheng Zhang, Xiangyuan Lan, Linjing Li, Dongbin Zhao View a PDF of the paper titled Dynamic Dual-Granularity Skill Bank for Agentic RL, by Songjun Tu and 6 other authors View PDF HTML (experimental) Abstract:Agentic reinforcement learning (RL) can benefit substantially from reusable experience, yet existing skill-based methods mainly extract trajectory-level guidance and often lack principled mechanisms for maintaining an evolving skill memory. We propose D2Skill, a dynamic dual-granularity skill bank for agentic RL that organizes reusable experience into task skills for high-level guidance and step skills for fine-grained decision support and error correction. D2Skill jointly trains the policy and skill bank through paired baseline and skill-injected rollouts under the same policy, using their performance gap to derive hindsight utility signals for both skill updating and policy optimization. Built entirely from training-time experience, the skill bank is continuously expanded through reflection and maintained with utility-aware retrieval and pruning. Experiments on ALFWorld and WebShop with Qwen2.5-7B-Instruct and Qwen3-4B-Instruct-2507 show that D2Skill consistently improves success rates over skill-free baselines by 10-20 points. Further ablations and analyses show that bo...