[2604.09455] E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning

[2604.09455] E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning

arXiv - AI 3 min read

About this article

Abstract page for arXiv paper 2604.09455: E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning

Computer Science > Artificial Intelligence arXiv:2604.09455 (cs) [Submitted on 10 Apr 2026] Title:E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning Authors:Weiyang Guo, Zesheng Shi, Liye Zhao, Jiayuan Ma, Zeen Zhu, Junxian He, Min Zhang, Jing Li View a PDF of the paper titled E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning, by Weiyang Guo and 7 other authors View PDF HTML (experimental) Abstract:While Large Language Models (LLMs) have demonstrated significant potential in Tool-Integrated Reasoning (TIR), existing training paradigms face significant limitations: Zero-RL suffers from inefficient exploration and mode degradation due to a lack of prior guidance, while SFT-then-RL is limited by high data costs and capability plateaus caused by low-entropy collapse. To address these challenges, we propose E3-TIR (Enhanced Experience Exploitation), a warm-up paradigm for the early stages of agent training. Specifically, we formulate training as the dynamic integration of three experience types: Expert Prefixes, Expert Guided, and Self-Exploration. By executing diverse branching exploration around expert "anchors" and employing a mix policy optimization mechanism, we effectively mitigate distribution shifts and resolve optimization conflicts arising from shared prefixes. Our method dynamically adapts the model's knowledge boundaries, effectively balancing exploration diversity with training this http URL results demonstrate that E3-TIR a...

Originally published on April 13, 2026. Curated by AI News.

Related Articles

Llms

Transformer Math Explorer [P]

This is an interactive math reference for transformer models, presented via dataflow graphs, all the way down to elementary math. Covers ...

Reddit - Machine Learning · 1 min ·
Spotify wants to become the home for AI-generated personal audio | TechCrunch
Llms

Spotify wants to become the home for AI-generated personal audio | TechCrunch

Users will be able to create a podcast from Codex or Claude Code and import it to Spotify

TechCrunch - AI · 3 min ·
Llms

We built something ChatGPT doesn't do — AI that delivers results, not answers

Most AI gives you text. We built cards. Here's what I mean. When you ask LookMood Agent to find you a job, you don't get advice on where ...

Reddit - Artificial Intelligence · 1 min ·
Llms

I am not an "anti" like this guy, but still an interesting video of person interacting with chat 4o

(Posting Here because removed by Chatgpt Complaints moderators because the model here is 4o, and refuse to believe there were any safety ...

Reddit - Artificial Intelligence · 1 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime