[2505.13820] Structured Agent Distillation for Large Language Model

[2505.13820] Structured Agent Distillation for Large Language Model

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2505.13820: Structured Agent Distillation for Large Language Model

Computer Science > Machine Learning arXiv:2505.13820 (cs) [Submitted on 20 May 2025 (v1), last revised 28 Mar 2026 (this version, v4)] Title:Structured Agent Distillation for Large Language Model Authors:Jun Liu, Zhenglun Kong, Peiyan Dong, Changdi Yang, Tianqi Li, Hao Tang, Geng Yuan, Wei Niu, Wenbin Zhang, Pu Zhao, Xue Lin, Dong Huang, Yanzhi Wang View a PDF of the paper titled Structured Agent Distillation for Large Language Model, by Jun Liu and 11 other authors View PDF HTML (experimental) Abstract:Large language models (LLMs) exhibit strong capabilities as decision-making agents by interleaving reasoning and actions, as seen in ReAct-style frameworks. Yet, their practical deployment is constrained by high inference costs and large model sizes. We propose Structured Agent Distillation, a framework that compresses large LLM-based agents into smaller student models while preserving both reasoning fidelity and action consistency. Unlike standard token-level distillation, our method segments trajectories into {[REASON]} and {[ACT]} spans, applying segment-specific losses to align each component with the teacher's behavior. This structure-aware supervision enables compact agents to better replicate the teacher's decision process. Experiments on ALFWorld, HotPotQA-ReAct, and WebShop show that our approach consistently outperforms token-level and imitation learning baselines, achieving significant compression with minimal performance drop. Scaling and ablation results furthe...

Originally published on March 31, 2026. Curated by AI News.

Related Articles

Claude AI Goes Down for Thousands of Users Wednesday, Downdetector Reports
Llms

Claude AI Goes Down for Thousands of Users Wednesday, Downdetector Reports

Claude AI faces an outage today as over 7,000 users report issues. Stay informed about the situation here.

AI Tools & Products · 6 min ·
Llms

ChatGPT meets coffee: Starbucks launches AI ordering tool

Starbucks has launched an AI ordering tool that integrates with ChatGPT, aiming to improve the customer experience by streamlining the or...

AI Tools & Products · 1 min ·
NFL mock draft 2026: ChatGPT AI gives the worst predictions you'll ever see
Llms

NFL mock draft 2026: ChatGPT AI gives the worst predictions you'll ever see

USA TODAY Sports features a mock draft for the 2026 NFL Draft created by ChatGPT AI, which is noted for being the worst mock draft ever p...

AI Tools & Products · 9 min ·
Gemini Mac app puts Google AI right in your workflow
Llms

Gemini Mac app puts Google AI right in your workflow

The new Gemini for Mac app integrates into your workflow for quicker and easier AI access, hopefully improving productivity.

AI Tools & Products · 9 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime