[2603.22918] EVA: Efficient Reinforcement Learning for End-to-End Video Agent
Computer Science > Computer Vision and Pattern Recognition
arXiv:2603.22918 (cs)
[Submitted on 24 Mar 2026]

Title: EVA: Efficient Reinforcement Learning for End-to-End Video Agent
Authors: Yaolun Zhang, Ruohui Wang, Jiahao Wang, Yepeng Tang, Xuanyu Zheng, Haonan Duan, Hao Lu, Hanming Deng, Lewei Lu

Abstract: Video understanding with multimodal large language models (MLLMs) remains challenging due to the long token sequences of videos, which contain extensive temporal dependencies and redundant frames. Existing approaches typically treat MLLMs as passive recognizers, processing entire videos or uniformly sampled frames without adaptive reasoning. Recent agent-based methods introduce external tools, yet still depend on manually designed workflows and perception-first strategies, resulting in inefficiency on long videos. We present EVA, an Efficient Reinforcement Learning framework for an End-to-End Video Agent, which enables planning-before-perception through iterative summary-plan-action-reflection reasoning. EVA autonomously decides what to watch, when to watch, and how to watch, achieving query-driven and efficient video understanding. To train such agents, we design a simple yet effective three-stage learning pipeline comprising supervised fine-tuning (SFT), Kahneman-Tversky Optimization (KTO), and Generalized Rewar...
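The summary-plan-action-reflection loop described in the abstract can be sketched as a minimal toy agent. Everything below is an illustrative assumption, not the paper's actual interface: `run_video_agent`, the frame-selection policy, and the string-matching "reflection" stand in for the learned MLLM policy that EVA trains with reinforcement learning.

```python
def run_video_agent(query, video_frames, perceive, max_steps=4):
    """Toy summary-plan-action-reflection loop (hypothetical sketch).

    Instead of reading every frame, the agent iteratively summarizes
    what it has seen, plans which frame to watch next, perceives it
    (action), and reflects on whether the query is answered.
    """
    history = []                         # (frame_index, observation) pairs
    unseen = list(range(len(video_frames)))
    for _ in range(max_steps):
        # summarize: condense observations so far into a short state
        summary = "; ".join(f"frame {i}: {o}" for i, o in history)
        if not unseen:
            break
        # plan: a trivial stand-in policy that picks the middle unseen
        # frame; in EVA this choice would come from the trained MLLM
        target = unseen[len(unseen) // 2]
        unseen.remove(target)
        # action: actually look at the chosen frame
        observation = perceive(video_frames[target])
        history.append((target, observation))
        # reflection: stop early once the observation addresses the query
        if query.lower() in observation.lower():
            break
    return history
```

With frames represented as caption strings and `perceive` as the identity function, the loop stops as soon as a relevant frame is observed, illustrating the query-driven early exit that makes planning-before-perception cheaper than uniform sampling.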