[2603.02951] CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

arXiv - Machine Learning March 04, 2026 4 min read

Computer Science > Machine Learning
arXiv:2603.02951 (cs)
Submitted on 3 Mar 2026

Title: CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning

Authors: Zhenquan Yao, Zitong Huang, Yihan Zeng, Jianhua Han, Hang Xu, Chun-Mei Feng, Jianwei Ma, Wangmeng Zuo

Abstract: Graphical User Interface (GUI) Agents, benefiting from recent advances in multimodal large language models (MLLM), have achieved significant development. However, due to the frequent updates of GUI applications, adapting to new tasks without forgetting old tasks in GUI continual learning remains an open problem. In this work, we reveal that while Supervised Fine-Tuning (SFT) facilitates fast adaptation, it often triggers knowledge overwriting, whereas Reinforcement Learning (RL) demonstrates an inherent resilience that shields prior interaction logic from erasure. Based on this insight, we propose a Continual GUI Learning (CGL) framework that dynamically balances adaptation efficiency and skill retention by enhancing the synergy between SFT and RL. Specifically, we introduce an SFT proportion adjustment mechanism guided by policy entropy to dynamically control the weight allocation between the SFT and RL training phases. To resolve explicit gradient interference, we further develop a specialized gradient surgery ...
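
The abstract names two mechanisms without giving their formulas: an entropy-guided SFT proportion and a gradient surgery step. The sketch below is only an illustration of how ideas of this kind are commonly wired together, not the paper's method. Everything in it is an assumption: the linear mapping from normalized policy entropy to an SFT weight (including its direction, where higher entropy means more SFT), the PCGrad-style projection standing in for the "specialized gradient surgery", and all function names (`sft_weight`, `project_conflict`, `combined_update`).

```python
import math


def policy_entropy(probs):
    """Shannon entropy (in nats) of a single action distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def sft_weight(probs, w_min=0.0, w_max=1.0):
    """Hypothetical schedule: map normalized entropy linearly to an SFT weight.

    Assumption (not from the paper): an uncertain policy (high entropy)
    leans more on SFT, a confident policy leans more on RL.
    """
    max_entropy = math.log(len(probs))
    normalized = policy_entropy(probs) / max_entropy if max_entropy > 0 else 0.0
    return w_min + (w_max - w_min) * normalized


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def project_conflict(g_sft, g_rl):
    """PCGrad-style projection, used here as a stand-in for gradient surgery.

    If the SFT gradient points against the RL gradient (negative dot
    product), remove its component along the RL gradient.
    """
    d = dot(g_sft, g_rl)
    norm_sq = dot(g_rl, g_rl)
    if d >= 0.0 or norm_sq == 0.0:
        return list(g_sft)  # no conflict: leave the SFT gradient unchanged
    scale = d / norm_sq
    return [s - scale * r for s, r in zip(g_sft, g_rl)]


def combined_update(g_sft, g_rl, probs):
    """Blend the projected SFT gradient with the RL gradient."""
    w = sft_weight(probs)
    g_sft_safe = project_conflict(g_sft, g_rl)
    return [w * s + (1.0 - w) * r for s, r in zip(g_sft_safe, g_rl)]
```

With a uniform two-action policy the assumed schedule puts all weight on the projected SFT gradient, and with a deterministic policy it puts all weight on the RL gradient. The actual schedule and projection rule should be taken from the paper itself.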

Originally published on March 04, 2026. Curated by AI News.
