[2602.22190] GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

[2602.22190] GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

arXiv - Machine Learning 4 min read Article

Summary

The paper presents GUI-Libra, a novel training approach for native GUI agents that enhances reasoning and action capabilities through action-aware supervision and partially verifiable reinforcement learning.

Why It Matters

This research addresses significant gaps in the development of open-source GUI agents, particularly in their ability to perform complex navigation tasks. By introducing a curated dataset and innovative training techniques, the study aims to improve the effectiveness of GUI agents, which is crucial for advancing AI applications in user interface automation and interaction.

Key Takeaways

  • GUI-Libra improves the performance of native GUI agents in long-horizon navigation tasks.
  • The study introduces a curated dataset of 81K GUI reasoning instances to enhance training.
  • Action-aware supervised fine-tuning (SFT) is proposed to better align reasoning with actions.
  • KL regularization is identified as critical for stabilizing reinforcement learning under partial verifiability.
  • The results indicate that effective data curation can enhance task-solving capabilities without extensive online data collection.

Computer Science > Machine Learning arXiv:2602.22190 (cs) [Submitted on 25 Feb 2026] Title:GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL Authors:Rui Yang, Qianhui Wu, Zhaoyang Wang, Hanyang Chen, Ke Yang, Hao Cheng, Huaxiu Yao, Baoling Peng, Huan Zhang, Jianfeng Gao, Tong Zhang View a PDF of the paper titled GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL, by Rui Yang and 10 other authors View PDF HTML (experimental) Abstract:Open-source native GUI agents still lag behind closed-source systems on long-horizon navigation tasks. This gap stems from two limitations: a shortage of high-quality, action-aligned reasoning data, and the direct adoption of generic post-training pipelines that overlook the unique challenges of GUI agents. We identify two fundamental issues in these pipelines: (i) standard SFT with CoT reasoning often hurts grounding, and (ii) step-wise RLVR-tyle training faces partial verifiability, where multiple actions can be correct but only a single demonstrated action is used for verification. This makes offline step-wise metrics weak predictors of online task success. In this work, we present GUI-Libra, a tailored training recipe that addresses these challenges. First, to mitigate the scarcity of action-aligned reasoning data, we introduce a data construction and filtering pipeline and release a curated 81K GUI reasoning datase...

Related Articles

Machine Learning

[P] Unix philosophy for ML pipelines: modular, swappable stages with typed contracts

We built an open-source prototype that applies Unix philosophy to retrieval pipelines. Each stage (PII redaction, chunking, dedup, embedd...

Reddit - Machine Learning · 1 min ·
Machine Learning

Making an AI native sovereign computational stack

I’ve been working on a personal project that ended up becoming a kind of full computing stack: identity / trust protocol decentralized ch...

Reddit - Artificial Intelligence · 1 min ·
Llms

An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

https://shapingrooms.com/research I published a paper today on something I've been calling postural manipulation. The short version: ordi...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

What tools are sr MLEs using? (clawdbot, openspec, wispr) [D]

I'm already blasting cursor, but I want to level up my output. I heard that these kind of AI tools and workflows are being asked in SF. W...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime