[2604.06240] The Art of Building Verifiers for Computer Use Agents
About this article
Abstract page for arXiv paper 2604.06240: The Art of Building Verifiers for Computer Use Agents
Computer Science > Cryptography and Security arXiv:2604.06240 (cs) [Submitted on 5 Apr 2026] Title:The Art of Building Verifiers for Computer Use Agents Authors:Corby Rosset, Pratyusha Sharma, Andrew Zhao, Miguel Gonzalez-Fernandez, Ahmed Awadallah View a PDF of the paper titled The Art of Building Verifiers for Computer Use Agents, by Corby Rosset and 4 other authors View PDF HTML (experimental) Abstract:Verifying the success of computer use agent (CUA) trajectories is a critical challenge: without reliable verification, neither evaluation nor training signal can be trusted. In this paper, we present lessons learned from building a best-in-class verifier for web tasks we call the Universal Verifier. We design the Universal Verifier around four key principles: 1) constructing rubrics with meaningful, non-overlapping criteria to reduce noise; 2) separating process and outcome rewards that yield complementary signals, capturing cases where an agent follows the right steps but gets blocked or succeeds through an unexpected path; 3) distinguishing between controllable and uncontrollable failures scored via a cascading-error-free strategy for finer-grained failure understanding; and 4) a divide-and-conquer context management scheme that attends to all screenshots in a trajectory, improving reliability on longer task horizons. We validate these findings on CUAVerifierBench, a new set of CUA trajectories with both process and outcome human labels, showing that our Universal Verif...