[2604.04917] Vero: An Open RL Recipe for General Visual Reasoning
Computer Science > Computer Vision and Pattern Recognition

arXiv:2604.04917 (cs) [Submitted on 6 Apr 2026]

Title: Vero: An Open RL Recipe for General Visual Reasoning

Authors: Gabriel Sarch, Linrong Cai, Qunzhong Wang, Haoyang Wu, Danqi Chen, Zhuang Liu

Abstract: What does it take to build a visual reasoner that works across charts, science, spatial understanding, and open-ended tasks? The strongest vision-language models (VLMs) show that such broad visual reasoning is within reach, but the recipe behind them remains unclear, locked behind proprietary reinforcement learning (RL) pipelines with non-public data. We introduce Vero, a family of fully open VLMs that matches or exceeds existing open-weight models across diverse visual reasoning tasks. We scale RL data and rewards across six broad task categories, constructing Vero-600K, a 600K-sample dataset drawn from 59 datasets, and designing task-routed rewards that handle heterogeneous answer formats. Vero achieves state-of-the-art performance, improving over four base models by 3.7-5.5 points on average across VeroEval, our suite of 30 challenging benchmarks. Starting from Qwen3-VL-8B-Instruct, Vero outperforms Qwen3-VL-8B-Thinking on 23 of 30 benchmarks without additional proprietary thinking data. When trained from the same base model, Vero-600K exceeds existing RL datasets across task ...
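The abstract mentions "task-routed rewards that handle heterogeneous answer formats." As a minimal illustrative sketch (not the paper's implementation; all function names and scoring rules below are hypothetical assumptions), such a scheme can dispatch each sample's model answer to a reward function chosen by the sample's task category:

```python
# Hypothetical sketch of task-routed rewards: each RL sample carries a task
# category, and a router maps it to a category-specific reward function.
# The categories and scoring rules here are illustrative, not from the paper.

def reward_multiple_choice(pred: str, gold: str) -> float:
    # Exact-letter match for choice-style answers (e.g. "B").
    return 1.0 if pred.strip().upper() == gold.strip().upper() else 0.0

def reward_numeric(pred: str, gold: str, tol: float = 1e-2) -> float:
    # Tolerant numeric comparison for chart/science answers.
    try:
        return 1.0 if abs(float(pred) - float(gold)) <= tol else 0.0
    except ValueError:
        return 0.0

def reward_free_form(pred: str, gold: str) -> float:
    # Crude token-overlap score for open-ended answers; a real system
    # would likely use a stronger matcher or judge model.
    p, g = set(pred.lower().split()), set(gold.lower().split())
    return len(p & g) / max(len(g), 1)

ROUTER = {
    "choice": reward_multiple_choice,
    "numeric": reward_numeric,
    "open": reward_free_form,
}

def routed_reward(task: str, pred: str, gold: str) -> float:
    # Dispatch on the task category attached to the training sample.
    return ROUTER[task](pred, gold)

print(routed_reward("choice", "b", "B"))  # case-insensitive match -> 1.0
```

The design point is that a single verifier cannot score a letter choice, a number read off a chart, and a free-form explanation with one rule; routing lets each format keep an appropriate reward without changing the RL loop.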