[2603.04737] Interactive Benchmarks
arXiv:2603.04737 (cs) [Submitted on 5 Mar 2026]

Computer Science > Artificial Intelligence

Title: Interactive Benchmarks
Authors: Baoqing Yue, Zihan Zhu, Yifan Zhang, Jichen Feng, Hufei Yang, Mengdi Wang

Abstract: Standard benchmarks have become increasingly unreliable due to saturation, subjectivity, and poor generalization. We argue that evaluating a model's ability to acquire information actively is essential to assessing its intelligence. We propose Interactive Benchmarks, a unified evaluation paradigm that assesses a model's reasoning ability in an interactive process under budget constraints. We instantiate this framework in two settings: Interactive Proofs, where models interact with a judge to deduce objective truths or answers in logic and mathematics; and Interactive Games, where models reason strategically to maximize long-horizon utilities. Our results show that interactive benchmarks provide a robust and faithful assessment of model intelligence, revealing substantial room for improvement in interactive scenarios. Project page: this https URL

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2603.04737 [cs.AI] (or arXiv:2603.04737v1 [cs.AI] for this version)
DOI: https://doi.org/10.48550/arXiv.2603.04737 (arXiv-issued DOI via DataCite)
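The abstract describes evaluation as an interactive process in which a model queries a judge under a budget constraint before committing to an answer. The following is a minimal sketch of such a loop, not the paper's implementation: the `model`, `judge`, and `grade` callables and the "ANSWER:" convention are hypothetical placeholders assumed for illustration.

```python
# Minimal sketch (assumed, not from the paper) of an interactive evaluation
# episode under a query budget: the model may ask the judge a limited number
# of questions, then must commit to a final answer, which is graded.

from typing import Callable

def run_interactive_episode(
    model: Callable[[list[tuple[str, str]]], str],  # dialogue history -> next query or "ANSWER: ..."
    judge: Callable[[str], str],                    # query -> judge's reply
    grade: Callable[[str], bool],                   # final answer -> correct or not
    budget: int = 10,                               # maximum number of judge interactions
) -> bool:
    """Return True if the model answers correctly within the interaction budget."""
    history: list[tuple[str, str]] = []
    for _ in range(budget):
        move = model(history)
        if move.startswith("ANSWER:"):
            # The model commits early; remaining budget is unused.
            return grade(move.removeprefix("ANSWER:").strip())
        reply = judge(move)            # spend one unit of budget on a query
        history.append((move, reply))
    return False                       # budget exhausted without a committed answer
```

Under this framing, the score reflects both whether the model reaches the correct answer and whether it acquires the needed information within the allotted interaction budget.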