[2603.27333] ComBench: A Repo-level Real-world Benchmark for Compilation Error Repair

[2603.27333] ComBench: A Repo-level Real-world Benchmark for Compilation Error Repair

arXiv - AI 4 min read

About this article

Abstract page for arXiv paper 2603.27333: ComBench: A Repo-level Real-world Benchmark for Compilation Error Repair

Computer Science > Software Engineering arXiv:2603.27333 (cs) [Submitted on 28 Mar 2026] Title:ComBench: A Repo-level Real-world Benchmark for Compilation Error Repair Authors:Jia Li, Zeyang Zhuang, Zhuangbin Chen, Yuxin Su, Wei Meng, Michael R. Lyu View a PDF of the paper titled ComBench: A Repo-level Real-world Benchmark for Compilation Error Repair, by Jia Li and 5 other authors View PDF HTML (experimental) Abstract:Compilation errors pose pervasive and critical challenges in software development, significantly hindering productivity. Therefore, Automated Compilation Error Repair (ACER) techniques are proposed to mitigate these issues. Despite recent advancements in ACER, its real-world performance remains poorly evaluated. This can be largely attributed to the limitations of existing benchmarks, \ie decontextualized single-file data, lack of authentic source diversity, and biased local task modeling that ignores crucial repository-level complexities. To bridge this critical gap, we propose ComBench, the first repository-level, reproducible real-world benchmark for C/C++ compilation error repair. ComBench is constructed through a novel, automated framework that systematically mines real-world failures from the GitHub CI histories of large-scale open-source projects. Our framework contributes techniques for the high-precision identification of ground-truth repair patches from complex version histories and a high-fidelity mechanism for reproducing the original, ephemeral ...

Originally published on March 31, 2026. Curated by AI News.

Related Articles

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED
Llms

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED

Plus: The FBI says a recent hack of its wiretap tools poses a national security risk, attackers stole Cisco source code as part of an ong...

Wired - AI · 9 min ·
Computer Vision

Best OCR for template-based form extraction? [D]

Hi, I’m working on a school project and I’m currently testing OCR tools for forms. The documents are mostly structured or semi-structured...

Reddit - Machine Learning · 1 min ·

Choosing between Gnani.ai (Startup), Bayer, and Yotta for AI Engineer role — looking for insights on growth, work quality, and long-term value

Hello everyone, I’m currently evaluating three offers for AI Engineer roles and would really appreciate some guidance from people with re...

Reddit - ML Jobs · 1 min ·
Artificial intelligence: a perspective from teaching and the learning sciences

Artificial intelligence: a perspective from teaching and the learning sciences

AI News - General · 9 min ·

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime