[2603.25810] ExVerus: Verus Proof Repair via Counterexample Reasoning

arXiv:2603.25810 [cs.PL], submitted on 26 Mar 2026

Authors: Jun Yang, Yuechun Sun, Yi Wu, Rodrigo Caridad, Yongwei Yuan, Jianan Yao, Shan Lu, Kexin Pei

Subjects: Programming Languages (cs.PL); Machine Learning (cs.LG)
ACM classes: D.2.4
Cite as: arXiv:2603.25810 [cs.PL] (or arXiv:2603.25810v1 [cs.PL] for this version)
DOI: https://doi.org/10.48550/arXiv.2603.25810 (arXiv-issued DOI via DataCite, pending registration)
Submission history: From Jun Yang...

Abstract: Large Language Models (LLMs) have shown promising results in automating formal verification. However, existing approaches treat proof generation as a static, end-to-end prediction over source code, relying on limited verifier feedback and lacking access to concrete program behaviors. We present EXVERUS, a counterexample-guided framework that enables LLMs to reason about proofs using behavioral feedback via counterexamples. When a proof fails, EXVERUS automatically generates and validates counterexamples, and then guides the LLM to generalize them into inductive invariants to block these failures. Our evaluation shows that EXVERUS significantly improves proof accuracy, robustness, and token efficiency over the state-of-the-art prompting-based Verus proof generator.
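To make the idea of a counterexample blocking a failed invariant concrete, here is a minimal toy sketch in Python. It is not the paper's implementation and does not use Verus or an LLM. The loop, the helper names (`step`, `guard`, `find_counterexample`, `weak_inv`, `strong_inv`) and the bounded brute-force search are all assumptions made for illustration. The sketch looks for a state where a candidate loop invariant holds before an iteration but fails after it, then shows that a strengthened invariant rules that state out.

```python
from typing import Callable, Optional, Tuple

# Hypothetical toy loop (not from the paper):
#   i = 0; total = 0
#   while i < n: total += i; i += 1
# A state is the tuple (n, i, total).
State = Tuple[int, int, int]


def guard(s: State) -> bool:
    """Loop condition: i < n."""
    n, i, _ = s
    return i < n


def step(s: State) -> State:
    """One loop iteration: total += i, then i += 1."""
    n, i, total = s
    return (n, i + 1, total + i)


def find_counterexample(inv: Callable[[State], bool], bound: int = 6) -> Optional[State]:
    """Bounded search for a counterexample to inductiveness: a state where
    `inv` and the guard hold, but `inv` fails after one iteration.
    The state may be unreachable; that is exactly why the invariant is too weak."""
    for n in range(bound):
        for i in range(bound):
            for total in range(bound * bound):
                s = (n, i, total)
                if inv(s) and guard(s) and not inv(step(s)):
                    return s
    return None


def weak_inv(s: State) -> bool:
    """Candidate invariant that only restates the goal property: total <= n * n."""
    n, _, total = s
    return total <= n * n


def strong_inv(s: State) -> bool:
    """Strengthened invariant: pins down i and total exactly.
    Together with i <= n it implies the goal total <= n * n."""
    n, i, total = s
    return i <= n and total == i * (i - 1) // 2


if __name__ == "__main__":
    print("counterexample for weak invariant:", find_counterexample(weak_inv))
    print("counterexample for strong invariant:", find_counterexample(strong_inv))
```

In this toy setting the strengthening step is written by hand. In the framework described in the abstract, that generalization is what the LLM is guided to produce from validated counterexamples.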

Originally published on March 30, 2026. Curated by AI News.
