[2604.04898] QED-Nano: Teaching a Tiny Model to Prove Hard Theorems
Computer Science > Artificial Intelligence
arXiv:2604.04898 (cs)
[Submitted on 6 Apr 2026]

Title: QED-Nano: Teaching a Tiny Model to Prove Hard Theorems
Authors: LM-Provers, Yuxiao Qu, Amrith Setlur, Jasper Dekoninck, Edward Beeching, Jia Li, Ian Wu, Lewis Tunstall, Aviral Kumar

Abstract: Proprietary AI systems have recently demonstrated impressive capabilities on complex proof-based problems, with gold-level performance reported at the 2025 International Mathematical Olympiad (IMO). However, the training pipelines behind these systems remain largely undisclosed, and their reliance on large "internal" models and scaffolds makes them expensive to run, difficult to reproduce, and hard to study or improve upon. This raises a central question: can small, open models also be trained to achieve competitive reasoning performance on difficult Olympiad-level math? In this paper, we answer this question by building QED-Nano, a 4B model post-trained for Olympiad-level proofs. Our training recipe has three stages: (1) supervised fine-tuning to instill good proof-writing style by distilling from DeepSeek-Math-V2, (2) reinforcement learning (RL) with rubric-based rewards, and (3) expanding RL with a reasoning cache, which decomposes long proofs into iterative summarize-and-refine cycles and enables stronger test-time reasoning. QED-Nano surpas...
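The reasoning-cache idea in stage (3) — decomposing a long proof into iterative summarize-and-refine cycles — can be illustrated with a minimal sketch. This is a hypothetical reconstruction, not the paper's implementation: the functions `generate_step`, `summarize`, and `is_complete` are stand-ins for the model's proof-extension call, the cache-compression step, and a termination check (which in the paper would involve the rubric-based reward or a verifier).

```python
# Hypothetical sketch of a "reasoning cache" loop: each cycle the model
# refines the proof conditioned only on a compact cached summary, then
# the summary is updated. All three helpers below are stand-ins.

def generate_step(problem, cache):
    """Stand-in for the model extending the proof given the cached summary."""
    return f"step-{len(cache) + 1} for {problem}"

def summarize(cache, new_step):
    """Stand-in for compressing prior reasoning plus the newest step."""
    return cache + [new_step[:60]]  # keep only a short summary per step

def is_complete(cache, max_cycles):
    """Stand-in termination check (a rubric/verifier in the real system)."""
    return len(cache) >= max_cycles

def prove_with_cache(problem, max_cycles=3):
    """Iterative summarize-and-refine: because each cycle sees only the
    compact cache rather than the full transcript, the effective
    reasoning horizon can exceed the context window."""
    cache = []
    while not is_complete(cache, max_cycles):
        step = generate_step(problem, cache)   # refine using the cache
        cache = summarize(cache, step)         # fold step into the cache
    return cache

print(prove_with_cache("IMO-style inequality"))
```

The key design point this sketch highlights is that the context passed to each generation call is bounded by the cache size, not by the total length of the proof attempt.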