[2604.03472] Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution
Computer Science > Computation and Language

arXiv:2604.03472 (cs) [Submitted on 3 Apr 2026]

Title: Vocabulary Dropout for Curriculum Diversity in LLM Co-Evolution
Authors: Jacob Dineen, Aswin RRV, Zhikun Xu, Ben Zhou

Abstract: Co-evolutionary self-play, where one language model generates problems and another solves them, promises autonomous curriculum learning without human supervision. In practice, however, the proposer quickly converges to a narrow distribution of problems that satisfy the reward function. This diversity collapse renders the curriculum uninformative for the solver, stalling the co-evolutionary loop. We introduce vocabulary dropout, a random mask applied to the proposer's output logits during both policy training and curriculum generation, as a lightweight mechanism for sustaining diversity. The mask is hard and non-stationary, preventing the proposer from locking into fixed token sequences. Training Qwen3-4B and Qwen3-8B on mathematical reasoning via R-Zero, we find that vocabulary dropout sustains proposer diversity across lexical, semantic, and functional metrics throughout training, and yields solver improvements averaging +4.4 points at 8B, with the largest gains on competition-level benchmarks. Our findings suggest that explicit action-space constraints, analogous to the structural role that game rules play in classical self-p...
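The core mechanism described in the abstract, a hard, non-stationary mask over the proposer's vocabulary, can be sketched in a few lines. This is an illustrative reconstruction, not the authors' implementation: the function name, the `drop_prob` parameter, and the use of NumPy are all assumptions; the paper applies the mask to a language model's output logits during training and generation.

```python
import numpy as np

def vocabulary_dropout(logits: np.ndarray, drop_prob: float,
                       rng: np.random.Generator) -> np.ndarray:
    """Hard-mask a random subset of the vocabulary in the output logits.

    Masked entries are set to -inf so those tokens have zero probability
    after softmax. Resampling the mask on every call makes it
    non-stationary, so the proposer cannot lock into fixed token
    sequences that exploit the reward function.

    Illustrative sketch only; a real implementation would likely exempt
    special tokens (EOS, padding) from the mask.
    """
    # One Bernoulli draw per vocabulary entry (shared across the batch here).
    mask = rng.random(logits.shape[-1]) < drop_prob
    masked = logits.copy()
    masked[..., mask] = -np.inf
    return masked
```

A fresh mask would be drawn at each proposer step (or each generated problem), so the set of reachable tokens keeps shifting, which is the "non-stationary" property the abstract emphasizes.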