[2604.04281] Preservation Is Not Enough for Width Growth: Regime-Sensitive Selection of Dense LM Warm Starts

[2604.04281] Preservation Is Not Enough for Width Growth: Regime-Sensitive Selection of Dense LM Warm Starts

arXiv - AI 3 min read

About this article

Abstract page for arXiv paper 2604.04281: Preservation Is Not Enough for Width Growth: Regime-Sensitive Selection of Dense LM Warm Starts

Computer Science > Artificial Intelligence arXiv:2604.04281 (cs) [Submitted on 5 Apr 2026] Title:Preservation Is Not Enough for Width Growth: Regime-Sensitive Selection of Dense LM Warm Starts Authors:Eren Unlu View a PDF of the paper titled Preservation Is Not Enough for Width Growth: Regime-Sensitive Selection of Dense LM Warm Starts, by Eren Unlu View PDF HTML (experimental) Abstract:Width expansion offers a practical route to reuse smaller causal-language-model checkpoints, but selecting a widened warm start is not solved by zero-step preservation alone. We study dense width growth as a candidate-selection problem over full training states, including copied weights, optimizer moments, and scheduler state. In a small-scale TinyStories proxy, we compare exact-copy, perturbative, asymmetric-reset, and structured non-clone warm starts under matched continuation budgets. We evaluate zero-step preservation, short-lag probe metrics, and downstream continuation utility in deterministic and stochastic regimes. The picture is mixed and partially replicated through a reduced-pool seed-1 check. Exact-copy symmetric warm starts rank first in every completed 16-step probe and in the completed stochastic 128-step continuations at seed-0 steps 1000 and 2000 plus reduced seed-1 step 2000. By contrast, the structured non-clone challenger wins deterministic 128-step continuation. Early escape from the inherited cloned subspace is therefore not a universal selector: it helps in long deter...

Originally published on April 07, 2026. Curated by AI News.

Related Articles

AI training rolled out for Kilifi teachers, learners
Machine Learning

AI training rolled out for Kilifi teachers, learners

Education in Kilifi County has received a major boost following the rollout of an Artificial Intelligence (AI) training initiative target...

AI News - General · 5 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Improving AI models’ ability to explain their predictions
Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min ·
New technique makes AI models leaner and faster while they’re still learning
Machine Learning

New technique makes AI models leaner and faster while they’re still learning

AI News - General · 9 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime