[2604.00230] Neural Collapse Dynamics: Depth, Activation,

[2604.00230] Neural Collapse Dynamics: Depth, Activation, Regularisation, and Feature Norm Threshold

arXiv - Machine Learning April 02, 2026 4 min read

About this article

Abstract page for arXiv paper 2604.00230: Neural Collapse Dynamics: Depth, Activation, Regularisation, and Feature Norm Threshold

Computer Science > Machine Learning arXiv:2604.00230 (cs) [Submitted on 31 Mar 2026] Title:Neural Collapse Dynamics: Depth, Activation, Regularisation, and Feature Norm Threshold Authors:Anamika Paul Rupa View a PDF of the paper titled Neural Collapse Dynamics: Depth, Activation, Regularisation, and Feature Norm Threshold, by Anamika Paul Rupa View PDF HTML (experimental) Abstract:Neural collapse (NC) -- the convergence of penultimate-layer features to a simplex equiangular tight frame -- is well understood at equilibrium, but the dynamics governing its onset remain poorly characterised. We identify a simple and predictive regularity: NC occurs when the mean feature norm reaches a model-dataset-specific critical value, fn*, that is largely invariant to training conditions. This value concentrates tightly within each (model, dataset) pair (CV < 8%); training dynamics primarily affect the rate at which fn approaches fn*, rather than the value itself. In standard training trajectories, the crossing of fn below fn* consistently precedes NC onset, providing a practical predictor with a mean lead time of 62 epochs (MAE 24 epochs). A direct intervention experiment confirms fn* is a stable attractor of the gradient flow -- perturbations to feature scale are self-corrected during training, with convergence to the same value regardless of direction (p>0.2). Completing the (architecture)x(dataset) grid reveals the paper's strongest result: ResNet-20 on MNIST gives fn* = 5.867 -- a +4...

Originally published on April 02, 2026. Curated by AI News.

Llms

I can't help rooting for tiny open source AI model maker Arcee | TechCrunch

Arcee is a tiny 26-person U.S. startup that built a high-performing, massive, open source LLM. And it's gaining popularity with OpenClaw ...

TechCrunch - AI · 4 min · about 1 hour ago

Machine Learning

We have an AI agent fragmentation problem

Every AI agent works fine on its own — but the moment you try to use more than one, everything falls apart. Different runtimes. Different...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

Using AI properly

AI is a tool. Period. I spent decades asking forums for help in writing HTML code for my website. I wanted my posts to self-scroll to a p...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

Anthropic Teams Up With Its Rivals to Keep AI From Hacking Everything | WIRED

The AI lab's Project Glasswing will bring together Apple, Google, and more than 45 other organizations. They'll use the new Claude Mythos...

Wired - AI · 7 min · about 4 hours ago

[2604.00230] Neural Collapse Dynamics: Depth, Activation, Regularisation, and Feature Norm Threshold

About this article

Related Articles

I can't help rooting for tiny open source AI model maker Arcee | TechCrunch

We have an AI agent fragmentation problem

Using AI properly

Anthropic Teams Up With Its Rivals to Keep AI From Hacking Everything | WIRED

No comments

Stay updated with AI News