[2604.02459] On the Geometric Structure of Layer Updates in Deep Language Models
Computer Science > Machine Learning
arXiv:2604.02459 (cs)
[Submitted on 2 Apr 2026]

Title: On the Geometric Structure of Layer Updates in Deep Language Models
Authors: Jun-Sik Yoo

Abstract: We study the geometric structure of layer updates in deep language models. Rather than analyzing what information is encoded in intermediate representations, we ask how representations change from one layer to the next. We show that layerwise updates admit a decomposition into a dominant tokenwise component and a residual that is not captured by restricted tokenwise function classes. Across multiple architectures, including Transformers and state-space models, we find that the full layer update is almost perfectly aligned with the tokenwise component, while the residual exhibits substantially weaker alignment, larger angular deviation, and significantly lower projection onto the dominant tokenwise subspace. This indicates that the residual is not merely a small correction, but a geometrically distinct component of the transformation. This geometric separation has functional consequences: approximation error under the restricted tokenwise model is strongly associated with output perturbation, with Spearman correlations often exceeding 0.7 and reaching up to 0.95 in larger models. Together, these results suggest that most layerwise updates behave like ...
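The abstract does not spell out the measurement pipeline, but the quantities it names (a tokenwise fit, a residual, cosine alignment, projection onto the dominant tokenwise subspace, and a Spearman correlation with output perturbation) can be illustrated concretely. Below is a minimal NumPy sketch under one loudly stated assumption: that the restricted tokenwise class is a single linear map applied independently to each token and fit by least squares. The paper's actual function class and metrics may differ; the helper names (tokenwise_decomposition, cosine_alignment, projection_fraction) and the synthetic "output perturbation" are hypothetical, not taken from the paper.

import numpy as np
from scipy.stats import spearmanr

def tokenwise_decomposition(h_in, h_out):
    # h_in, h_out: (num_tokens, dim) hidden states entering/leaving a layer.
    # Assumption: the restricted tokenwise class is one linear map W applied
    # independently per token, fit by least squares (delta ~= h_in @ W).
    delta = h_out - h_in                        # full layer update
    W, *_ = np.linalg.lstsq(h_in, delta, rcond=None)
    tokenwise = h_in @ W                        # dominant tokenwise component
    residual = delta - tokenwise                # part the tokenwise fit misses
    return delta, tokenwise, residual

def cosine_alignment(a, b, eps=1e-12):
    # Per-token cosine similarity between two (num_tokens, dim) arrays.
    norms = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1) + eps
    return (a * b).sum(axis=1) / norms

def projection_fraction(x, basis_source, k=8, eps=1e-12):
    # Fraction of each row's norm lying in the top-k right singular
    # directions of basis_source (here: the tokenwise component).
    _, _, vt = np.linalg.svd(basis_source, full_matrices=False)
    proj = x @ vt[:k].T
    return np.linalg.norm(proj, axis=1) / (np.linalg.norm(x, axis=1) + eps)

# Toy usage on random arrays; a real analysis would use model activations.
rng = np.random.default_rng(0)
h_in = rng.standard_normal((256, 64))
h_out = h_in + 0.1 * rng.standard_normal((256, 64))

delta, tokenwise, residual = tokenwise_decomposition(h_in, h_out)
print("update vs tokenwise cosine:  ", cosine_alignment(delta, tokenwise).mean())
print("residual vs tokenwise cosine:", cosine_alignment(residual, tokenwise).mean())
print("residual projection fraction:", projection_fraction(residual, tokenwise).mean())

# Spearman correlation between per-token approximation error and output
# perturbation; the perturbation here is a synthetic stand-in, not paper data.
approx_err = np.linalg.norm(residual, axis=1)
output_pert = approx_err + 0.05 * rng.standard_normal(len(approx_err))
rho, _ = spearmanr(approx_err, output_pert)
print("Spearman rho:", rho)

On actual model activations, the pattern the abstract reports would appear as the first cosine being near 1, the residual's cosine and projection fraction being markedly lower, and a Spearman rho in the 0.7 to 0.95 range between approximation error and the true output perturbation.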