[2602.00130] On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

[2602.00130] On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

arXiv - Machine Learning 3 min read

About this article

Abstract page for arXiv paper 2602.00130: On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

Computer Science > Machine Learning arXiv:2602.00130 (cs) [Submitted on 28 Jan 2026 (v1), last revised 3 Mar 2026 (this version, v2)] Title:On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks Authors:Sumit Yadav View a PDF of the paper titled On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks, by Sumit Yadav View PDF Abstract:We investigate the relationship between representation geometry and neural network performance. Analyzing 52 pretrained ImageNet models across 13 architecture families, we show that effective dimension -- an unsupervised geometric metric -- strongly predicts accuracy. Output effective dimension achieves partial r=0.75 ($p < 10^(-10)$) after controlling for model capacity, while total compression achieves partial r=-0.72. These findings replicate across ImageNet and CIFAR-10, and generalize to NLP: effective dimension predicts performance for 8 encoder models on SST-2/MNLI and 15 decoder-only LLMs on AG News (r=0.69, p=0.004), while model size does not (r=0.07). We establish bidirectional causality: degrading geometry via noise causes accuracy loss (r=-0.94, $p < 10^(-9)$), while improving geometry via PCA maintains accuracy across architectures (-0.03pp at 95% variance). This relationship is noise-type agnostic -- Gaussian, Uniform, Dropout, and Salt-and-pepper noise all show $|r| > 0.90$. These results establish that effective dimension provides domain-agnostic pre...

Originally published on March 04, 2026. Curated by AI News.

Related Articles

[2603.26683] LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval
Machine Learning

[2603.26683] LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval

Abstract page for arXiv paper 2603.26683: LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval

arXiv - AI · 4 min ·
[2603.26680] AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Alignment
Llms

[2603.26680] AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Alignment

Abstract page for arXiv paper 2603.26680: AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Align...

arXiv - AI · 4 min ·
[2603.26678] Power Couple? AI Growth and Renewable Energy Investment
Machine Learning

[2603.26678] Power Couple? AI Growth and Renewable Energy Investment

Abstract page for arXiv paper 2603.26678: Power Couple? AI Growth and Renewable Energy Investment

arXiv - AI · 4 min ·
[2603.26676] Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift
Machine Learning

[2603.26676] Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift

Abstract page for arXiv paper 2603.26676: Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift

arXiv - AI · 3 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime