[2405.17573] Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets
arXiv:2405.17573 (stat) — Statistics > Machine Learning
[Submitted on 27 May 2024 (v1), last revised 25 Mar 2026 (this version, v3)]

Title: Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets
Authors: Arthur Jacot, Alexandre Kaiser

Abstract: We study Leaky ResNets, which interpolate between ResNets and fully-connected nets depending on an 'effective depth' hyper-parameter $\tilde{L}$. In the infinite-depth limit, we study 'representation geodesics' $A_{p}$: continuous paths in representation space (similar to NeuralODEs) from input $p=0$ to output $p=1$ that minimize the parameter norm of the network. We give a Lagrangian and Hamiltonian reformulation, which highlights the importance of two terms: a kinetic energy, which favors small layer derivatives $\partial_{p}A_{p}$, and a potential energy, which favors low-dimensional representations, as measured by the 'Cost of Identity'. The balance between these two forces offers an intuitive understanding of feature learning in ResNets. We leverage this intuition to explain the emergence of a bottleneck structure, as observed in previous work: for large $\tilde{L}$ the potential energy dominates and leads to a separation of timescales, where the representation jumps rapidly from the high-dimensional inputs to a low-dimensional represent...
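The interpolation between ResNets and fully-connected nets that the abstract describes can be sketched numerically. The parameterization below — a convex mix between the identity (skip path) and a full layer map, with mixing weight derived from the effective depth $\tilde{L}$ — is an illustrative assumption for intuition, not necessarily the paper's exact definition:

```python
import numpy as np

def leaky_resnet_forward(x, weights, L_tilde):
    """Illustrative 'leaky' residual forward pass (assumed parameterization).

    Each of the L layers applies a convex combination of the identity
    (ResNet-style skip connection) and a full nonlinear layer map
    (fully-connected style). The mixing weight alpha = L_tilde / L plays
    the role of an effective per-layer step: alpha -> 0 keeps the
    representation path close to the input (ResNet-like small increments),
    while alpha = 1 recovers a plain fully-connected net.
    """
    L = len(weights)
    alpha = min(L_tilde / L, 1.0)
    A = x
    for W in weights:
        A = (1 - alpha) * A + alpha * np.maximum(W @ A, 0.0)  # ReLU layer map
    return A

rng = np.random.default_rng(0)
d, L = 8, 16
weights = [rng.normal(scale=1.0 / np.sqrt(d), size=(d, d)) for _ in range(L)]
x = rng.normal(size=d)

# Small effective depth: representation stays near the input path.
out_resnet_like = leaky_resnet_forward(x, weights, L_tilde=0.5)
# Effective depth equal to L: every layer fully replaces the representation.
out_fc_like = leaky_resnet_forward(x, weights, L_tilde=float(L))
```

Under this (assumed) parameterization, $\tilde{L} = 0$ reduces each layer to the identity, so the "Cost of Identity" potential in the abstract can be read as the price of deviating from this free skip path.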