[2405.17573] Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets

[2405.17573] Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2405.17573: Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets

Statistics > Machine Learning arXiv:2405.17573 (stat) [Submitted on 27 May 2024 (v1), last revised 25 Mar 2026 (this version, v3)] Title:Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets Authors:Arthur Jacot, Alexandre Kaiser View a PDF of the paper titled Hamiltonian Mechanics of Feature Learning: Bottleneck Structure in Leaky ResNets, by Arthur Jacot and 1 other authors View PDF HTML (experimental) Abstract:We study Leaky ResNets, which interpolate between ResNets and Fully-Connected nets depending on an 'effective depth' hyper-parameter $\tilde{L}$. In the infinite depth limit, we study 'representation geodesics' $A_{p}$: continuous paths in representation space (similar to NeuralODEs) from input $p=0$ to output $p=1$ that minimize the parameter norm of the network. We give a Lagrangian and Hamiltonian reformulation, which highlight the importance of two terms: a kinetic energy which favors small layer derivatives $\partial_{p}A_{p}$ and a potential energy that favors low-dimensional representations, as measured by the 'Cost of Identity'. The balance between these two forces offers an intuitive understanding of feature learning in ResNets. We leverage this intuition to explain the emergence of a bottleneck structure, as observed in previous work: for large $\tilde{L}$ the potential energy dominates and leads to a separation of timescales, where the representation jumps rapidly from the high dimensional inputs to a low-dimensional represent...

Originally published on March 26, 2026. Curated by AI News.

Related Articles

Llms

🤖 AI News Digest - March 27, 2026

Today's AI news: 1. My minute-by-minute response to the LiteLLM malware attack The article describes a detailed, minute-by-minute respons...

Reddit - Artificial Intelligence · 1 min ·
Llms

[D] Real-time Student Attention Detection: ResNet vs Facial Landmarks - Which approach for resource-constrained deployment?

I have a problem statement where we are supposed to detect the attention level of student in a classroom, basically output whether he is ...

Reddit - Machine Learning · 1 min ·
Ai Infrastructure

[D] Building a demand forecasting system for multi-location retail with no POS integration, architecture feedback wanted

We’re building a lightweight demand forecasting engine on top of manually entered operational data. No POS integration, no external feeds...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
More in Ai Infrastructure: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime