[2604.00316] Breaking Data Symmetry is Needed For Generalization in Feature Learning Kernels
arXiv:2604.00316 (stat) [Submitted on 31 Mar 2026]

Title: Breaking Data Symmetry is Needed For Generalization in Feature Learning Kernels
Authors: Marcel Tomàs Bernal, Neil Rohit Mallinar, Mikhail Belkin

Abstract: Grokking occurs when a model achieves high training accuracy, yet generalization to unseen test points arrives only long afterward. The phenomenon was first observed on a class of algebraic problems, such as learning modular arithmetic (Power et al., 2022). We study grokking on algebraic tasks in a class of feature learning kernels via the Recursive Feature Machine (RFM) algorithm (Radhakrishnan et al., 2024), which iteratively updates feature matrices through the Average Gradient Outer Product (AGOP) of an estimator in order to learn task-relevant features. Our main experimental finding is that generalization occurs only when a certain symmetry in the training set is broken. Furthermore, we show empirically that RFM generalizes by recovering the invariance group action inherent in the data: the learned feature matrices encode specific elements of the invariance group, which explains why generalization depends on breaking the symmetry.

Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:2604.00316 [stat.ML]
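The RFM loop the abstract describes (fit a kernel predictor, then replace the feature matrix with the AGOP of that predictor, and repeat) can be sketched as below. This is a minimal illustration, not the authors' code: it assumes a Mahalanobis Gaussian kernel for simplicity (RFM is usually presented with a Laplace kernel), and the trace normalization of `M` is an assumed stabilizer, not part of the algorithm's definition.

```python
import numpy as np

def gaussian_kernel(X, Z, M, h=1.0):
    # Mahalanobis Gaussian kernel: exp(-||x - z||_M^2 / (2 h^2)),
    # where ||x - z||_M^2 = (x - z)^T M (x - z).
    XM = X @ M
    d2 = (np.sum(XM * X, axis=1)[:, None]
          - 2.0 * XM @ Z.T
          + np.sum((Z @ M) * Z, axis=1)[None, :])
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * h * h))

def agop(X_eval, X_train, alpha, M, h=1.0):
    # Average Gradient Outer Product of the kernel predictor
    # f(x) = sum_i alpha_i k(x, x_i). For the Gaussian kernel above,
    # grad f(x) = -(1/h^2) M sum_i alpha_i k(x, x_i) (x - x_i).
    K = gaussian_kernel(X_eval, X_train, M, h)           # (n_eval, n_train)
    W = K * alpha[None, :]                               # per-point weights
    G = W.sum(axis=1)[:, None] * X_eval - W @ X_train    # sum_i w_i (x - x_i)
    G = -(G @ M.T) / (h * h)                             # one gradient per row
    return G.T @ G / X_eval.shape[0]                     # (1/n) sum grad grad^T

def rfm(X, y, n_iters=5, reg=1e-3, h=1.0):
    """Recursive Feature Machine sketch: alternate kernel ridge
    regression with an AGOP update of the feature matrix M."""
    n, d = X.shape
    M = np.eye(d)  # start from the identity feature matrix
    for _ in range(n_iters):
        K = gaussian_kernel(X, X, M, h)
        alpha = np.linalg.solve(K + reg * np.eye(n), y)  # kernel ridge fit
        M = agop(X, X, alpha, M, h)                      # feature-matrix update
        M = M / np.trace(M) * d                          # assumed normalization
    return M, alpha
```

On a target that depends on a single coordinate, the learned `M` concentrates its mass on that coordinate, which is the sense in which the feature matrix picks out task-relevant directions.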