[2510.00504] A universal compression theory for lottery ticket hypothesis and neural scaling laws
Statistics > Machine Learning
arXiv:2510.00504 (stat)
[Submitted on 1 Oct 2025 (v1), last revised 2 Mar 2026 (this version, v2)]

Title: A universal compression theory for lottery ticket hypothesis and neural scaling laws
Authors: Hong-Yi Wang, Di Luo, Tomaso Poggio, Isaac L. Chuang, Liu Ziyin

Abstract: When training large-scale models, performance typically scales with the number of parameters and the dataset size according to a slow power law. A fundamental theoretical and practical question is whether comparable performance can be achieved with significantly smaller models and substantially less data. In this work, we provide a positive and constructive answer. We prove that a generic permutation-invariant function of $d$ objects can be asymptotically compressed into a function of $\operatorname{polylog} d$ objects with vanishing error, and that this is the optimal compression rate. This theorem yields two key implications: (Ia) a large neural network can be compressed to polylogarithmic width while preserving its learning dynamics; (Ib) a large dataset can be compressed to polylogarithmic size while leaving the loss landscape of the corresponding model unchanged. Implication (Ia) directly establishes a proof of the dynamical lottery ticket hypothesis, which states that any ordinary network can be strongly...
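For context on the "slow power law" mentioned above: a common empirical parametrization from the scaling-law literature (an illustrative form, not one stated in this abstract; the constants $E$, $A$, $B$ and exponents $\alpha$, $\beta$ are fitted quantities) writes the loss as

\[
L(N, D) \;\approx\; E + \frac{A}{N^{\alpha}} + \frac{B}{D^{\beta}},
\]

where $N$ is the number of parameters and $D$ the dataset size; the empirically small exponents $\alpha$, $\beta$ are what make the decay slow.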
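The central compression theorem can be read schematically as follows (a sketch in our own notation; the paper's precise meaning of "generic" and the norm in which the error vanishes are not specified in this abstract): for a permutation-invariant function $f$ of $d$ objects, there exist compressed objects $\tilde{x}_1, \dots, \tilde{x}_m$ and a function $g$ with

\[
f(x_1, \dots, x_d) = g(\tilde{x}_1, \dots, \tilde{x}_m) + \epsilon_d,
\qquad m = \operatorname{polylog} d,
\qquad \epsilon_d \to 0 \ \text{as } d \to \infty,
\]

and optimality means that no compression to $m = o(\operatorname{polylog} d)$ objects can keep the error $\epsilon_d$ vanishing.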