[2604.00821] Optimal Brain Decomposition for Accurate LLM Low-Rank Approximation

arXiv - Machine Learning April 02, 2026 3 min read

Computer Science > Machine Learning
arXiv:2604.00821 (cs)
[Submitted on 1 Apr 2026]

Title: Optimal Brain Decomposition for Accurate LLM Low-Rank Approximation
Authors: Yuhang Li, Donghyun Lee, Ruokai Yin, Priyadarshini Panda

Abstract: Low-rank decomposition has emerged as an important problem in Large Language Model (LLM) fine-tuning and inference. Through Singular Value Decomposition (SVD), the weight matrix can be factorized into low-rank spaces optimally. A common practice has been to decompose the weight in the activation-whitened space, which achieves satisfactory results. In this work, we propose Optimal Brain Decomposition LLM (OBD-LLM), which studies the decomposition problem in the model space by utilizing second-order Hessian information. Through a rigorous Kronecker-factorization of the Hessian, we show that the decomposition needs to consider both input and output information of the layer, and that doing so achieves much better decomposition results than input-only methods. Our loss-aware decomposition method involves a bi-directional whitening on the weight matrix. As a result, OBD-LLM is a closed-form solution for the optimal decomposition of weights in the language model. Remarkably, we achieve ~20-40% better results than the previous state-of-the-art decomposition method, SVD-LLM.

Subjects: Machine L...
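The contrast the abstract draws between input-only whitening and bi-directional whitening can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not give the algorithm, so the sketch assumes a Kronecker-factored Hessian of the form C_in ⊗ C_out, where C_in is an input (activation) second-moment matrix and C_out is an output-side second-moment matrix. Under that assumption the loss is ||L^T (W - AB) S||_F^2 with C_out = L L^T and C_in = S S^T, and a truncated SVD of L^T W S gives the minimizer. The names whitened_low_rank, weighted_loss, C_in, C_out, and eps are hypothetical and chosen for illustration only.

```python
import numpy as np

def whitened_low_rank(W, C_in, rank, C_out=None, eps=1e-6):
    """Rank-`rank` factors A, B with W ~= A @ B under a whitened norm.

    Assumption (not from the paper): the layer Hessian is C_in (x) C_out.
    With C_out=None only the input side is whitened (input-only baseline).
    """
    n_out, n_in = W.shape
    # Cholesky factors: C_in ~= S S^T, C_out ~= L L^T (eps keeps them positive definite).
    S = np.linalg.cholesky(C_in + eps * np.eye(n_in))
    if C_out is None:
        L = np.eye(n_out)
    else:
        L = np.linalg.cholesky(C_out + eps * np.eye(n_out))
    # Truncated SVD in the whitened space.
    U, s, Vt = np.linalg.svd(L.T @ W @ S, full_matrices=False)
    # Map the factors back to the original space.
    A = np.linalg.solve(L.T, U[:, :rank] * s[:rank])  # L^{-T} U_r diag(s_r)
    B = np.linalg.solve(S.T, Vt[:rank].T).T           # Vt_r S^{-1}
    return A, B

def weighted_loss(W, W_approx, C_in, C_out):
    """Quadratic loss tr(D^T C_out D C_in) for D = W - W_approx."""
    D = W - W_approx
    return float(np.trace(D.T @ C_out @ D @ C_in))

# Synthetic data standing in for calibration statistics (illustrative only).
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 10))
X = rng.standard_normal((10, 200)) * rng.uniform(0.1, 3.0, size=(10, 1))
G = rng.standard_normal((8, 200)) * rng.uniform(0.1, 3.0, size=(8, 1))
C_in = X @ X.T / X.shape[1]
C_out = G @ G.T / G.shape[1]

A_in, B_in = whitened_low_rank(W, C_in, rank=3)                 # input-only
A_bi, B_bi = whitened_low_rank(W, C_in, rank=3, C_out=C_out)    # both sides
loss_in = weighted_loss(W, A_in @ B_in, C_in, C_out)
loss_bi = weighted_loss(W, A_bi @ B_bi, C_in, C_out)
print(f"input-only: {loss_in:.4f}  bi-directional: {loss_bi:.4f}")
```

Under the assumed loss, the two-sided variant can never do worse than the input-only one at the same rank, which follows from the Eckart-Young theorem applied in the whitened space. How large the gap is on real LLM layers depends on the actual statistics and is not something this toy example can indicate.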

Originally published on April 02, 2026. Curated by AI News.
