[2604.01329] Model Merging via Data-Free Covariance Estimation

[2604.01329] Model Merging via Data-Free Covariance Estimation

arXiv - Machine Learning 3 min read

About this article

Abstract page for arXiv paper 2604.01329: Model Merging via Data-Free Covariance Estimation

Computer Science > Machine Learning arXiv:2604.01329 (cs) [Submitted on 1 Apr 2026] Title:Model Merging via Data-Free Covariance Estimation Authors:Marawan Gamal Abdel Hameed, Derek Tam, Pascal Jr Tikeng Notsawo, Colin Raffel, Guillaume Rabusseau View a PDF of the paper titled Model Merging via Data-Free Covariance Estimation, by Marawan Gamal Abdel Hameed and Derek Tam and Pascal Jr Tikeng Notsawo and Colin Raffel and Guillaume Rabusseau View PDF Abstract:Model merging provides a way of cheaply combining individual models to produce a model that inherits each individual's capabilities. While some merging methods can approach the performance of multitask training, they are often heuristically motivated and lack theoretical justification. A principled alternative is to pose model merging as a layer-wise optimization problem that directly minimizes interference between tasks. However, this formulation requires estimating per-layer covariance matrices from data, which may not be available when performing merging. In contrast, many of the heuristically-motivated methods do not require auxiliary data, making them practically advantageous. In this work, we revisit the interference minimization framework and show that, under certain conditions, covariance matrices can be estimated directly from difference matrices, eliminating the need for data while also reducing computational costs. We validate our approach across vision and language benchmarks on models ranging from 86M parame...

Originally published on April 03, 2026. Curated by AI News.

Related Articles

Google quietly launched an AI dictation app that works offline | TechCrunch
Machine Learning

Google quietly launched an AI dictation app that works offline | TechCrunch

Google's new offline-first dictation app uses Gemma AI models to take on the apps like Wispr Flow.

TechCrunch - AI · 4 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
CONESTOGA COLLEGE Robots deepen AI and data analytics training for Conestoga students
Machine Learning

CONESTOGA COLLEGE Robots deepen AI and data analytics training for Conestoga students

AI News - General · 5 min ·
Alabama A&M University chosen for Amazon Web Services AI training program
Machine Learning

Alabama A&M University chosen for Amazon Web Services AI training program

AI News - General · 2 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime