[2602.17385] Dataless Weight Disentanglement in Task Arithmetic via Kronecker-Factored Approximate Curvature



Summary

This paper proposes a dataless method for disentangling task vectors in task arithmetic. It regularizes against representation drift using Kronecker-Factored Approximate Curvature (KFAC), reducing cross-task interference without requiring external task data.

Why It Matters

Combining several task vectors in one foundation model can cause cross-task interference, and existing representation drift regularizers typically need external task data, which conflicts with modularity and with data availability constraints such as privacy requirements. The proposed regularizer needs no task data, has constant complexity in the number of tasks, and is robust to task vector rescaling, which removes the need for held-out tuning.

Key Takeaways

  • Introduces a dataless method for disentangling task vectors.
  • Uses Kronecker-Factored Approximate Curvature (KFAC) to build the regularizer.
  • Achieves state-of-the-art results in task addition and negation.
  • Promotes robustness against task vector rescaling.
  • Maintains constant complexity regardless of the number of tasks.
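
For readers new to the terminology, task addition, task negation, and task vector rescaling can be illustrated with a minimal sketch in plain Python. It shows only the basic arithmetic that the paper builds on, not the paper's regularizer. The helper names (`task_vector`, `apply_task_vectors`, `negate`), the scaling coefficient `alpha`, and the toy weights are all assumptions made up for this illustration.

```python
from typing import Dict, List

# A "model" here is just a mapping from parameter name to a flat list of floats.
Weights = Dict[str, List[float]]


def task_vector(pretrained: Weights, finetuned: Weights) -> Weights:
    """Task vector = fine-tuned weights minus pretrained weights."""
    return {
        name: [f - p for p, f in zip(pretrained[name], finetuned[name])]
        for name in pretrained
    }


def negate(vector: Weights) -> Weights:
    """Flip the sign of a task vector (used for task negation)."""
    return {name: [-d for d in deltas] for name, deltas in vector.items()}


def apply_task_vectors(
    pretrained: Weights, vectors: List[Weights], alpha: float = 1.0
) -> Weights:
    """Add each task vector, rescaled by alpha, to the pretrained weights."""
    merged = {name: list(values) for name, values in pretrained.items()}
    for vector in vectors:
        for name, deltas in vector.items():
            merged[name] = [w + alpha * d for w, d in zip(merged[name], deltas)]
    return merged


# Toy weights (hypothetical values, one parameter tensor with three entries).
pretrained = {"layer.weight": [1.0, 2.0, 3.0]}
finetuned_a = {"layer.weight": [1.5, 2.0, 3.0]}
finetuned_b = {"layer.weight": [1.0, 2.0, 2.0]}

tau_a = task_vector(pretrained, finetuned_a)
tau_b = task_vector(pretrained, finetuned_b)

# Task addition: one model that carries both tasks.
added = apply_task_vectors(pretrained, [tau_a, tau_b])

# Task negation: subtract task A from the pretrained model.
negated = apply_task_vectors(pretrained, [negate(tau_a)])
```

In this toy case the two task vectors touch different entries, so adding them causes no interference. Real task vectors overlap, which is where the cross-task interference described in the abstract comes from, and `alpha` is the kind of rescaling coefficient that is usually tuned on held-out data.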

Computer Science > Artificial Intelligence
arXiv:2602.17385 (cs) [Submitted on 19 Feb 2026]

Title: Dataless Weight Disentanglement in Task Arithmetic via Kronecker-Factored Approximate Curvature

Authors: Angelo Porrello, Pietro Buzzega, Felix Dangel, Thomas Sommariva, Riccardo Salami, Lorenzo Bonicelli, Simone Calderara

Abstract: Task Arithmetic yields a modular, scalable way to adapt foundation models. Combining multiple task vectors, however, can lead to cross-task interference, causing representation drift and degraded performance. Representation drift regularization provides a natural remedy to disentangle task vectors; however, existing approaches typically require external task data, conflicting with modularity and data availability constraints (e.g., privacy requirements). We propose a dataless approach by framing regularization against representation drift as a curvature matrix approximation problem. This allows us to leverage well-established techniques; in particular, we adopt Kronecker-Factored Approximate Curvature and obtain a practical regularizer that achieves state-of-the-art results in task addition and negation. Our method has constant complexity in the number of tasks and promotes robustness to task vector rescaling, eliminating the need for held-out tuning.

Commen...
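
The abstract does not spell out the regularizer, so what follows is only a sketch of the standard identities that link representation drift to a Kronecker-factored curvature matrix, written for a single linear layer. The symbols $W$, $\Delta W$, $a$, $g$, $A$, and $G$ are assumptions introduced here for illustration and are not notation taken from the paper.

```latex
% Single linear layer y = W a, perturbed by \Delta W
% (for example, another task's vector restricted to this layer).

% Expected drift of the layer output, with A the input second-moment matrix:
\mathbb{E}_{a}\,\bigl\lVert \Delta W\, a \bigr\rVert^{2}
  = \operatorname{tr}\!\bigl( \Delta W \, A \, \Delta W^{\top} \bigr),
\qquad A = \mathbb{E}\bigl[ a a^{\top} \bigr].

% KFAC approximates the layer's curvature block by a Kronecker product,
% with g the gradient with respect to the layer output:
F \approx A \otimes G,
\qquad G = \mathbb{E}\bigl[ g g^{\top} \bigr].

% Quadratic penalty under this approximation (column-stacking vec):
\operatorname{vec}(\Delta W)^{\top} \, (A \otimes G) \, \operatorname{vec}(\Delta W)
  = \operatorname{tr}\!\bigl( \Delta W^{\top} G \, \Delta W \, A \bigr).
```

Both factors are small per-layer matrices, so the right-hand side can be evaluated without ever forming the full Kronecker product.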
