[2507.04754] Intervening to Learn and Compose Causally Disentangled Representations

Statistics > Machine Learning
arXiv:2507.04754 (stat)
[Submitted on 7 Jul 2025 (v1), last revised 2 Apr 2026 (this version, v2)]

Title: Intervening to Learn and Compose Causally Disentangled Representations
Authors: Alex Markham, Isaac Hirsch, Jeri A. Chang, Liam Solus, Bryon Aragam

Abstract: In designing generative models, it is commonly believed that in order to learn useful latent structure, we face a fundamental tension between expressivity and structure. In this paper we challenge this view by proposing a new approach to training arbitrarily expressive generative models that simultaneously learn causally disentangled concepts. This is accomplished by adding a simple context module to an arbitrarily complex black-box model, which learns to process concept information by implicitly inverting linear representations from the model's encoder. Inspired by the notion of intervention in a causal model, our module selectively modifies its architecture during training, allowing it to learn a compact joint model over different contexts. We show how adding this module leads to causally disentangled representations that can be composed for out-of-distribution generation on both real and simulated data. The resulting models can be trained end-to-end or fine-tuned from pre-trained models. To further validate our proposed...

Originally published on April 03, 2026. Curated by AI News.

