[2603.25083] Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization

[2603.25083] Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization

arXiv - AI 4 min read

About this article

Abstract page for arXiv paper 2603.25083: Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization

Computer Science > Computer Vision and Pattern Recognition arXiv:2603.25083 (cs) [Submitted on 26 Mar 2026] Title:Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization Authors:Haoran Pei, Yuguang Yang, Kexin Liu, Juan Zhang, Baochang Zhang View a PDF of the paper titled Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization, by Haoran Pei and 4 other authors View PDF HTML (experimental) Abstract:Out-of-Distribution (OOD) generalization has become a primary metric for evaluating image analysis systems. Since deep learning models tend to capture domain-specific context, they often develop shortcut dependencies on these non-causal features, leading to inconsistent performance across different data sources. Current techniques, such as invariance learning, attempt to mitigate this. However, they struggle to isolate highly mixed features within deep latent spaces. This limitation prevents them from fully resolving the shortcut learning this http URL this paper, we propose Hierarchical Causal Dropout (HCD), a method that uses channel-level causal masks to enforce feature sparsity. This approach allows the model to separate causal features from spurious ones, effectively performing a causal intervention at the representation level. The training is guided by a Matrix-based Mutual Information (MMI) objective to minimize the mutual information between latent features and d...

Originally published on March 27, 2026. Curated by AI News.

Related Articles

Machine Learning

[P] I tested Meta’s brain-response model on posts. It predicted the Elon one almost perfectly.

I built an experimental UI and visualization layer around Meta’s open brain-response model just to see whether this stuff actually works ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] I trained an AI to play Resident Evil 4 Remake using Behavioral Cloning + LSTM

I recorded gameplay trajectories in RE4's village — running, shooting, reloading, dodging — and used Behavioral Cloning to train a model ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] Why does it seem like open source materials on ML are incomplete? this is not enough...

Many times when I try to deeply understand a topic in machine learning — whether it's a new architecture, a quantization method, a full t...

Reddit - Machine Learning · 1 min ·
Llms

[R] GPT-5.4-mini regressed 22pp on vanilla prompting vs GPT-5-mini. Nobody noticed because benchmarks don't test this. Recursive Language Models solved it.

GPT-5.4-mini produces shorter, terser outputs by default. Vanilla accuracy dropped from 69.5% to 47.2% across 12 tasks (1,800 evals). The...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime