[2603.26799] Gaussian Joint Embeddings For Self-Supervised Representation Learning
Computer Science > Machine Learning
arXiv:2603.26799 (cs)
[Submitted on 26 Mar 2026]

Title: Gaussian Joint Embeddings For Self-Supervised Representation Learning
Authors: Yongchao Huang

Abstract: Self-supervised representation learning often relies on deterministic predictive architectures to align context and target views in latent space. While effective in many settings, such methods are limited in genuinely multi-modal inverse problems, where squared-loss prediction collapses towards conditional averages, and they frequently depend on architectural asymmetries to prevent representation collapse. In this work, we propose a probabilistic alternative based on generative joint modeling. We introduce Gaussian Joint Embeddings (GJE) and its multi-modal extension, Gaussian Mixture Joint Embeddings (GMJE), which model the joint density of context and target representations and replace black-box prediction with closed-form conditional inference under an explicit probabilistic model. This yields principled uncertainty estimates and a covariance-aware objective for controlling latent geometry. We further identify a failure mode of naive empirical batch optimization, which we term the Mahalanobis Trace Trap, and develop several remedies spanning parametric, adaptive, and non-parametric settings, including prototype-based GMJE, conditional...
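For a single Gaussian, the closed-form conditional inference the abstract refers to corresponds to the standard conditioning formulas for a joint Gaussian over stacked context and target embeddings. The paper's actual GJE parameterization is not shown on this page, so the following is only a minimal NumPy sketch of those formulas; the function name gaussian_conditional and the toy dimensions are illustrative assumptions, not the authors' code.

import numpy as np

def gaussian_conditional(mu, Sigma, x_ctx, d_ctx):
    # Illustrative sketch, not the paper's implementation.
    # Condition a joint Gaussian N(mu, Sigma) over [context; target]
    # on an observed context embedding x_ctx (the first d_ctx coordinates):
    #   mu_{t|c}    = mu_t + S_tc S_cc^{-1} (x_ctx - mu_c)
    #   Sigma_{t|c} = S_tt - S_tc S_cc^{-1} S_ct
    mu_c, mu_t = mu[:d_ctx], mu[d_ctx:]
    S_cc = Sigma[:d_ctx, :d_ctx]
    S_ct = Sigma[:d_ctx, d_ctx:]
    S_tt = Sigma[d_ctx:, d_ctx:]
    # Solve against S_cc rather than forming its inverse, for numerical stability.
    K = np.linalg.solve(S_cc, S_ct).T   # K = S_tc S_cc^{-1}, using S_cc symmetric
    cond_mean = mu_t + K @ (x_ctx - mu_c)
    cond_cov = S_tt - K @ S_ct
    return cond_mean, cond_cov

# Toy usage with a random SPD joint covariance (assumed sizes).
rng = np.random.default_rng(0)
d = 4
A = rng.normal(size=(2 * d, 2 * d))
Sigma = A @ A.T + 1e-3 * np.eye(2 * d)
mu = np.zeros(2 * d)
mean, cov = gaussian_conditional(mu, Sigma, x_ctx=np.ones(d), d_ctx=d)

In this reading, the conditional mean plays the role of the predictor and the conditional covariance supplies the uncertainty estimate the abstract mentions; a mixture version in the spirit of GMJE would condition each Gaussian component this way and reweight the mixture responsibilities by how well each component explains the observed context.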