[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 [P]

Reddit - Machine Learning 1 min read

About this article

Most embedding models are not Matryoshka-trained, so naive dimension truncation tends to destroy them. I tested a simple alternative: fit PCA once on a sample of embeddings, rotate vectors into the PCA basis, and then truncate. The idea is that PCA concentrates signal into leading components, so truncation stops being arbitrary. On a 10K-vector BGE-M3 sample (1024d), I got: 512d: naive truncation 0.707 cosine, PCA-first 0.996 384d: naive 0.609, PCA-first 0.990 256d: naive 0.467, PCA-first 0.9...

You've been blocked by network security.To continue, log in to your Reddit account or use your developer tokenIf you think you've been blocked by mistake, file a ticket below and we'll look into it.Log in File a ticket

Originally published on April 09, 2026. Curated by AI News.

Related Articles

Machine Learning

Flux maintains facial geometry and spatial coherence across 5 sequential iterative edits - is anything else doing this at this level?

One woman. 5 Different Prompts. Perfect Contextual Preservation Playing around with Flux again and thought I'll try it with a model chang...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Looking for Feedback & Improvement Ideas[P]

Hey everyone, I recently built a machine learning project and would really appreciate some honest feedback from this community. LINK- htt...

Reddit - Machine Learning · 1 min ·
Machine Learning

Why Anthropic’s new model has cybersecurity experts rattled

submitted by /u/ThereWas [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

AI Systems Performance Engineering by Chris Fregly - is it worth it? [D]

I found this book "AI Systems Performance Engineering" by Chris Fregly [1]. There is another book "Machine Learning Systems" by harvard [...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime