[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3
Most embedding models are not Matryoshka-trained, so naive dimension truncation tends to destroy them. I tested a simple alternative: fit PCA once on a sample of embeddings, rotate vectors into the PCA basis, and then truncate. The idea is that PCA concentrates signal into the leading components, so truncation stops being arbitrary.

On a 10K-vector BGE-M3 sample (1024d), I got:

- 512d: naive truncation 0.707 cosine, PCA-first 0.996
- 384d: naive 0.609, PCA-first 0.990
- 256d: naive 0.467, PCA-first 0.9...
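For concreteness, here is a minimal sketch of the pipeline in Python with scikit-learn. The post doesn't include code, so two things are assumptions on my part: `emb` is a stand-in for real BGE-M3 embeddings, and I read the "cosine" score as the mean cosine similarity between each original vector and its truncated reconstruction (for naive truncation of unit vectors that works out to about sqrt(k/1024), which matches the 0.707 figure at 512d). The random stand-in data below will not reproduce the PCA-first numbers, since it lacks the anisotropic structure of real embeddings that PCA exploits.

```python
import numpy as np
from sklearn.decomposition import PCA


def mean_cosine(a: np.ndarray, b: np.ndarray) -> float:
    """Mean row-wise cosine similarity between two (n, d) matrices."""
    a_n = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_n = b / np.linalg.norm(b, axis=1, keepdims=True)
    return float(np.mean(np.sum(a_n * b_n, axis=1)))


def naive_truncation_score(emb: np.ndarray, k: int) -> float:
    """Keep the first k dims, zero the rest, compare to the original."""
    padded = np.zeros_like(emb)
    padded[:, :k] = emb[:, :k]
    return mean_cosine(emb, padded)


def pca_first_score(emb: np.ndarray, k: int) -> float:
    """Fit PCA once, rotate into its basis, truncate to k components,
    then reconstruct back to full width only for scoring purposes."""
    pca = PCA(n_components=k).fit(emb)      # fit once on a sample
    reduced = pca.transform(emb)            # rotate + truncate: the k-dim vectors you'd store
    recon = pca.inverse_transform(reduced)  # back to 1024d so cosines are comparable
    return mean_cosine(emb, recon)


if __name__ == "__main__":
    # Stand-in data; swap in real BGE-M3 embeddings of shape (10_000, 1024).
    rng = np.random.default_rng(0)
    emb = rng.normal(size=(10_000, 1024)).astype(np.float32)
    emb /= np.linalg.norm(emb, axis=1, keepdims=True)
    for k in (512, 384, 256):
        print(k, naive_truncation_score(emb, k), pca_first_score(emb, k))
```

In actual use you would keep the k-dim `pca.transform` output (re-normalized if your index assumes unit vectors) and reuse the one fitted PCA for all future vectors; the reconstruction step here exists only to score fidelity against the full 1024d originals.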