Machine Learning

[R] Spectral Compact Training: 172x memory reduction for 70B model training - verified on a Steam Deck (7.24 GB)

Reddit - Machine Learning March 28, 2026 1 min read

About this article

This is a research article about a patent I filed (not self promotion). I am dyslexic so I used AI to help with the writing. I have been working on Spectral Compact Training (SCT). It stores every weight matrix as [ W = U \operatorname{diag}(s) VT ] and trains directly through the small spectral factors. Never builds the dense matrix. Exact gradients via standard backprop. QR retraction keeps U and V orthonormal after each optimizer step. Results on a 70B-class architecture (80 layers, hidden...

You've been blocked by network security.To continue, log in to your Reddit account or use your developer tokenIf you think you've been blocked by mistake, file a ticket below and we'll look into it.Log in File a ticket

Originally published on March 28, 2026. Curated by AI News.

Read Original Article

Llms

[D] Litellm supply chain attack and what it means for api key management

If you missed it, litellm versions 1.82.7 and 1.82.8 on pypi got compromised. malicious .pth file that runs on every python process start...

Reddit - Machine Learning · 1 min · 5 minutes ago

Machine Learning

Is anyone else watching what Qubic is doing with distributed compute and AI training? Seems underreported in AI cirles

I follow AI infrastructure pretty closely and Qubic keeps coming up in my research in a way I find intersting but havent seen much discus...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Machine Learning

We need to teach AI the essence of being human to reduce the risk of misalignment

One part of the alignment problem is that AI does not genuinely understand what it's like to live in the world, even though it can descri...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

Llms

Looking for a solid ChatGPT alternative for daily work

I was long juggling separate monthly subscriptions for Claude, Gemini, and GPT-4 until the costs and tab-switching became a total mess an...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

[R] Spectral Compact Training: 172x memory reduction for 70B model training - verified on a Steam Deck (7.24 GB)

About this article

Related Articles

[D] Litellm supply chain attack and what it means for api key management

Is anyone else watching what Qubic is doing with distributed compute and AI training? Seems underreported in AI cirles

We need to teach AI the essence of being human to reduce the risk of misalignment

Looking for a solid ChatGPT alternative for daily work

No comments

Stay updated with AI News