[2604.02505] Optimal Projection-Free Adaptive SGD for Matrix Optimization
Mathematics > Optimization and Control, arXiv:2604.02505
[Submitted on 2 Apr 2026]

Title: Optimal Projection-Free Adaptive SGD for Matrix Optimization
Authors: Dmitry Kovalev

Abstract: Recently, Jiang et al. [2026] developed Leon, a practical variant of the One-sided Shampoo algorithm [Xie et al., 2025a; An et al., 2025] for online convex optimization that does not require computing a costly quadratic projection at each iteration. Unfortunately, according to the existing analysis, Leon requires tuning an additional hyperparameter in its preconditioner and cannot achieve dimension-independent convergence guarantees for convex optimization problems beyond the bounded-gradients assumption. In this paper, we resolve this issue by proving certain stability properties of Leon's preconditioner. Using our improved analysis, we show that tuning the extra hyperparameter can be avoided and, more importantly, we develop the first practical variant of One-sided Shampoo with Nesterov acceleration that does not require computing projections at each iteration. As a side contribution, we obtain improved dimension-independent rates in the non-smooth non-convex setting and develop a unified analysis of the proposed algorithm, which yields accelerated projection-free adaptive SGD with (block-)diagonal preconditioners.

Subjects: Optimization and Control (math.OC)
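The abstract does not state Leon's update rule, but the algorithmic family it belongs to can be illustrated. Below is a hedged sketch of one step of left-preconditioned (one-sided) adaptive SGD on a matrix parameter, in the spirit of the One-sided Shampoo family: accumulate a one-sided statistic L += G Gᵀ and scale the gradient by L^{-1/2}. All names, shapes, and constants here are illustrative assumptions; the actual Leon update, and how it avoids the projection, is given in the paper, not here.

```python
import numpy as np

def one_sided_step(W, G, L, lr=0.1, eps=1e-8):
    """One illustrative step of left-preconditioned adaptive SGD.

    Accumulates the one-sided statistic L += G @ G.T and scales the
    gradient by L^{-1/2} (computed via eigendecomposition).  This is a
    generic sketch of the one-sided preconditioning family; Leon's
    actual update differs and, per the abstract, needs no projection.
    """
    L = L + G @ G.T
    vals, vecs = np.linalg.eigh(L)                    # L is symmetric PSD
    inv_sqrt = (vecs / np.sqrt(vals + eps)) @ vecs.T  # L^{-1/2}, eps for rank deficiency
    return W - lr * (inv_sqrt @ G), L

# Usage: minimize f(W) = ||W - W_star||_F^2, whose gradient is 2 (W - W_star).
rng = np.random.default_rng(0)
W_star = 0.3 * rng.standard_normal((4, 3))
W, L = np.zeros((4, 3)), np.zeros((4, 4))
for _ in range(500):
    W, L = one_sided_step(W, 2 * (W - W_star), L)
```

Replacing L with its diagonal recovers the Adagrad-style diagonal-preconditioner family that, per the abstract, the paper's unified analysis also covers.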