[2510.01938] StelLA: Subspace Learning in Low-rank Adaptation using

[2510.01938] StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold

arXiv - Machine Learning April 03, 2026 4 min read

About this article

Abstract page for arXiv paper 2510.01938: StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold

Computer Science > Machine Learning arXiv:2510.01938 (cs) [Submitted on 2 Oct 2025 (v1), last revised 2 Apr 2026 (this version, v2)] Title:StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold Authors:Zhizhong Li, Sina Sajadmanesh, Jingtao Li, Lingjuan Lyu View a PDF of the paper titled StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold, by Zhizhong Li and 3 other authors View PDF HTML (experimental) Abstract:Low-rank adaptation (LoRA) has been widely adopted as a parameter-efficient technique for fine-tuning large-scale pre-trained models. However, it still lags behind full fine-tuning in performance, partly due to its insufficient exploitation of the geometric structure underlying low-rank manifolds. In this paper, we propose a geometry-aware extension of LoRA that uses a three-factor decomposition $U\!SV^\top$. Analogous to the structure of singular value decomposition (SVD), it separates the adapter's input and output subspaces, $V$ and $U$, from the scaling factor $S$. Our method constrains $U$ and $V$ to lie on the Stiefel manifold, ensuring their orthonormality throughout the training. To optimize on the Stiefel manifold, we employ a flexible and modular geometric optimization design that converts any Euclidean optimizer to a Riemannian one. It enables efficient subspace learning while remaining compatible with existing fine-tuning pipelines. Empirical results across a wide range of downstream tasks, including commonsense reaso...

Originally published on April 03, 2026. Curated by AI News.

Machine Learning

Anthropic’s Mythos rollout has missed America’s cybersecurity agency | The Verge

The Cybersecurity and Infrastructure Security Agency (CISA) doesn’t have access to Anthropic’s Mythos Preview, Axios reported.

The Verge - AI · 5 min · about 1 hour ago

Machine Learning

How do you anonymize code for a conference submission? [D]

Hi everyone, I have a question about anonymizing code for conference submissions. I’m submitting an AI/ML paper to a conference and would...

Reddit - Machine Learning · 1 min · about 2 hours ago

Machine Learning

Now Meta will track what employees do on their computers to train its AI agents | The Verge

Meta is reportedly using tracking software to record its employees’ mouse and keyboard activity for training data for its AI agents.

The Verge - AI · 4 min · about 3 hours ago

Llms

Training-time intervention yields 63.4% blind-pair human preference at matched val-loss (1.2B params, 320 judgments, p = 1.98 × 10⁻⁵) [R]

TL;DR. I ran a blind A/B preference evaluation between two 1.2B-parameter LMs trained on identical data (same order, same seed, 30K steps...

Reddit - Machine Learning · 1 min · about 5 hours ago

[2510.01938] StelLA: Subspace Learning in Low-rank Adaptation using Stiefel Manifold

About this article

Related Articles

Anthropic’s Mythos rollout has missed America’s cybersecurity agency | The Verge

How do you anonymize code for a conference submission? [D]

Now Meta will track what employees do on their computers to train its AI agents | The Verge

Training-time intervention yields 63.4% blind-pair human preference at matched val-loss (1.2B params, 320 judgments, p = 1.98 × 10⁻⁵) [R]

No comments

Stay updated with AI News