[2508.05984] Parameter-free Optimal Rates for Nonlinear Semi-Norm Contractions with Applications to $Q$-Learning
Computer Science > Machine Learning
arXiv:2508.05984 (cs)
[Submitted on 8 Aug 2025 (v1), last revised 21 Mar 2026 (this version, v2)]

Title: Parameter-free Optimal Rates for Nonlinear Semi-Norm Contractions with Applications to $Q$-Learning
Authors: Ankur Naskar, Gugan Thoppe, Vijay Gupta

Abstract: Algorithms for solving \textit{nonlinear} fixed-point equations -- such as average-reward \textit{$Q$-learning} and \textit{TD-learning} -- often involve semi-norm contractions. Achieving parameter-free optimal convergence rates for these methods via Polyak--Ruppert averaging has remained elusive, largely due to the non-monotonicity of such semi-norms. We close this gap by (i) recasting the averaged error as a linear recursion involving a nonlinear perturbation, and (ii) taming the nonlinearity by coupling the semi-norm's contraction with the monotonicity of a suitably induced norm. Our main result yields the first parameter-free $\tilde{O}(1/\sqrt{t})$ optimal rates for $Q$-learning in both average-reward and exponentially discounted settings, where $t$ denotes the iteration index. The result applies within a broad framework that accommodates synchronous and asynchronous updates, single-agent and distributed deployments, and data streams obtained either from simulators or along Markovi...
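To make the setting concrete, the following is a minimal sketch of synchronous, exponentially discounted $Q$-learning with Polyak--Ruppert (tail) averaging on a toy MDP. The MDP, step-size schedule, horizon, and averaging window are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

# Toy 2-state, 2-action MDP (arbitrary assumed values, not from the paper).
rng = np.random.default_rng(0)
nS, nA, gamma = 2, 2, 0.9
P = rng.dirichlet(np.ones(nS), size=(nS, nA))  # transition kernel P[s, a, s']
R = rng.uniform(size=(nS, nA))                 # rewards R[s, a]

Q = np.zeros((nS, nA))
Q_bar = np.zeros((nS, nA))  # Polyak-Ruppert average of the late iterates
n_avg = 0

T = 5000
for t in range(1, T + 1):
    # Synchronous update: sample one next state for every (s, a) pair.
    s_next = np.array([[rng.choice(nS, p=P[s, a]) for a in range(nA)]
                       for s in range(nS)])
    target = R + gamma * Q[s_next].max(axis=-1)
    alpha = 1.0 / t**0.6            # diminishing step size (assumed schedule)
    Q += alpha * (target - Q)
    if t > T // 2:                   # tail averaging over the second half
        n_avg += 1
        Q_bar += (Q - Q_bar) / n_avg

# Exact fixed point Q* via value iteration, for comparison.
Q_star = np.zeros((nS, nA))
for _ in range(1000):
    Q_star = R + gamma * np.einsum('sap,p->sa', P, Q_star.max(axis=-1))

print("max |Q_bar - Q*| =", np.max(np.abs(Q_bar - Q_star)))
```

Averaging the iterates smooths out the step-size-driven noise; the paper's contribution is showing that such averaged iterates attain the optimal $\tilde{O}(1/\sqrt{t})$ rate without tuning the step size to problem parameters, even though the underlying contraction holds only in a semi-norm.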