[2508.02441] Computationally efficient Gauss-Newton reinforcement

[2508.02441] Computationally efficient Gauss-Newton reinforcement learning for model predictive control

arXiv - Machine Learning April 03, 2026 4 min read

About this article

Abstract page for arXiv paper 2508.02441: Computationally efficient Gauss-Newton reinforcement learning for model predictive control

Electrical Engineering and Systems Science > Systems and Control arXiv:2508.02441 (eess) [Submitted on 4 Aug 2025 (v1), last revised 2 Apr 2026 (this version, v2)] Title:Computationally efficient Gauss-Newton reinforcement learning for model predictive control Authors:Dean Brandner, Sebastien Gros, Sergio Lucia View a PDF of the paper titled Computationally efficient Gauss-Newton reinforcement learning for model predictive control, by Dean Brandner and 2 other authors View PDF HTML (experimental) Abstract:Model predictive control (MPC) is widely used in process control due to its interpretability and ability to handle constraints. As a parametric policy in reinforcement learning (RL), MPC offers strong initial performance and low data requirements compared to black-box policies like neural networks. However, most RL methods rely on first-order updates, which scale well to large parameter spaces but converge at most linearly, making them inefficient when each policy update requires solving an optimal control problem, as is the case with MPC. While MPC policies are typically low parameterized and thus amenable to second-order approaches, existing second-order methods demand second-order policy derivatives, which can be computationally intractable. This work introduces a Gauss-Newton approximation of the deterministic policy Hessian that eliminates the need for second-order policy derivatives, enabling superlinear convergence with minimal computational overhead. To further im...

Originally published on April 03, 2026. Curated by AI News.

Machine Learning

How do you anonymize code for a conference submission? [D]

Hi everyone, I have a question about anonymizing code for conference submissions. I’m submitting an AI/ML paper to a conference and would...

Reddit - Machine Learning · 1 min · 29 minutes ago

Machine Learning

Now Meta will track what employees do on their computers to train its AI agents | The Verge

Meta is reportedly using tracking software to record its employees’ mouse and keyboard activity for training data for its AI agents.

The Verge - AI · 4 min · about 2 hours ago

Llms

Training-time intervention yields 63.4% blind-pair human preference at matched val-loss (1.2B params, 320 judgments, p = 1.98 × 10⁻⁵) [R]

TL;DR. I ran a blind A/B preference evaluation between two 1.2B-parameter LMs trained on identical data (same order, same seed, 30K steps...

Reddit - Machine Learning · 1 min · about 3 hours ago

Machine Learning

I can't believe text normalization is so underdiscussed in streaming text-to-speech [D]

Kinda suprises me how little discussion there is around about mistakes in streaming TTS models People look for natural readers, high voic...

Reddit - Machine Learning · 1 min · about 4 hours ago

[2508.02441] Computationally efficient Gauss-Newton reinforcement learning for model predictive control

About this article

Related Articles

How do you anonymize code for a conference submission? [D]

Now Meta will track what employees do on their computers to train its AI agents | The Verge

Training-time intervention yields 63.4% blind-pair human preference at matched val-loss (1.2B params, 320 judgments, p = 1.98 × 10⁻⁵) [R]

I can't believe text normalization is so underdiscussed in streaming text-to-speech [D]

No comments

Stay updated with AI News