[2604.00686] Full-Gradient Successor Feature Representations

arXiv - Machine Learning · 3 min read

arXiv:2604.00686 (cs) [Submitted on 1 Apr 2026]

Title: Full-Gradient Successor Feature Representations

Authors: Ritish Shrirao, Aditya Priyadarshi, Raghuram Bharadwaj Diddigi

Abstract: Successor Features (SF) combined with Generalized Policy Improvement (GPI) provide a robust framework for transfer learning in Reinforcement Learning (RL) by decoupling environment dynamics from reward functions. However, standard SF learning methods typically rely on semi-gradient Temporal Difference (TD) updates. When combined with non-linear function approximation, semi-gradient methods lack robust convergence guarantees and can lead to instability, particularly in the multi-task setting where accurate feature estimation is critical for effective GPI. Inspired by Full Gradient DQN, we propose Full-Gradient Successor Feature Representations Q-Learning (FG-SFRQL), an algorithm that optimizes the successor features by minimizing the full Mean Squared Bellman Error. Unlike standard approaches, our method computes gradients with respect to parameters in both the online and target networks. We provide a theoretical proof of almost-sure convergence for FG-SFRQL and demonstrate empirically that minimizing the full residual leads to superior sample efficiency and transfer performance compared to semi-gradient baselines i...
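The difference between a semi-gradient and a full-gradient update can be sketched on a single transition. The code below is not the paper's FG-SFRQL algorithm; it is a minimal illustration under simplifying assumptions: a linear successor-feature model psi(s) = W x(s), a fixed policy (no action maximization or GPI), a deterministic transition, and hypothetical names (`td_error`, `bellman_loss`, `step`) and values chosen for the example. The semi-gradient step treats the bootstrapped target as a constant, while the full-gradient step also differentiates through it, so it is true gradient descent on the squared Bellman error for this transition.

```python
def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xj for w, xj in zip(row, x)) for row in W]


def td_error(W, x, phi, x_next, gamma):
    """Vector TD error: phi + gamma * psi(s') - psi(s), with psi(s) = W x(s)."""
    psi = matvec(W, x)
    psi_next = matvec(W, x_next)
    return [p + gamma * pn - ps for p, pn, ps in zip(phi, psi_next, psi)]


def bellman_loss(W, x, phi, x_next, gamma):
    """Half the squared Bellman error on one transition."""
    return 0.5 * sum(d * d for d in td_error(W, x, phi, x_next, gamma))


def step(W, x, phi, x_next, gamma, alpha, full_gradient):
    """Return updated weights after one semi- or full-gradient step."""
    delta = td_error(W, x, phi, x_next, gamma)
    if full_gradient:
        # Differentiate through both psi(s) and the bootstrapped psi(s').
        direction = [xj - gamma * xn for xj, xn in zip(x, x_next)]
    else:
        # Semi-gradient: the target phi + gamma * psi(s') is held constant.
        direction = list(x)
    return [
        [w + alpha * d_i * g_j for w, g_j in zip(row, direction)]
        for row, d_i in zip(W, delta)
    ]


# Hypothetical transition, picked so that the next-state features are larger
# than the current ones (a classic setting where semi-gradient TD misbehaves).
W = [[1.0, 0.0], [0.5, 0.0]]
x, x_next, phi = [1.0, 0.0], [2.0, 0.0], [1.0, 0.0]
gamma, alpha = 0.9, 0.1

before = bellman_loss(W, x, phi, x_next, gamma)
after_semi = bellman_loss(step(W, x, phi, x_next, gamma, alpha, False), x, phi, x_next, gamma)
after_full = bellman_loss(step(W, x, phi, x_next, gamma, alpha, True), x, phi, x_next, gamma)
print(before, after_semi, after_full)
```

On this particular transition the semi-gradient step increases the Bellman error while the full-gradient step decreases it. That outcome depends on the chosen features and step size and says nothing about the paper's experiments; it only shows why differentiating through the target changes the update direction.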

Originally published on April 02, 2026. Curated by AI News.
