[2603.23571] StateLinFormer: Stateful Training Enhancing Long-term

[2603.23571] StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation

arXiv - Machine Learning March 26, 2026 3 min read

About this article

Abstract page for arXiv paper 2603.23571: StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation

Computer Science > Machine Learning arXiv:2603.23571 (cs) [Submitted on 24 Mar 2026] Title:StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation Authors:Zhiyuan Chen, Yuxuan Zhong, Fan Wang, Bo Yu, Pengtao Shao, Shaoshan Liu, Ning Ding View a PDF of the paper titled StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation, by Zhiyuan Chen and 6 other authors View PDF HTML (experimental) Abstract:Effective navigation intelligence relies on long-term memory to support both immediate generalization and sustained adaptation. However, existing approaches face a dilemma: modular systems rely on explicit mapping but lack flexibility, while Transformer-based end-to-end models are constrained by fixed context windows, limiting persistent memory across extended interactions. We introduce StateLinFormer, a linear-attention navigation model trained with a stateful memory mechanism that preserves recurrent memory states across consecutive training segments instead of reinitializing them at each batch boundary. This training paradigm effectively approximates learning on infinitely long sequences, enabling the model to achieve long-horizon memory retention. Experiments across both MAZE and ProcTHOR environments demonstrate that StateLinFormer significantly outperforms its stateless linear-attention counterpart and standard Transformer baselines with fixed context windows. Notably, as interaction length increases, persistent stateful training substan...

Originally published on March 26, 2026. Curated by AI News.

Machine Learning

[P] MCGrad: fix calibration of your ML model in subgroups

Hi r/MachineLearning, We’re open-sourcing MCGrad, a Python package for multicalibration–developed and deployed in production at Meta. Thi...

Reddit - Machine Learning · 1 min · 39 minutes ago

Machine Learning

Ml project user give dataset and I give best model [D] [P]

Tl,dr : suggest me a solution to create a ai ml project where user will give his dataset as input and the project should give best model ...

Reddit - Machine Learning · 1 min · about 4 hours ago

Machine Learning

[D] ICML Reviewer Acknowledgement

Hi, I'm a little confused about ICML discussion period Does the period for reviewer acknowledging responses have already ended? One of th...

Reddit - Machine Learning · 1 min · about 7 hours ago

Llms

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone I've set up a self-hosted API gateway using [New-API](QuantumNous/new-ap) to manage and distribute Claude Opus 4.6 access ac...

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

[2603.23571] StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation

About this article

Related Articles

[P] MCGrad: fix calibration of your ML model in subgroups

Ml project user give dataset and I give best model [D] [P]

[D] ICML Reviewer Acknowledgement

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

No comments

Stay updated with AI News