[2604.01024] Model-Based Learning of Near-Optimal Finite-Window Policies in POMDPs
Computer Science > Machine Learning

arXiv:2604.01024 (cs)

[Submitted on 1 Apr 2026]

Title: Model-Based Learning of Near-Optimal Finite-Window Policies in POMDPs

Authors: Philip Jordan, Maryam Kamgarpour

Abstract: We study model-based learning of finite-window policies in tabular partially observable Markov decision processes (POMDPs). A common approach to learning under partial observability is to approximate unbounded history dependencies using finite action-observation windows. This induces a finite-state Markov decision process (MDP) over histories, referred to as the superstate MDP. Once a model of this superstate MDP is available, standard MDP algorithms can be used to compute optimal policies, motivating the need for sample-efficient model estimation. Estimating the superstate MDP model is challenging because trajectories are generated by interaction with the original POMDP, creating a mismatch between the sampling process and the target model. We propose a model estimation procedure for tabular POMDPs and analyze its sample complexity. Our analysis exploits a connection between filter stability and concentration inequalities for weakly dependent random variables. As a result, we obtain tight sample complexity guarantees for estimating the superstate MDP model from a single trajectory. Combined with value iteration, ...
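To make the superstate construction concrete, below is a minimal sketch (not from the paper) of the overall pipeline the abstract describes: build length-L action-observation windows from a single trajectory, count transitions between windows to get an empirical superstate MDP, then run value iteration on the estimate. The POMDP interface `step(a)`, the window length `L`, and the uniform exploration policy are illustrative assumptions; the paper's actual estimation procedure and its sample-complexity analysis are more involved.

```python
import random
from collections import defaultdict

def estimate_superstate_mdp(step, n_actions, L, T, seed=0):
    """Estimate transitions and mean rewards of the superstate MDP from
    one trajectory. Superstates are tuples of the last L (action,
    observation) pairs; `step(a)` is a hypothetical POMDP interface
    returning (observation, reward)."""
    rng = random.Random(seed)
    counts = defaultdict(int)      # (s, a, s') -> transition count
    visits = defaultdict(int)      # (s, a)     -> visit count
    rew_sum = defaultdict(float)   # (s, a)     -> cumulative reward
    window = ()
    for _ in range(T):
        a = rng.randrange(n_actions)        # uniform exploration policy
        obs, r = step(a)
        nxt = (window + ((a, obs),))[-L:]   # slide the length-L window
        if len(window) == L:                # only count full windows
            counts[(window, a, nxt)] += 1
            visits[(window, a)] += 1
            rew_sum[(window, a)] += r
        window = nxt
    P = {(s, a, s2): c / visits[(s, a)] for (s, a, s2), c in counts.items()}
    R = {sa: rs / visits[sa] for sa, rs in rew_sum.items()}
    return P, R

def value_iteration(P, R, gamma=0.95, iters=200):
    """Plain value iteration on the estimated tabular superstate MDP."""
    succ = defaultdict(list)                # (s, a) -> [(s', prob)]
    for (s, a, s2), p in P.items():
        succ[(s, a)].append((s2, p))
    acts = defaultdict(set)                 # s -> observed actions
    for (s, a) in succ:
        acts[s].add(a)
    V = defaultdict(float)                  # unseen successors default to 0
    for _ in range(iters):
        V = defaultdict(float, {
            s: max(R[(s, a)] + gamma * sum(p * V[s2] for s2, p in succ[(s, a)])
                   for a in acts[s])
            for s in acts
        })
    return dict(V)
```

In this sketch, any tabular POMDP simulator could be wrapped as `step`; the substance of the paper is not this mechanics but how large T must be, via the link between filter stability and concentration for weakly dependent samples, before the empirical model is accurate enough for the resulting policy to be near-optimal.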