[2604.05185] Cross-fitted Proximal Learning for Model-Based Reinforcement Learning
Computer Science > Machine Learning
arXiv:2604.05185 (cs)
[Submitted on 6 Apr 2026]

Title: Cross-fitted Proximal Learning for Model-Based Reinforcement Learning
Authors: Nishanth Venkatesh, Andreas A. Malikopoulos

Abstract: Model-based reinforcement learning is attractive for sequential decision-making because it explicitly estimates reward and transition models and then supports planning through simulated rollouts. In offline settings with hidden confounding, however, models learned directly from observational data may be biased. This challenge is especially pronounced in partially observable systems, where latent factors may jointly affect actions, rewards, and future observations. Recent work has shown that policy evaluation in such confounded partially observable Markov decision processes (POMDPs) can be reduced to estimating reward-emission and observation-transition bridge functions satisfying conditional moment restrictions (CMRs). In this paper, we study the statistical estimation of these bridge functions. We formulate bridge learning as a CMR problem with nuisance objects given by a conditional mean embedding and a conditional density. We then develop a $K$-fold cross-fitted extension of the existing two-stage bridge estimator. The proposed procedure preserves the original bridge-based identifica...
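The $K$-fold cross-fitting idea mentioned in the abstract can be illustrated generically: nuisance models are fit on $K-1$ folds and evaluated only on the held-out fold, so no sample's nuisance prediction depends on its own data. The sketch below is a minimal, hypothetical illustration of that sample-splitting pattern with a closed-form ridge regression standing in for the nuisance learner; it is not the paper's two-stage bridge estimator, and the function names are this sketch's own.

```python
# Illustrative K-fold cross-fitting skeleton (NOT the paper's estimator):
# each sample receives a nuisance prediction from a model trained on the
# other K-1 folds only.
import numpy as np

def ridge_fit(X, y, lam=1e-3):
    """Closed-form ridge regression as a stand-in nuisance learner."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def cross_fit_predictions(X, y, K=5, seed=0):
    """Return out-of-fold nuisance predictions for every sample."""
    n = X.shape[0]
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n)          # shuffle before splitting into folds
    folds = np.array_split(idx, K)
    preds = np.empty(n)
    for k in range(K):
        test = folds[k]
        train = np.concatenate([folds[j] for j in range(K) if j != k])
        beta = ridge_fit(X[train], y[train])   # fit on the other folds
        preds[test] = X[test] @ beta           # predict only on held-out fold
    return preds

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(200, 3))
    y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=200)
    preds = cross_fit_predictions(X, y)
    print(float(np.mean((y - preds) ** 2)))
```

In the paper's setting the nuisance objects are a conditional mean embedding and a conditional density rather than a simple regression, but the fold structure (fit on $K-1$ folds, evaluate on the held-out fold) is the same.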