[2604.05125] Offline RL for Adaptive Policy Retrieval in Prior

[2604.05125] Offline RL for Adaptive Policy Retrieval in Prior Authorization

arXiv - AI April 08, 2026 4 min read

About this article

Abstract page for arXiv paper 2604.05125: Offline RL for Adaptive Policy Retrieval in Prior Authorization

Computer Science > Information Retrieval arXiv:2604.05125 (cs) [Submitted on 6 Apr 2026] Title:Offline RL for Adaptive Policy Retrieval in Prior Authorization Authors:Ruslan Sharifullin, Maxim Gorshkov, Hannah Clay View a PDF of the paper titled Offline RL for Adaptive Policy Retrieval in Prior Authorization, by Ruslan Sharifullin and 2 other authors View PDF HTML (experimental) Abstract:Prior authorization (PA) requires interpretation of complex and fragmented coverage policies, yet existing retrieval-augmented systems rely on static top-$K$ strategies with fixed numbers of retrieved sections. Such fixed retrieval can be inefficient and gather irrelevant or insufficient information. We model policy retrieval for PA as a sequential decision-making problem, formulating adaptive retrieval as a Markov Decision Process (MDP). In our system, an agent iteratively selects policy chunks from a top-$K$ candidate set or chooses to stop and issue a decision. The reward balances decision correctness against retrieval cost, capturing the trade-off between accuracy and efficiency. We train policies using Conservative Q-Learning (CQL), Implicit Q-Learning (IQL), and Direct Preference Optimization (DPO) in an offline RL setting on logged trajectories generated from baseline retrieval strategies over synthetic PA requests derived from publicly available CMS coverage data. On a corpus of 186 policy chunks spanning 10 CMS procedures, CQL achieves 92% decision accuracy (+30 percentage points ...

Originally published on April 08, 2026. Curated by AI News.

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 1 hour ago

Machine Learning

Weird ICML decision [D]

Hello, A friend of mine had a paper with borderline scores accepted at ICML. However, the comment made by the meta reviewers feels like t...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

[2603.13566] EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection

Abstract page for arXiv paper 2603.13566: EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection

arXiv - Machine Learning · 3 min · about 3 hours ago

[2604.05125] Offline RL for Adaptive Policy Retrieval in Prior Authorization

About this article

Related Articles

UMKC Announces New Master of Science in Artificial Intelligence

Accelerating science with AI and simulations

Weird ICML decision [D]

[2603.13566] EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection

No comments

Stay updated with AI News