[2511.04454] Fitting Reinforcement Learning Model to Behavioral Data

[2511.04454] Fitting Reinforcement Learning Model to Behavioral Data under Bandits

arXiv - Machine Learning March 27, 2026 4 min read

About this article

Abstract page for arXiv paper 2511.04454: Fitting Reinforcement Learning Model to Behavioral Data under Bandits

Computer Science > Computational Engineering, Finance, and Science arXiv:2511.04454 (cs) [Submitted on 6 Nov 2025 (v1), last revised 26 Mar 2026 (this version, v2)] Title:Fitting Reinforcement Learning Model to Behavioral Data under Bandits Authors:Hao Zhu, Jasper Hoffmann, Baohe Zhang, Joschka Boedecker View a PDF of the paper titled Fitting Reinforcement Learning Model to Behavioral Data under Bandits, by Hao Zhu and 3 other authors View PDF HTML (experimental) Abstract:We consider the problem of fitting a reinforcement learning (RL) model to some given behavioral data under a multi-armed bandit environment. These models have received much attention in recent years for characterizing human and animal decision making behavior. We provide a generic mathematical optimization problem formulation for the fitting problem of a wide range of RL models that appear frequently in scientific research applications. We then provide a detailed theoretical analysis of its convexity properties. Based on the theoretical results, we introduce a novel solution method for the fitting problem of RL models based on convex relaxation and optimization. Our method is then evaluated in several simulated and real-world bandit environments to compare with some benchmark methods that appear in the literature. Numerical results indicate that our method achieves comparable performance to the state-of-the-art, while significantly reducing computation time. We also provide an open-source Python package f...

Originally published on March 27, 2026. Curated by AI News.

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 3 hours ago

Machine Learning

[D] Looking for definition of open-world ish learning problem

Hello! Recently I did a project where I initially had around 30 target classes. But at inference, the model had to be able to handle a lo...

Reddit - Machine Learning · 1 min · about 3 hours ago

Machine Learning

Mystery Shopping Meets Machine Learning: Can Algorithms Become the Ultimate Customer Experience Auditor?

Customer expectations across Africa are shifting faster than most organisations can track. A single inconsistent interaction can ignite a...

AI News - General · 8 min · about 4 hours ago

Machine Learning

GitHub to Use User Data for AI Training by Default

submitted by /u/i-drake [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

[2511.04454] Fitting Reinforcement Learning Model to Behavioral Data under Bandits

About this article

Related Articles

UMKC Announces New Master of Science in Artificial Intelligence

[D] Looking for definition of open-world ish learning problem

Mystery Shopping Meets Machine Learning: Can Algorithms Become the Ultimate Customer Experience Auditor?

GitHub to Use User Data for AI Training by Default

No comments

Stay updated with AI News