[2402.15127] Asymptotically and Minimax Optimal Regret Bounds for

[2402.15127] Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention

arXiv - Machine Learning March 24, 2026 4 min read

About this article

Abstract page for arXiv paper 2402.15127: Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention

Computer Science > Machine Learning arXiv:2402.15127 (cs) [Submitted on 23 Feb 2024 (v1), last revised 22 Mar 2026 (this version, v2)] Title:Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention Authors:Junwen Yang, Tianyuan Jin, Vincent Y. F. Tan View a PDF of the paper titled Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention, by Junwen Yang and 2 other authors View PDF HTML (experimental) Abstract:We introduce a novel extension of the canonical multi-armed bandit problem that incorporates an additional strategic innovation: abstention. In this enhanced framework, the agent is not only tasked with selecting an arm at each time step, but also has the option to abstain from accepting the stochastic instantaneous reward before observing it. When opting for abstention, the agent either suffers a fixed regret or gains a guaranteed reward. This added layer of complexity naturally prompts the key question: can we develop algorithms that are both computationally efficient and asymptotically and minimax optimal in this setting? We answer this question in the affirmative by designing and analyzing algorithms whose regrets meet their corresponding information-theoretic lower bounds. Our results offer valuable quantitative insights into the benefits of the abstention option, laying the groundwork for further exploration in other online decision-making problems with such an option. Extensive numerical experiment...

Originally published on March 24, 2026. Curated by AI News.

Llms

If AI is really making us more productive... why does it feel like we are working more, not less...?

The promise of AI was the ultimate system optimisation: Efficiency. On paper, the tools are delivering something similar to what they pro...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Ai Infrastructure

[P] Built an open source tool to find the location of any street picture

Hey guys, Thank you so much for your love and support regarding Netryx Astra V2 last time. Many people are not that technically savvy to ...

Reddit - Machine Learning · 1 min · about 7 hours ago

Llms

[R] GPT-5.4-mini regressed 22pp on vanilla prompting vs GPT-5-mini. Nobody noticed because benchmarks don't test this. Recursive Language Models solved it.

GPT-5.4-mini produces shorter, terser outputs by default. Vanilla accuracy dropped from 69.5% to 47.2% across 12 tasks (1,800 evals). The...

Reddit - Machine Learning · 1 min · about 11 hours ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 12 hours ago

[2402.15127] Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention

About this article

Related Articles

If AI is really making us more productive... why does it feel like we are working more, not less...?

[P] Built an open source tool to find the location of any street picture

[R] GPT-5.4-mini regressed 22pp on vanilla prompting vs GPT-5-mini. Nobody noticed because benchmarks don't test this. Recursive Language Models solved it.

UMKC Announces New Master of Science in Artificial Intelligence

No comments

Stay updated with AI News