[2405.18248] Extreme Value Monte Carlo Tree Search for Classical Planning


arXiv:2405.18248 (cs)
[Submitted on 28 May 2024 (v1), last revised 26 Mar 2026 (this version, v3)]

Title: Extreme Value Monte Carlo Tree Search for Classical Planning
Authors: Masataro Asai, Stephen Wissow

Abstract: Despite being successful in board games and reinforcement learning (RL), Monte Carlo Tree Search (MCTS) combined with Multi-Armed Bandits (MABs) has seen limited success in domain-independent classical planning until recently. Previous work (Wissow and Asai 2024) showed that UCB1, designed for bounded rewards, performs poorly when applied to cost-to-go estimates in classical planning, which are unbounded in $\mathbb{R}$, and showed improved performance using a Gaussian reward MAB instead. This paper further sharpens our understanding of ideal bandits for planning tasks. Existing work has two issues. First, Gaussian MABs under-specify the support of cost-to-go estimates as $(-\infty,\infty)$, which we can narrow down. Second, the Full Bellman backup (Schulte and Keller 2014), which backpropagates the sample max/min, lacks theoretical justification. We use Peaks-Over-Threshold Extreme Value Theory to resolve both issues at once, and propose a new bandit algorithm (UCB1-Uniform). We formally prove its regret bound and empirically demonstrate its performance in classical pl...
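As background for the abstract's remark that UCB1 is designed for bounded rewards, the following is a minimal sketch of the textbook UCB1 selection rule, assuming rewards lie in [0, 1]. It is not the paper's UCB1-Uniform algorithm, and the function and parameter names (`ucb1_select`, `counts`, `means`) are hypothetical, chosen only for illustration. The exploration bonus sqrt(2 ln N / n_i) is calibrated to that unit range, which suggests why applying it directly to unbounded cost-to-go estimates can misbehave.

```python
import math


def ucb1_select(counts, means):
    """Return the index of the arm with the highest textbook UCB1 score.

    counts[i]: number of times arm i has been pulled so far
    means[i]:  empirical mean reward of arm i, assumed to lie in [0, 1]
    (Both names are illustrative assumptions, not taken from the paper.)
    """
    # Try every arm once before the formula is well defined.
    for i, n in enumerate(counts):
        if n == 0:
            return i

    total = sum(counts)
    # Score = empirical mean + exploration bonus sized for [0, 1] rewards.
    scores = [
        m + math.sqrt(2.0 * math.log(total) / n)
        for m, n in zip(means, counts)
    ]
    return max(range(len(scores)), key=scores.__getitem__)
```

With equal pull counts the rule picks the arm with the higher mean, while a rarely pulled arm can win on its bonus alone. If the means were heuristic values in the tens or hundreds instead of [0, 1], the bonus term would be negligible by comparison and exploration would effectively stop.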

Originally published on March 30, 2026. Curated by AI News.
