[2305.09840] Scale-Adaptive Balancing of Exploration and Exploitation in Classical Planning

Computer Science > Artificial Intelligence

arXiv:2305.09840 (cs)

[Submitted on 16 May 2023 (v1), last revised 26 Mar 2026 (this version, v4)]

Title: Scale-Adaptive Balancing of Exploration and Exploitation in Classical Planning

Authors: Stephen Wissow, Masataro Asai

Abstract: Balancing exploration and exploitation has been an important problem in both game tree search and automated planning. However, while the problem has been extensively analyzed within the Multi-Armed Bandit (MAB) literature, the planning community has had limited success when attempting to apply those results. We show that a more detailed theoretical understanding of MAB literature helps improve existing planning algorithms that are based on Monte Carlo Tree Search (MCTS) / Trial Based Heuristic Tree Search (THTS). In particular, THTS uses UCB1 MAB algorithms in an ad hoc manner, as UCB1's theoretical requirement of fixed bounded support reward distributions is not satisfied within heuristic search for classical planning. The core issue lies in UCB1's lack of adaptations to the different scales of the rewards. We propose GreedyUCT-Normal, a MCTS/THTS algorithm with UCB1-Normal bandit for agile classical planning, which handles distributions with different scales by taking the reward variance into consideration, and resulted in an improved algorithmic ...
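To make the scale issue in the abstract concrete, here is a minimal Python sketch comparing the exploration bonus of UCB1 with that of UCB1-Normal. It assumes the textbook index formulas from Auer, Cesa-Bianchi, and Fischer (2002); it is not the paper's GreedyUCT-Normal implementation, and it ignores how a planner would map heuristic values to rewards. The function names (`ucb1_bonus`, `ucb1_normal_bonus`, `bonuses`) are hypothetical and chosen only for illustration.

```python
import math


def ucb1_bonus(n_total, n_arm):
    """UCB1 exploration bonus: depends only on visit counts, not on reward scale."""
    return math.sqrt(2.0 * math.log(n_total) / n_arm)


def ucb1_normal_bonus(n_total, n_arm, sum_sq, mean):
    """UCB1-Normal exploration bonus: scales with the arm's sample variance.

    Assumes n_arm >= 2 and n_total >= 2 so that the variance and the
    logarithm are both defined.
    """
    # Unbiased sample variance from the running sum of squared rewards.
    variance = (sum_sq - n_arm * mean * mean) / (n_arm - 1)
    variance = max(variance, 0.0)  # guard against tiny negative rounding errors
    return math.sqrt(16.0 * variance * math.log(n_total - 1) / n_arm)


def bonuses(rewards, n_total):
    """Return (UCB1 bonus, UCB1-Normal bonus) for one arm's observed rewards."""
    n_arm = len(rewards)
    mean = sum(rewards) / n_arm
    sum_sq = sum(r * r for r in rewards)
    return ucb1_bonus(n_total, n_arm), ucb1_normal_bonus(n_total, n_arm, sum_sq, mean)


if __name__ == "__main__":
    small = [1.0, 2.0, 3.0, 4.0]          # illustrative rewards on a small scale
    large = [100.0 * r for r in small]    # same shape, 100 times the scale
    print("small scale:", bonuses(small, n_total=20))
    print("large scale:", bonuses(large, n_total=20))
```

Under these assumptions, multiplying every reward by 100 leaves the UCB1 bonus unchanged, so it becomes negligible next to the mean, while the UCB1-Normal bonus grows by the same factor of 100 and keeps its weight relative to the mean.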

Originally published on March 30, 2026. Curated by AI News.
