[2603.02154] Boltzmann-based Exploration for Robust Decentralized

[2603.02154] Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning

arXiv - AI March 03, 2026 3 min read

About this article

Abstract page for arXiv paper 2603.02154: Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning

Computer Science > Multiagent Systems arXiv:2603.02154 (cs) [Submitted on 2 Mar 2026] Title:Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning Authors:Nhat Nguyen, Duong Nguyen, Gianluca Rizzo, Hung Nguyen View a PDF of the paper titled Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning, by Nhat Nguyen and 2 other authors View PDF HTML (experimental) Abstract:Decentralized Monte Carlo Tree Search (Dec-MCTS) is widely used for cooperative multi-agent planning but struggles in sparse or skewed reward environments. We introduce Coordinated Boltzmann MCTS (CB-MCTS), which replaces deterministic UCT with a stochastic Boltzmann policy and a decaying entropy bonus for sustained yet focused exploration. While Boltzmann exploration has been studied in single-agent MCTS, applying it in multi-agent systems poses unique challenges. CB-MCTS is the first to address this. We analyze CB-MCTS in the simple-regret setting and show in simulations that it outperforms Dec-MCTS in deceptive scenarios and remains competitive on standard benchmarks, providing a robust solution for multi-agent planning. Comments: Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI) Cite as: arXiv:2603.02154 [cs.MA] (or arXiv:2603.02154v1 [cs.MA] for this version) https://doi.org/10.48550/arXiv.2603.02154 Focus to learn more arXiv-issued DOI via DataCite (pending registration) Submission history From: Nhat Nguyen [view email] [v1] Mon, 2 Mar 2026 18...

Originally published on March 03, 2026. Curated by AI News.

Llms

[2506.20964] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

Abstract page for arXiv paper 2506.20964: Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

arXiv - AI · 4 min · about 8 hours ago

Ai Agents

[2601.08323] AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

Abstract page for arXiv paper 2601.08323: AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

arXiv - AI · 3 min · about 8 hours ago

Llms

[2603.18349] Large-Scale Analysis of Persuasive Content on Moltbook

Abstract page for arXiv paper 2603.18349: Large-Scale Analysis of Persuasive Content on Moltbook

arXiv - AI · 3 min · about 8 hours ago

Ai Agents

[2511.19669] HeaRT: A Hierarchical Circuit Reasoning Tree-Based Agentic Framework for AMS Design Optimization

Abstract page for arXiv paper 2511.19669: HeaRT: A Hierarchical Circuit Reasoning Tree-Based Agentic Framework for AMS Design Optimization

arXiv - AI · 3 min · about 8 hours ago

[2603.02154] Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning

About this article

Related Articles

[2506.20964] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

[2601.08323] AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

[2603.18349] Large-Scale Analysis of Persuasive Content on Moltbook

[2511.19669] HeaRT: A Hierarchical Circuit Reasoning Tree-Based Agentic Framework for AMS Design Optimization

No comments

Stay updated with AI News