[2512.09682] Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies



Electrical Engineering and Systems Science > Systems and Control

arXiv:2512.09682 (eess)

Submitted on 10 Dec 2025 (v1), last revised 8 May 2026 (this version, v2)

Title: Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies

Authors: Mika Persson, Jonas Lidman, Jacob Ljungberg, Samuel Sandelius, Adam Andersson

Abstract: This work studies the application of Multi-Agent Reinforcement Learning (MARL) to decentralized control of unmanned aerial vehicles to relay a critical data package to a known position. For this purpose, a family of deterministic games is introduced, designed for MARL scaling studies. A robust baseline policy is proposed which restricts agent motion and applies Dijkstra's shortest path algorithm. Computational experiment results show that two off-the-shelf MARL algorithms perform competitively with the baseline for a small number of agents, but face scalability issues as the number of agents increases. Source code and animations are available online at this https URL.

Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)

Cite as: arXiv:2512.09682 [eess.SY] (or arXiv:2512.09682v2 [eess.SY] f...
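The abstract names Dijkstra's shortest path algorithm as part of the baseline policy but does not say how the underlying graph is built. As a rough illustration only, the following is a minimal generic sketch of Dijkstra's algorithm in Python. The function name `shortest_path`, the node labels, and the edge weights are all hypothetical and are not taken from the paper or its source code.

```python
import heapq


def shortest_path(graph, source, target):
    """Generic Dijkstra sketch (not the paper's implementation).

    graph: dict mapping node -> dict of neighbor -> non-negative edge weight.
    Returns (total_cost, path); (inf, []) if target is unreachable.
    """
    dist = {source: 0.0}   # best known cost to each discovered node
    prev = {}              # predecessor on the best known path
    heap = [(0.0, source)]
    visited = set()

    while heap:
        d, node = heapq.heappop(heap)
        if node in visited:
            continue  # stale heap entry, a shorter route was already settled
        visited.add(node)
        if node == target:
            break
        for neighbor, weight in graph.get(node, {}).items():
            candidate = d + weight
            if candidate < dist.get(neighbor, float("inf")):
                dist[neighbor] = candidate
                prev[neighbor] = node
                heapq.heappush(heap, (candidate, neighbor))

    if target not in dist:
        return float("inf"), []

    # Walk predecessors back from the target to recover the path.
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    path.reverse()
    return dist[target], path


# Hypothetical relay graph: nodes and weights are made up for illustration.
example_graph = {
    "source": {"uav_a": 2.0, "uav_b": 5.0},
    "uav_a": {"uav_b": 1.0, "target": 6.0},
    "uav_b": {"target": 2.0},
    "target": {},
}

cost, path = shortest_path(example_graph, "source", "target")
print(cost, path)
```

In this toy graph the cheapest route passes through both intermediate nodes rather than taking either direct hop. How the actual baseline restricts agent motion and defines nodes and edge costs is specified in the paper, not here.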

Originally published on May 11, 2026. Curated by AI News.
