[2502.20326] Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application

[2502.20326] Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application

arXiv - AI 4 min read Article

Summary

This paper presents a novel framework for autonomous decision-making in UAVs during search-and-rescue operations, demonstrating effective navigation in GNSS-denied environments.

Why It Matters

The research addresses critical challenges in search-and-rescue missions, particularly in environments where GPS is unreliable. By integrating advanced AI techniques, this work enhances the operational efficiency and safety of UAVs, potentially improving emergency response outcomes.

Key Takeaways

  • Introduces an end-to-end framework for UAVs in search-and-rescue.
  • Utilizes a Twin Delayed Deep Deterministic Policy Gradient controller for improved trajectory planning.
  • Employs a deep Graph Attention Network for efficient task allocation among drones.
  • Achieves centimeter-level altitude stability using a novel sensor fusion approach.
  • Demonstrates successful real-world application with first-place results in a competitive environment.

Computer Science > Robotics arXiv:2502.20326 (cs) [Submitted on 27 Feb 2025 (v1), last revised 16 Feb 2026 (this version, v2)] Title:Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application Authors:Thomas Hickling, Maxwell Hogan, Abdulla Tammam, Nabil Aouf View a PDF of the paper titled Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application, by Thomas Hickling and 3 other authors View PDF HTML (experimental) Abstract:This paper presents the first end-to-end framework that combines guidance, navigation, and centralised task allocation for multiple UAVs performing autonomous search-and-rescue (SAR) in GNSS-denied indoor environments. A Twin Delayed Deep Deterministic Policy Gradient controller is trained with an Artificial Potential Field (APF) reward that blends attractive and repulsive potentials with continuous control, accelerating convergence and yielding smoother, safer trajectories than distance-only baselines. Collaborative mission assignment is solved by a deep Graph Attention Network that, at each decision step, reasons over the drone-task graph to produce near-optimal allocations with negligible on-board compute. To arrest the notorious Z-drift of indoor LiDAR-SLAM, we fuse depth-camera altimetry with IMU vertical velocity in a lightweight complementary filter, giving centimetre-level altitude stability without external beacon...

Related Articles

Nomadic raises $8.4 million to wrangle the data pouring off autonomous vehicles | TechCrunch
Machine Learning

Nomadic raises $8.4 million to wrangle the data pouring off autonomous vehicles | TechCrunch

The company turns footage from robots into structured, searchable datasets with a deep learning model.

TechCrunch - AI · 6 min ·
Machine Learning

The AI Chip War is Just Getting Started

Everyone talks about AI models, but the real bottleneck might be hardware. According to a recent study by Roots Analysis: AI chip market ...

Reddit - Artificial Intelligence · 1 min ·
Robotics

What happens when AI agents can earn and spend real money? I built a small test to find out

I've been sitting with a question for a while: what happens when AI agents aren't just tools to be used, but participants in an economy? ...

Reddit - Artificial Intelligence · 1 min ·
Robotics

AIPass Herald

Some insight onto building a muilti agent autonomous system. This is like the daily newspaper for the project. A quick read to see how ou...

Reddit - Artificial Intelligence · 1 min ·
More in Robotics: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime