Task Assignment and Exploration Optimization for Low Altitude UAV Rescue via Generative AI Enhanced Multi-agent Reinforcement Learning

📅 2025-04-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of limited onboard computation for low-altitude UAVs, dynamically scarce ground computing node (GCN) resources, and long-term system instability in unknown rescue environments, this paper proposes a UAV–GER–HAP heterogeneous collaborative framework that jointly optimizes dynamic task allocation and proactive exploration path planning. Methodologically, we integrate a generative diffusion model (GDM) into multi-agent deep deterministic policy gradient (MADDPG), yielding GDM-MADDPG; for the first time, we combine the Hungarian algorithm to jointly decide exploration region selection and task offloading. Additionally, we design a Lyapunov-driven, slot-based stability optimization paradigm. Experimental results demonstrate a 37.2% improvement in task offloading efficiency, a 41.5% reduction in end-to-end latency, and a 52.8% decrease in queue backlog volatility—effectively balancing mission completion time, energy consumption, and long-term system stability.

Technology Category

Application Category

📝 Abstract
Artificial Intelligence (AI)-driven convolutional neural networks enhance rescue, inspection, and surveillance tasks performed by low-altitude uncrewed aerial vehicles (UAVs) and ground computing nodes (GCNs) in unknown environments. However, their high computational demands often exceed a single UAV's capacity, leading to system instability, further exacerbated by the limited and dynamic resources of GCNs. To address these challenges, this paper proposes a novel cooperation framework involving UAVs, ground-embedded robots (GERs), and high-altitude platforms (HAPs), which enable resource pooling through UAV-to-GER (U2G) and UAV-to-HAP (U2H) communications to provide computing services for UAV offloaded tasks. Specifically, we formulate the multi-objective optimization problem of task assignment and exploration optimization in UAVs as a dynamic long-term optimization problem. Our objective is to minimize task completion time and energy consumption while ensuring system stability over time. To achieve this, we first employ the Lyapunov optimization technique to transform the original problem, with stability constraints, into a per-slot deterministic problem. We then propose an algorithm named HG-MADDPG, which combines the Hungarian algorithm with a generative diffusion model (GDM)-based multi-agent deep deterministic policy gradient (MADDPG) approach. We first introduce the Hungarian algorithm as a method for exploration area selection, enhancing UAV efficiency in interacting with the environment. We then innovatively integrate the GDM and multi-agent deep deterministic policy gradient (MADDPG) to optimize task assignment decisions, such as task offloading and resource allocation. Simulation results demonstrate the effectiveness of the proposed approach, with significant improvements in task offloading efficiency, latency reduction, and system stability compared to baseline methods.
Problem

Research questions and friction points this paper is trying to address.

Optimize task assignment and exploration for UAV rescue using multi-agent reinforcement learning
Minimize task completion time and energy consumption in dynamic UAV-GCN systems
Enhance system stability via resource pooling with GERs and HAPs
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generative AI enhances multi-agent reinforcement learning
UAV-GER-HAP cooperation enables resource pooling
Lyapunov optimization ensures system stability
🔎 Similar Papers
No similar papers found.
Xin Tang
Xin Tang
College of Science, Huazhong Agricultural University
pattern recognitionmachine learningdeep learning
Q
Qian Chen
School of Architecture and Transportation Engineering, Guilin University of Electronic Technology, Guilin, 541004, China
W
Wenjie Weng
Guangxi University Key Laboratory of Intelligent Networking and Scenario System (School of Information and Communication, Guilin University of Electronic Technology), Guilin, 541004, China, and also with National Engineering Laboratory for Comprehensive Transportation Big Data Application Technology (Guangxi), Nanning, 530001, China
C
Chao Jin
Guangxi University Key Laboratory of Intelligent Networking and Scenario System (School of Information and Communication, Guilin University of Electronic Technology), Guilin, 541004, China, and also with National Engineering Laboratory for Comprehensive Transportation Big Data Application Technology (Guangxi), Nanning, 530001, China
Zhang Liu
Zhang Liu
University of Colorado Boulder
Distributed SystemsNetworkingCloud ComputingStorage
Jiacheng Wang
Jiacheng Wang
Nanyang Technological University
ISACGenAILow-altitude wireless networkSemantic Communications
Geng Sun
Geng Sun
University of Wollongong
Xiaohuan Li
Xiaohuan Li
Guilin University of Electronic Technology
vehicular networksUAV networksdigital twinand generative AI
D
Dusit Niyato
College of Computing and Data Science, Nanyang Technological University, Singapore 639798