Strategic Coordination of Drones via Short-term Distributed Optimization and Long-term Reinforcement Learning

📅 2023-11-16
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the NP-hard, decentralized optimization problem of autonomous task allocation for drone swarms in large-scale dynamic spatiotemporal environments. We propose a long-short-term collaborative decision-making framework: the long-term layer employs distributed deep reinforcement learning (DRL) to jointly optimize flight and charging policies, while the short-term layer leverages decentralized collective learning for real-time perception–navigation coordination. We introduce the first two-layer decoupled architecture that synergistically integrates DRL’s long-horizon adaptability with collective learning’s low latency and strong privacy preservation—thereby overcoming the fundamental trade-off among scalability, privacy protection, and long-term robustness. Evaluated on a real-world urban traffic dataset, our method improves task completion rate by 27.83% over state-of-the-art collective learning baselines and by 23.17% over advanced DRL baselines, while significantly enhancing energy efficiency, monitoring accuracy, and operational sustainability.
📝 Abstract
This paper addresses the problem of autonomous task allocation by a swarm of autonomous, interactive drones in large-scale, dynamic spatio-temporal environments. When each drone independently determines navigation, sensing, and recharging options to choose from such that system-wide sensing requirements are met, the collective decision-making becomes an NP-hard decentralized combinatorial optimization problem. Existing solutions face significant limitations: distributed optimization methods such as collective learning often lack long-term adaptability, while centralized deep reinforcement learning (DRL) suffers from high computational complexity, scalability and privacy concerns. To overcome these challenges, we propose a novel hybrid optimization approach that combines long-term DRL with short-term collective learning. In this approach, each drone uses DRL methods to proactively determine high-level strategies, such as flight direction and recharging behavior, while leveraging collective learning to coordinate short-term sensing and navigation tasks with other drones in a decentralized manner. Extensive experiments using datasets derived from realistic urban mobility demonstrate that the proposed solution outperforms standalone state-of-the-art collective learning and DRL approaches by $27.83%$ and $23.17%$ respectively. Our findings highlight the complementary strengths of short-term and long-term decision-making, enabling energy-efficient, accurate, and sustainable traffic monitoring through swarms of drones.
Problem

Research questions and friction points this paper is trying to address.

Autonomous task allocation for drone swarms in dynamic environments
Decentralized combinatorial optimization for collective drone decision-making
Overcoming limitations of standalone optimization and reinforcement learning methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid approach combines long-term DRL with short-term collective learning
DRL determines high-level strategies like flight direction
Collective learning coordinates short-term decentralized sensing tasks
🔎 Similar Papers
No similar papers found.