Reinforcement Learning for Enhancing Sensing Estimation in Bistatic ISAC Systems with UAV Swarms

📅 2025-01-11

📈 Citations: 0

✨ Influential: 0

career value

205K/year

🤖 AI Summary

To address low environmental perception accuracy and insufficient communication-sensing coordination in bistatic ISAC systems operating with UAV swarms, this paper proposes a multi-agent reinforcement learning framework based on centralized training with decentralized execution (CTDE). The method jointly optimizes trajectory planning and adaptive power control across distributed radar nodes while implicitly learning a lightweight communication protocol to enable tight coupling between sensing and communication. We model the problem as a partially observable Markov decision process (POMDP) and employ the MAPPO algorithm, integrating decentralized policy optimization with dynamic power adaptation. Experimental results demonstrate that, under diverse dynamic scenarios, the approach reduces target localization error by 32% and mitigates link interference by over 35%, significantly enhancing system robustness and scalability. This work establishes a novel paradigm for large-scale UAV-based ISAC deployment.

Technology Category

Application Category

📝 Abstract

This paper introduces a novel Multi-Agent Reinforcement Learning (MARL) framework to enhance integrated sensing and communication (ISAC) networks using unmanned aerial vehicle (UAV) swarms as sensing radars. By framing the positioning and trajectory optimization of UAVs as a Partially Observable Markov Decision Process, we develop a MARL approach that leverages centralized training with decentralized execution to maximize the overall sensing performance. Specifically, we implement a decentralized cooperative MARL strategy to enable UAVs to develop effective communication protocols, therefore enhancing their environmental awareness and operational efficiency. Additionally, we augment the MARL solution with a transmission power adaptation technique to mitigate interference between the communicating drones and optimize the communication protocol efficiency. Moreover, a transmission power adaptation technique is incorporated to mitigate interference and optimize the learned communication protocol efficiency. Despite the increased complexity, our solution demonstrates robust performance and adaptability across various scenarios, providing a scalable and cost-effective enhancement for future ISAC networks.

Problem

Research questions and friction points this paper is trying to address.

Unmanned Aerial Vehicle (UAV) Teams

Dual-Station ISAC System

Environmental Perception and Judgment Enhancement

Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-Agent Reinforcement Learning (MARL)

Power Control Optimization

ISAC System Efficiency

🔎 Similar Papers

No similar papers found.