Multi-Agent Reinforcement Learning for UAV-Based Chemical Plume Source Localization

πŸ“… 2026-03-12
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This study addresses the challenge of efficiently detecting and localizing toxic gases, such as methane, emanating from abandoned wellheadsβ€”a task poorly suited to conventional methods. To this end, the authors propose a multi-agent deep reinforcement learning (MARL)-based framework for cooperative drone sensing. The approach introduces a virtual anchor mechanism to coordinate multiple unmanned aerial vehicles in conducting simultaneous in situ measurements of gas concentration and wind velocity. By analyzing historical trajectories of these virtual anchors, the system achieves precise source localization. Compared to traditional fluxotaxis-based techniques, the proposed framework demonstrates significant improvements in both localization accuracy and operational efficiency, offering a scalable and effective solution for gas leak monitoring in complex environments.

Technology Category

Application Category

πŸ“ Abstract
Undocumented orphaned wells pose significant health and environmental risks to nearby communities by releasing toxic gases and contaminating water sources, with methane emissions being a primary concern. Traditional survey methods such as magnetometry often fail to detect older wells effectively. In contrast, aerial in-situ sensing using unmanned aerial vehicles (UAVs) offers a promising alternative for methane emission detection and source localization. This study presents a robust and efficient framework based on a multi-agent deep reinforcement learning (MARL) algorithm for the chemical plume source localization (CPSL) problem. The proposed approach leverages virtual anchor nodes to coordinate UAV navigation, enabling collaborative sensing of gas concentrations and wind velocities through onboard and shared measurements. Source identification is achieved by analyzing the historical trajectory of anchor node placements within the plume. Comparative evaluations against the fluxotaxis method demonstrate that the MARL framework achieves superior performance in both localization accuracy and operational efficiency.
Problem

Research questions and friction points this paper is trying to address.

chemical plume source localization
orphaned wells
methane emissions
UAV-based sensing
multi-agent reinforcement learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-agent reinforcement learning
UAV-based sensing
chemical plume source localization
virtual anchor nodes
collaborative navigation
πŸ”Ž Similar Papers
No similar papers found.
Z
Zhirun Li
Department of Electrical and Computer Engineering, University of New Mexico, Albuquerque, NM 87131 USA
D
Derek Hollenbeck
Department of Mechanical Engineering, University of California, Merced, CA 95343 USA
R
Ruikun Wu
Department of Electrical Engineering, Colorado School of Mines, Golden, CO 80401 USA
M
Michelle Sherman
Department of Electrical Engineering, New Mexico Tech, Socorro, NM 87801 USA
Sihua Shao
Sihua Shao
Colorado School of Mines, Assistant Professor
Wireless Communications and Networks
Xiang Sun
Xiang Sun
Professor of Economics, Wuhan University
Matching and Market DesignGame Theory and Information EconomicsSocial and Economic Networks
M
Mostafa Hassanalian
Department of Mechanical Engineering, New Mexico Tech, Socorro, NM 87801 USA