Transformer-Based Reinforcement Learning for Autonomous Orbital Collision Avoidance in Partially Observable Environments

📅 2026-02-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of insufficient reliability in autonomous orbital collision avoidance within partially observable and sparsely monitored space environments. To this end, the authors propose a Transformer-based reinforcement learning framework that, for the first time, integrates the Transformer architecture into Partially Observable Markov Decision Process (POMDP) modeling. By leveraging the Transformer’s long-range temporal attention mechanism alongside a distance-dependent observation model, a sequential state estimator, and a custom-designed conjunction simulation environment, the method effectively handles high-noise, intermittent space observation data. Experimental results demonstrate that the proposed approach significantly outperforms conventional strategies under incomplete observational conditions, achieving more robust and reliable autonomous collision avoidance capabilities.

Technology Category

Application Category

📝 Abstract
We introduce a Transformer-based Reinforcement Learning framework for autonomous orbital collision avoidance that explicitly models the effects of partial observability and imperfect monitoring in space operations. The framework combines a configurable encounter simulator, a distance-dependent observation model, and a sequential state estimator to represent uncertainty in relative motion. A central contribution of this work is the use of transformer-based Partially Observable Markov Decision Process (POMDP) architecture, which leverage long-range temporal attention to interpret noisy and intermittent observations more effectively than traditional architectures. This integration provides a foundation for training collision avoidance agents that can operate more reliably under imperfect monitoring environments.
Problem

Research questions and friction points this paper is trying to address.

orbital collision avoidance
partially observable environments
autonomous space operations
imperfect monitoring
uncertainty in relative motion
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer
Reinforcement Learning
Partially Observable Environments
Orbital Collision Avoidance
POMDP
🔎 Similar Papers
No similar papers found.
T
Thomas Georges
Université Paris-Saclay, CentraleSupélec Engineering School, Department of Industrial Engineering, Gif-sur-Yvette, 91190, France
Adam Abdin
Adam Abdin
Associate Professor, PhD. - Engineering of Complex Systems
Operations ResearchOperations ManagementReinforcement LearningRobust OptimizationResilience