Autonomous Legged Mobile Manipulation for Lunar Surface Operations via Constrained Reinforcement Learning

📅 2025-10-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address safety and precision challenges in autonomous operation of quadrupedal mobile manipulators on the Moon’s low-gravity, unstructured terrain, this paper proposes an end-to-end constrained reinforcement learning framework integrating dynamic stability, real-time obstacle avoidance, and energy consumption constraints. Methodologically, it combines whole-body control, 6D task-space trajectory tracking, and high-fidelity lunar surface environment modeling for policy training and validation in simulation. To our knowledge, this is the first work achieving provably safe, integrated locomotion-manipulation control under strict lunar gravity conditions. Experiments demonstrate an average end-effector positioning accuracy of 4 cm and orientation accuracy of 8.1°, while rigorously satisfying hard constraints—including collision avoidance, ZMP-based stability, and energy budget limits—throughout execution. The approach significantly enhances reliability and adaptability for complex lunar surface missions.

Technology Category

Application Category

📝 Abstract
Robotics plays a pivotal role in planetary science and exploration, where autonomous and reliable systems are crucial due to the risks and challenges inherent to space environments. The establishment of permanent lunar bases demands robotic platforms capable of navigating and manipulating in the harsh lunar terrain. While wheeled rovers have been the mainstay for planetary exploration, their limitations in unstructured and steep terrains motivate the adoption of legged robots, which offer superior mobility and adaptability. This paper introduces a constrained reinforcement learning framework designed for autonomous quadrupedal mobile manipulators operating in lunar environments. The proposed framework integrates whole-body locomotion and manipulation capabilities while explicitly addressing critical safety constraints, including collision avoidance, dynamic stability, and power efficiency, in order to ensure robust performance under lunar-specific conditions, such as reduced gravity and irregular terrain. Experimental results demonstrate the framework's effectiveness in achieving precise 6D task-space end-effector pose tracking, achieving an average positional accuracy of 4 cm and orientation accuracy of 8.1 degrees. The system consistently respects both soft and hard constraints, exhibiting adaptive behaviors optimized for lunar gravity conditions. This work effectively bridges adaptive learning with essential mission-critical safety requirements, paving the way for advanced autonomous robotic explorers for future lunar missions.
Problem

Research questions and friction points this paper is trying to address.

Developing autonomous legged robots for lunar surface navigation and manipulation
Addressing safety constraints like collision avoidance and stability in space
Achieving precise end-effector control under lunar gravity conditions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Constrained reinforcement learning for quadrupedal mobile manipulators
Integrates whole-body locomotion with manipulation capabilities
Addresses safety constraints like collision avoidance and stability
🔎 Similar Papers
No similar papers found.
A
Alvaro Belmonte-Baeza
Department of Computer Sciences and Artificial Intelligence, University of Alicante, Spain
M
Miguel Cazorla
Department of Computer Sciences and Artificial Intelligence, University of Alicante, Spain
G
Gabriel J. Garc'ia
Department of Physics, Systems Engineering, and Signal Theory, University of Alicante, Spain
C
Carlos J. P'erez-Del-Pulgar
Department of Systems and Automatics Engineering, University of M'alaga, Spain
Jorge Pomares
Jorge Pomares
Full Professor. University of Alicante, Spain
Space roboticsrobot controlvisual servoing