Towards bridging the gap: Systematic sim-to-real transfer for diverse legged robots

📅 2025-09-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Sim-to-real transfer for legged robots suffers from insufficient motion robustness and energy efficiency, alongside heavy reliance on manual reward tuning. Method: We propose a systematic transfer framework that eliminates the need for dynamic parameter randomization. Our approach integrates a first-principles-based permanent magnet synchronous motor (PMSM) energy consumption model with deep reinforcement learning to design a compact, physics-driven reward function comprising four terms—unifiedly characterizing electrical and mechanical energy losses. Combined with bottom-up dynamical parameter identification, it enables high-fidelity alignment of energy behavior between simulation and reality. Contribution/Results: Evaluated on 13 heterogeneous legged robots, our method reduces the whole-body transport cost of ANYmal by 32% to 1.27—surpassing state-of-the-art approaches. It is the first to achieve robust locomotion while ensuring interpretability, minimal hyperparameter dependence, and high energy efficiency in policy transfer.

Technology Category

Application Category

📝 Abstract
Legged robots must achieve both robust locomotion and energy efficiency to be practical in real-world environments. Yet controllers trained in simulation often fail to transfer reliably, and most existing approaches neglect actuator-specific energy losses or depend on complex, hand-tuned reward formulations. We propose a framework that integrates sim-to-real reinforcement learning with a physics-grounded energy model for permanent magnet synchronous motors. The framework requires a minimal parameter set to capture the simulation-to-reality gap and employs a compact four-term reward with a first-principle-based energetic loss formulation that balances electrical and mechanical dissipation. We evaluate and validate the approach through a bottom-up dynamic parameter identification study, spanning actuators, full-robot in-air trajectories and on-ground locomotion. The framework is tested on three primary platforms and deployed on ten additional robots, demonstrating reliable policy transfer without randomization of dynamic parameters. Our method improves energetic efficiency over state-of-the-art methods, achieving a 32 percent reduction in the full Cost of Transport of ANYmal (value 1.27). All code, models, and datasets will be released.
Problem

Research questions and friction points this paper is trying to address.

Bridging sim-to-real transfer gap for legged robots
Addressing energy efficiency neglect in existing controllers
Overcoming complex reward formulations with compact solution
Innovation

Methods, ideas, or system contributions that make the work stand out.

Sim-to-real reinforcement learning with energy model
Compact four-term reward with energetic loss formulation
Minimal parameter set for reliable policy transfer
🔎 Similar Papers
No similar papers found.