Primal-Dual iLQR for GPU-Accelerated Learning and Control in Legged Robots

📅 2025-06-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address high computational latency and poor scalability in state dimension and prediction horizon for real-time model predictive control (MPC) of legged robots, this paper proposes a GPU-accelerated parallel MPC framework. The core innovation is the first integration of parallel associative scan into the iterative linear-quadratic regulator (iLQR) framework, enabling concurrent parallelization across both time steps and state variables, coupled with an efficient primal-dual interior-point method for solving the Karush–Kuhn–Tucker (KKT) system. Leveraging JAX’s automatic differentiation and native GPU support, the framework enables a single controller to coordinate up to 16 robots with MPC solve times under 25 ms, and natively supports MPC-in-the-loop end-to-end reinforcement learning. Experiments demonstrate speedups of 60% over acados and 700% over crocoddyl on WB-MPC and SRBD-MPC benchmarks, respectively, while exhibiting strong scalability in state dimension and enabling large-scale parallel training across diverse environments.

Technology Category

Application Category

📝 Abstract
This paper introduces a novel Model Predictive Control (MPC) implementation for legged robot locomotion that leverages GPU parallelization. Our approach enables both temporal and state-space parallelization by incorporating a parallel associative scan to solve the primal-dual Karush-Kuhn-Tucker (KKT) system. In this way, the optimal control problem is solved in $mathcal{O}(nlog{N} + m)$ complexity, instead of $mathcal{O}(N(n + m)^3)$, where $n$, $m$, and $N$ are the dimension of the system state, control vector, and the length of the prediction horizon. We demonstrate the advantages of this implementation over two state-of-the-art solvers (acados and crocoddyl), achieving up to a 60% improvement in runtime for Whole Body Dynamics (WB)-MPC and a 700% improvement for Single Rigid Body Dynamics (SRBD)-MPC when varying the prediction horizon length. The presented formulation scales efficiently with the problem state dimensions as well, enabling the definition of a centralized controller for up to 16 legged robots that can be computed in less than 25 ms. Furthermore, thanks to the JAX implementation, the solver supports large-scale parallelization across multiple environments, allowing the possibility of performing learning with the MPC in the loop directly in GPU.
Problem

Research questions and friction points this paper is trying to address.

Accelerating legged robot MPC via GPU parallelization
Solving optimal control with primal-dual KKT system efficiently
Enabling scalable multi-robot control and GPU-based learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

GPU-accelerated primal-dual iLQR for legged robots
Parallel associative scan for KKT system solving
JAX implementation enabling large-scale parallelization
🔎 Similar Papers
No similar papers found.
L
Lorenzo Amatucci
Dynamic Legged Systems Laboratory, Istituto Italiano di Tecnologia (IIT), Genova, Italy; Dipartimento di Informatica, Bioingegneria, Robotica e Ingegneria dei Sistemi (DIBRIS), Università di Genova, Genova, Italy
J
Joao Sousa-Pinto
Dynamic Legged Systems Laboratory, Istituto Italiano di Tecnologia (IIT), Genova, Italy
Giulio Turrisi
Giulio Turrisi
Researcher at the Dynamic Legged Systems Lab, Istituto Italiano di Tecnologia
roboticsmachine learningcontrolreinforcement learninglegged robot
Dominique Orban
Dominique Orban
GERAD and Department of Mathematics and Industrial Engineering, Polytechnique Montréal, Montreal, Canada
V
V. Barasuol
Dynamic Legged Systems Laboratory, Istituto Italiano di Tecnologia (IIT), Genova, Italy
Claudio Semini
Claudio Semini
Head of the Dynamic Legged Systems Lab at Istituto Italiano di Tecnologia
roboticslocomotionquadrupedshydraulicsdynamics