Geometry-aware RL for Manipulation of Varying Shapes and Deformable Objects

📅 2025-02-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the challenge of modeling and controlling dexterous robotic manipulation of multi-modal objects—such as rigid bodies, ropes, and cloth—by proposing HEPi, the first SE(3)-equivariant heterogeneous geometric graph framework. HEPi unifies the 3D interaction modeling among actuators, objects, and environment, explicitly encoding functional heterogeneity and 3D geometric symmetries, while supporting multi-actuator coordination and zero-shot generalization to unseen object geometries. Technically, it integrates heterogeneous graph neural networks, SE(3)-equivariant message passing, and reinforcement learning. Evaluated on novel benchmarks—including rigid-body insertion, rope threading, and cloth hanging—HEPi achieves substantial improvements: +28.6% in average return, 3.2× higher sample efficiency, and superior cross-shape generalization, consistently outperforming both Transformer-based and homogeneous equivariant baselines.

Technology Category

Application Category

📝 Abstract
Manipulating objects with varying geometries and deformable objects is a major challenge in robotics. Tasks such as insertion with different objects or cloth hanging require precise control and effective modelling of complex dynamics. In this work, we frame this problem through the lens of a heterogeneous graph that comprises smaller sub-graphs, such as actuators and objects, accompanied by different edge types describing their interactions. This graph representation serves as a unified structure for both rigid and deformable objects tasks, and can be extended further to tasks comprising multiple actuators. To evaluate this setup, we present a novel and challenging reinforcement learning benchmark, including rigid insertion of diverse objects, as well as rope and cloth manipulation with multiple end-effectors. These tasks present a large search space, as both the initial and target configurations are uniformly sampled in 3D space. To address this issue, we propose a novel graph-based policy model, dubbed Heterogeneous Equivariant Policy (HEPi), utilizing $SE(3)$ equivariant message passing networks as the main backbone to exploit the geometric symmetry. In addition, by modeling explicit heterogeneity, HEPi can outperform Transformer-based and non-heterogeneous equivariant policies in terms of average returns, sample efficiency, and generalization to unseen objects.
Problem

Research questions and friction points this paper is trying to address.

Geometry-aware RL for manipulation
Handling varying shapes and deformable objects
Graph-based policy model for precise control
Innovation

Methods, ideas, or system contributions that make the work stand out.

Heterogeneous graph representation
SE(3) equivariant message passing
Heterogeneous Equivariant Policy (HEPi)
🔎 Similar Papers
No similar papers found.
Tai Hoang
Tai Hoang
PhD Student, Karlsruhe Institute of Technology
Machine LearningRobotics
H
Huy Le
Autonomous Learning Robots, Karlsruhe Institute of Technology; Bosch Center for Artificial Intelligence
P
P. Becker
Autonomous Learning Robots, Karlsruhe Institute of Technology
V
Vien Anh Ngo
Bosch Center for Artificial Intelligence
Gerhard Neumann
Gerhard Neumann
Professor, Karlsruhe Institute of Technology (KIT)
RoboticsMachine Learning