🤖 AI Summary
This work addresses the challenge of reliably and scalably representing collision-free configuration spaces in high-dimensional, complex environments. The authors propose the ILD framework, which uniquely integrates invertible latent-space mapping with explicit modeling of free space as a union of convex polytopes. Specifically, an invertible neural network maps the configuration space to a latent space, where the free region is explicitly represented as a union of convex sets. Visibility-guided sampling ensures connectivity among these sets, and paths planned in the latent space are decoded back to the original space via the invertible mapping, guaranteeing strict feasibility without false positives. Experiments demonstrate that the method significantly improves coverage, connectivity, and planning success across scenarios ranging from 2D to 14 degrees of freedom, supports real-time planning, and adapts robustly to geometric changes in real-world environments.
📝 Abstract
Collision-free path planning in cluttered, real-world environments relies on a representation of the collision-free space, and existing representations broadly fall into two
categories. Explicit representations, such as unions of convex sets, can be plugged into optimization-based planners as hard collision-free constraints, but their parameters scale
poorly with configuration-space dimension. Implicit representations, by contrast, are flexible and scale well to complex geometries, yet typically lack such guarantees. We bridge this
gap with ILD (Invertible Latent Decomposition), a framework that jointly learns an invertible mapping and a union of explicit convex polytopes in the resulting latent space. Planning
is carried out over these latent convex sets, and the invertible mapping decodes the resulting paths back to the original configuration space while preserving feasibility with
respect to the refined explicit safe regions. We further propose Visibility-Guided Sampling (VGS) to keep the convex sets connected for path planning. Across 2D navigation, 6-DoF, and
14-DoF manipulation environments, ILD achieves broader coverage, better inter-set connectivity, and higher path-planning success rates than prior baselines, with zero observed false
positives after test-time refinement. On a 14-DoF bimanual manipulator, we further demonstrate real-time collision-free planning, with test-time refinement adapting to scene-geometry
changes during real-world deployment on a single 6-DoF arm.