Motion Manifold Flow Primitives for Task-Conditioned Trajectory Generation under Complex Task-Motion Dependencies

📅 2024-07-29
📈 Citations: 1
Influential: 1
📄 PDF
🤖 AI Summary
Existing motion generation methods suffer from performance degradation under large task variations and high action diversity, particularly failing to model the strong coupling between linguistic instructions and complex motions—leading to distributional shift and imprecise trajectories. To address this, we propose a novel paradigm that decouples motion manifold learning from task-conditioned modeling. Specifically, we introduce continuous normalizing flow matching (CNF-based flow matching) into the motion latent space for the first time, integrated with a manifold-aware dimensionality-reduction encoder and a task-embedding-conditioned latent space design. This yields an end-to-end language–motion alignment framework supporting many-to-many text–action mappings, significantly enhancing robustness and fine-grained semantic responsiveness. On a multilingual guided trajectory generation benchmark, our method reduces Fréchet Inception Distance (FID) by 32% and improves motion diversity by 2.1×, comprehensively surpassing state-of-the-art approaches.

Technology Category

Application Category

📝 Abstract
Effective movement primitives should be capable of encoding and generating a rich repertoire of trajectories -- typically collected from human demonstrations -- conditioned on task-defining parameters such as vision or language inputs. While recent methods based on the motion manifold hypothesis, which assumes that a set of trajectories lies on a lower-dimensional nonlinear subspace, address challenges such as limited dataset size and the high dimensionality of trajectory data, they often struggle to capture complex task-motion dependencies, i.e., when motion distributions shift drastically with task variations. To address this, we introduce Motion Manifold Flow Primitives (MMFP), a framework that decouples the training of the motion manifold from task-conditioned distributions. Specifically, we employ flow matching models, state-of-the-art conditional deep generative models, to learn task-conditioned distributions in the latent coordinate space of the learned motion manifold. Experiments are conducted on language-guided trajectory generation tasks, where many-to-many text-motion correspondences introduce complex task-motion dependencies, highlighting MMFP's superiority over existing methods.
Problem

Research questions and friction points this paper is trying to address.

Action Generation
Task Variation
Language Instructions
Innovation

Methods, ideas, or system contributions that make the work stand out.

MMFP
Diverse Action Manifold Learning
Language-guided Complex Tasks
🔎 Similar Papers
No similar papers found.
Yonghyeon Lee
Yonghyeon Lee
Postdoctoral Associate @ MIT
Geometric Data AnalysisMachine LearningRobotics
Byeongho Lee
Byeongho Lee
Robotics Laboratory, Seoul National University, Seoul 08826, South Korea
S
Seungyeon Kim
Robotics Laboratory, Seoul National University, Seoul 08826, South Korea
F
F. Park
Robotics Laboratory, Seoul National University, Seoul 08826, South Korea