Motion Manifold Flow Primitives for Task-Conditioned Trajectory Generation under Complex Task-Motion Dependencies

📅 2024-07-29

📈 Citations: 1

✨ Influential: 1

🤖 AI Summary

Existing motion generation methods suffer from performance degradation under large task variations and high action diversity, particularly failing to model the strong coupling between linguistic instructions and complex motions—leading to distributional shift and imprecise trajectories. To address this, we propose a novel paradigm that decouples motion manifold learning from task-conditioned modeling. Specifically, we introduce continuous normalizing flow matching (CNF-based flow matching) into the motion latent space for the first time, integrated with a manifold-aware dimensionality-reduction encoder and a task-embedding-conditioned latent space design. This yields an end-to-end language–motion alignment framework supporting many-to-many text–action mappings, significantly enhancing robustness and fine-grained semantic responsiveness. On a multilingual guided trajectory generation benchmark, our method reduces Fréchet Inception Distance (FID) by 32% and improves motion diversity by 2.1×, comprehensively surpassing state-of-the-art approaches.

Technology Category

Application Category

📝 Abstract

Effective movement primitives should be capable of encoding and generating a rich repertoire of trajectories -- typically collected from human demonstrations -- conditioned on task-defining parameters such as vision or language inputs. While recent methods based on the motion manifold hypothesis, which assumes that a set of trajectories lies on a lower-dimensional nonlinear subspace, address challenges such as limited dataset size and the high dimensionality of trajectory data, they often struggle to capture complex task-motion dependencies, i.e., when motion distributions shift drastically with task variations. To address this, we introduce Motion Manifold Flow Primitives (MMFP), a framework that decouples the training of the motion manifold from task-conditioned distributions. Specifically, we employ flow matching models, state-of-the-art conditional deep generative models, to learn task-conditioned distributions in the latent coordinate space of the learned motion manifold. Experiments are conducted on language-guided trajectory generation tasks, where many-to-many text-motion correspondences introduce complex task-motion dependencies, highlighting MMFP's superiority over existing methods.

Problem

Research questions and friction points this paper is trying to address.

Action Generation

Task Variation

Language Instructions

Innovation

Methods, ideas, or system contributions that make the work stand out.

MMFP

Diverse Action Manifold Learning

Language-guided Complex Tasks

🔎 Similar Papers

No similar papers found.

Authors to Follow