π€ AI Summary
To address the challenges of behavioral pattern ambiguity and unreliable decision-making in generative models for long-horizon robotic task planning, this paper proposes a hybrid diffusion model that integrates symbolic reasoning with continuous trajectory generation. The method jointly models discrete symbolic variable diffusion and continuous action trajectory diffusion within a unified framework, enabling coordinated generation of high-level symbolic plans and low-level control trajectories, while supporting conditional synthesis from partial or complete symbolic constraints. Employing an end-to-end joint training architecture, the model achieves substantial improvements over purely generative baselines across multiple long-horizon manipulation tasks: task success rate increases by 23.6%, behavioral pattern confusion decreases by 41.2%, and planning robustness and interpretability are significantly enhanced.
π Abstract
Constructing robots to accomplish long-horizon tasks is a long-standing challenge within artificial intelligence. Approaches using generative methods, particularly Diffusion Models, have gained attention due to their ability to model continuous robotic trajectories for planning and control. However, we show that these models struggle with long-horizon tasks that involve complex decision-making and, in general, are prone to confusing different modes of behavior, leading to failure. To remedy this, we propose to augment continuous trajectory generation by simultaneously generating a high-level symbolic plan. We show that this requires a novel mix of discrete variable diffusion and continuous diffusion, which dramatically outperforms the baselines. In addition, we illustrate how this hybrid diffusion process enables flexible trajectory synthesis, allowing us to condition synthesized actions on partial and complete symbolic conditions.