Query-Centric Diffusion Policy for Generalizable Robotic Assembly

📅 2025-09-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the misalignment between high-level planning and low-level control in robotic assembly tasks involving complex part interactions and high-noise environments, this paper proposes a query-centric hierarchical diffusion policy framework. The framework explicitly bridges symbolic reasoning and physical execution via a multimodal query mechanism that jointly encodes object geometry, contact points, and structured skill semantics. A point-cloud-driven diffusion policy model, integrated with query-based attention, enables robust key-component identification and goal-directed control. Evaluated on FurnitureBench in both simulation and real-world settings, the method achieves over 50% higher success rates on long-horizon tasks—such as insertion and screwing—compared to unstructured-query baselines. It demonstrates significantly improved robustness to sensory noise and enhanced cross-task generalization capability.

Technology Category

Application Category

📝 Abstract
The robotic assembly task poses a key challenge in building generalist robots due to the intrinsic complexity of part interactions and the sensitivity to noise perturbations in contact-rich settings. The assembly agent is typically designed in a hierarchical manner: high-level multi-part reasoning and low-level precise control. However, implementing such a hierarchical policy is challenging in practice due to the mismatch between high-level skill queries and low-level execution. To address this, we propose the Query-centric Diffusion Policy (QDP), a hierarchical framework that bridges high-level planning and low-level control by utilizing queries comprising objects, contact points, and skill information. QDP introduces a query-centric mechanism that identifies task-relevant components and uses them to guide low-level policies, leveraging point cloud observations to improve the policy's robustness. We conduct comprehensive experiments on the FurnitureBench in both simulation and real-world settings, demonstrating improved performance in skill precision and long-horizon success rate. In the challenging insertion and screwing tasks, QDP improves the skill-wise success rate by over 50% compared to baselines without structured queries.
Problem

Research questions and friction points this paper is trying to address.

Addresses hierarchical policy mismatch between high-level reasoning and low-level control
Improves robotic assembly robustness in contact-rich, noise-sensitive environments
Enhances skill precision and long-horizon success rates for complex assembly tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical framework bridging high-level planning and low-level control
Query-centric mechanism identifying task-relevant components for policy guidance
Leverages point cloud observations to improve policy robustness
🔎 Similar Papers
No similar papers found.
Ziyi Xu
Ziyi Xu
École Polytechnique Fédérale de Lausanne (EPFL)
H
Haohong Lin
Department of Mechanical Engineering, Carnegie Mellon University, Pittsburgh, PA 15213 USA
S
Shiqi Liu
Department of Mechanical Engineering, Carnegie Mellon University, Pittsburgh, PA 15213 USA
Ding Zhao
Ding Zhao
Carnegie Mellon University
Trustworthy AIAI safetyreinforcement learningautonomous vehiclesrobotics