Diffusion Stabilizer Policy for Automated Surgical Robot Manipulations

📅 2025-03-03
📈 Citations: 0 (Influential: 0)
🤖 AI Summary
Surgical robot automation is difficult in real-world settings because demonstration data is often noisy and contains frequent failures, which makes learned policies less robust. To address this, we propose DSP, a diffusion-based policy learning framework with a two-stage training paradigm: (1) the policy is first trained on clean data only, and (2) it is then continuously updated on a mixture of clean and perturbed data, with samples filtered by their action prediction error. DSP enables reliable training and cross-task transfer from noisy or even failed demonstration trajectories, without requiring that every demonstration be high-fidelity. Evaluated across multiple surgical simulation environments, DSP significantly improves policy accuracy and robustness: performance degradation under perturbations is reduced by 42%, and generalization surpasses existing imitation learning approaches.

📝 Abstract
Intelligent surgical robots have the potential to revolutionize clinical practice by enabling more precise and automated surgical procedures. However, the automation of such robots for surgical tasks remains under-explored compared to recent advances in household manipulation tasks. Those advances have been largely driven by (1) advanced models, such as transformers and diffusion models, and (2) large-scale data utilization. Aiming to extend these successes to the domain of surgical robotics, we propose a diffusion-based policy learning framework, called Diffusion Stabilizer Policy (DSP), which enables training with imperfect or even failed trajectories. Our approach consists of two stages. First, we train the diffusion stabilizer policy using only clean data. Second, the policy is continuously updated using a mixture of clean and perturbed data, with filtering based on the prediction error on actions. Comprehensive experiments conducted in various surgical environments demonstrate the superior performance of our method in perturbation-free settings and its robustness when handling perturbed demonstrations.
Problem

Research questions and friction points this paper is trying to address.

Automating surgical robots for precise surgical tasks.
Training with imperfect or failed surgical trajectories.
Ensuring robustness in handling perturbed surgical demonstrations.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Diffusion-based policy learning framework
Training with imperfect or failed trajectories
Two-stage training with clean and perturbed data
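
The two-stage idea can be illustrated with a minimal sketch. This is not the authors' implementation: a linear least-squares regressor stands in for the diffusion policy, stage 2 is a single filter-and-refit pass rather than a continuous update, and the function names, the `threshold` value, and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np


def fit_policy(observations, actions):
    """Stand-in for policy training: ordinary least squares (NOT a diffusion model)."""
    weights, *_ = np.linalg.lstsq(observations, actions, rcond=None)
    return weights


def prediction_error(weights, observations, actions):
    """Per-sample mean squared error between predicted and demonstrated actions."""
    return np.mean((observations @ weights - actions) ** 2, axis=1)


def two_stage_training(clean_obs, clean_act, mixed_obs, mixed_act, threshold):
    """Stage 1: train on clean data. Stage 2: filter the mixture, then refit."""
    # Stage 1: the stabilizer is trained on clean demonstrations only.
    stabilizer = fit_policy(clean_obs, clean_act)

    # Stage 2: keep only mixture samples whose demonstrated action is close
    # to what the stage-1 policy predicts (hypothetical fixed threshold).
    errors = prediction_error(stabilizer, mixed_obs, mixed_act)
    keep = errors <= threshold
    updated = fit_policy(mixed_obs[keep], mixed_act[keep])
    return updated, keep


# Synthetic example: 200 clean samples and 50 heavily perturbed ones.
rng = np.random.default_rng(0)
true_weights = rng.normal(size=(4, 2))
clean_obs = rng.normal(size=(200, 4))
clean_act = clean_obs @ true_weights
pert_obs = rng.normal(size=(50, 4))
pert_act = pert_obs @ true_weights + rng.normal(scale=5.0, size=(50, 2))

mixed_obs = np.concatenate([clean_obs, pert_obs])
mixed_act = np.concatenate([clean_act, pert_act])

policy, keep = two_stage_training(clean_obs, clean_act, mixed_obs, mixed_act, threshold=0.5)
print("clean samples kept:", int(keep[:200].sum()), "of 200")
print("perturbed samples kept:", int(keep[200:].sum()), "of 50")
```

A fixed threshold is the simplest possible filter; the summary above only states that filtering is based on action prediction error, so how the cutoff is actually chosen is left open here.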
Authors
Chonlam Ho
Shanghai Jiao Tong University - SJTU-UM JI Master student
Jianshu Hu
Shanghai Jiao Tong University
Reinforcement Learning, Robotics
Hesheng Wang
Department of Automation, Shanghai Jiao Tong University, Shanghai, China
Qi Dou
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
Yutong Ban
UM-SJTU Joint Institute, Shanghai Jiao Tong University, Shanghai, China