Real-time Monocular 2D and 3D Perception of Endoluminal Scenes for Controlling Flexible Robotic Endoscopic Instruments

📅 2026-02-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenges of precise localization and tissue-distance perception for flexible instruments in endoluminal surgery, which critically limit procedural efficiency and safety. To overcome these limitations, we propose a monocular vision-based perception framework that integrates high-fidelity physics simulation with deep learning, enabling, for the first time, real-time joint 2D/3D pose estimation and tissue-distance measurement of continuum surgical instruments within complex endoluminal environments. The developed simulator accurately models instrument dynamics, facilitating robust training and deployment of the perception algorithm. Experimental results demonstrate that the proposed method reduces task completion time by over 70% in trajectory-following tasks while significantly enhancing both manipulation accuracy and contextual scene understanding.

Technology Category

Application Category

📝 Abstract
Endoluminal surgery offers a minimally invasive option for early-stage gastrointestinal and urinary tract cancers but is limited by surgical tools and a steep learning curve. Robotic systems, particularly continuum robots, provide flexible instruments that enable precise tissue resection, potentially improving outcomes. This paper presents a visual perception platform for a continuum robotic system in endoluminal surgery. Our goal is to utilize monocular endoscopic image-based perception algorithms to identify position and orientation of flexible instruments and measure their distances from tissues. We introduce 2D and 3D learning-based perception algorithms and develop a physically-realistic simulator that models flexible instruments dynamics. This simulator generates realistic endoluminal scenes, enabling control of flexible robots and substantial data collection. Using a continuum robot prototype, we conducted module and system-level evaluations. Results show that our algorithms improve control of flexible instruments, reducing manipulation time by over 70% for trajectory-following tasks and enhancing understanding of surgical scenarios, leading to robust endoluminal surgeries.
Problem

Research questions and friction points this paper is trying to address.

endoluminal surgery
monocular perception
flexible robotic instruments
real-time 2D/3D perception
instrument-tissue distance
Innovation

Methods, ideas, or system contributions that make the work stand out.

monocular perception
continuum robot
endoluminal surgery
physically-realistic simulation
learning-based 3D reconstruction
🔎 Similar Papers
No similar papers found.
R
Ruofeng Wei
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
K
Kai Chen
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
Y
Yui Lun Ng
Department of Mechanical Engineering, The University of Hong Kong, China
Y
Yiyao Ma
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China
J
Justin Di-Lang Ho
Department of Mechanical Engineering, The University of Hong Kong, China
H
Hon Sing Tong
Department of Mechanical Engineering, The University of Hong Kong, China
X
Xiaomei Wang
Department of Mechanical Engineering, The University of Hong Kong, China
J
Jing Dai
Department of Mechanical Engineering, The University of Hong Kong, China
Ka-Wai Kwok
Ka-Wai Kwok
Professor at The Chinese University of Hong Kong
Surgical roboticsIntra-op MRI-guided roboticsIntelligent systems and control
Q
Qi Dou
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong, China