Surgeons vs. Computer Vision: A comparative analysis on surgical phase recognition capabilities

πŸ“… 2025-04-26
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Robotic-assisted partial nephrectomy (RAPN) exhibits high procedural nonlinearity, posing significant challenges for surgical phase recognition (SPR). This study systematically compares surgeon versus computer vision performance in SPR, specifically investigating how temporal context and visual landmarks influence human and machine discriminative capabilities. Method: We conduct the first synchronous human–AI evaluation in a nonlinear surgical setting. Leveraging Cholec80 pretraining, we develop CNN and CNN-LSTM models, and build a custom web platform supporting multi-level expert annotation and confidence scoring. Contribution/Results: Integrating video segments with key visual landmarks significantly improves recognition accuracy. Our AI models achieve performance comparable to experienced surgeons. Incorporating temporal modeling boosts accuracy for both humans and AI, demonstrating synergistic gains from combining temporal information with anatomical and instrument-based cues. These findings underscore the critical role of temporal dynamics in nonlinear surgical workflows and provide empirical evidence for context-aware SPR systems.

Technology Category

Application Category

πŸ“ Abstract
Purpose: Automated Surgical Phase Recognition (SPR) uses Artificial Intelligence (AI) to segment the surgical workflow into its key events, functioning as a building block for efficient video review, surgical education as well as skill assessment. Previous research has focused on short and linear surgical procedures and has not explored if temporal context influences experts' ability to better classify surgical phases. This research addresses these gaps, focusing on Robot-Assisted Partial Nephrectomy (RAPN) as a highly non-linear procedure. Methods: Urologists of varying expertise were grouped and tasked to indicate the surgical phase for RAPN on both single frames and video snippets using a custom-made web platform. Participants reported their confidence levels and the visual landmarks used in their decision-making. AI architectures without and with temporal context as trained and benchmarked on the Cholec80 dataset were subsequently trained on this RAPN dataset. Results: Video snippets and presence of specific visual landmarks improved phase classification accuracy across all groups. Surgeons displayed high confidence in their classifications and outperformed novices, who struggled discriminating phases. The performance of the AI models is comparable to the surgeons in the survey, with improvements when temporal context was incorporated in both cases. Conclusion: SPR is an inherently complex task for expert surgeons and computer vision, where both perform equally well when given the same context. Performance increases when temporal information is provided. Surgical tools and organs form the key landmarks for human interpretation and are expected to shape the future of automated SPR.
Problem

Research questions and friction points this paper is trying to address.

Compares human surgeons and AI in surgical phase recognition
Explores impact of temporal context on phase classification accuracy
Focuses on non-linear Robot-Assisted Partial Nephrectomy procedures
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI models trained with temporal context
Custom web platform for expert annotation
Focus on non-linear surgical procedures
πŸ”Ž Similar Papers
No similar papers found.
M
Marco Mezzina
Orsi Academy, Belgium; Faculty of Engineering Science, KU Leuven, Belgium
P
Pieter De Backer
Orsi Academy, Belgium; Department of Urology, OLV Aalst Hospital, Belgium
Tom Vercauteren
Tom Vercauteren
Professor of Interventional Image Computing, King's College London
Medical Image ComputingImage RegistrationComputer-assisted InterventionsEndomicroscopyImage-guided Interventions
Matthew Blaschko
Matthew Blaschko
KU Leuven
Machine LearningComputer VisionMedical Image Analysis
A
Alexandre Mottrie
Orsi Academy, Belgium; Department of Urology, OLV Aalst Hospital, Belgium
Tinne Tuytelaars
Tinne Tuytelaars
KU Leuven - PSI, Belgium
computer visioncontinual learning