HOLa: HoloLens Object Labeling

📅 2024-12-01

🏛️ Current Directions in Biomedical Engineering

📈 Citations: 0

✨ Influential: 0

career value

196K/year

🤖 AI Summary

To address the labor-intensive and inefficient manual annotation of surgical objects in medical augmented reality (AR), this paper proposes the first fully automated, real-time mask generation method deployed natively on the HoloLens 2. Our approach innovatively integrates the zero-shot, parameter-free SAM-Track algorithm into a Unity-Python hybrid framework, enabling streaming segmentation and AR annotation with cross-scenario generalization. It requires no model training or hyperparameter tuning—only a single-frame initialization suffices for continuous tracking. Evaluated on open hepatic surgery and anatomical phantom datasets, our method achieves annotation speeds over 500× faster than manual labeling, with Dice scores ranging from 0.875 to 0.982—comparable in quality to expert annotations. This work constitutes the first empirical validation of real-time segmentation feasibility for SAM-family models on resource-constrained AR headsets, establishing a new paradigm for efficient, robust, and fully automated surgical object annotation in clinical AR navigation.

Technology Category

Application Category

📝 Abstract

In the context of medical Augmented Reality (AR) applications, object tracking is a key challenge and requires a significant amount of annotation masks. As segmentation foundation models like the Segment Anything Model (SAM) begin to emerge, zero-shot segmentation requires only minimal human participation obtaining high-quality object masks. We introduce a HoloLens-Object-Labeling (HOLa) Unity and Python application based on the SAM-Track algorithm that offers fully automatic single object annotation for HoloLens 2 while requiring minimal human participation. HOLa does not have to be adjusted to a specific image appearance and could thus alleviate AR research in any application field.We evaluate HOLa for different degrees of image complexity in open liver surgery and in medical phantom experiments. Using HOLa for image annotation can increase the labeling speed by more than 500 times while providing Dice scores between 0.875 and 0.982, which are comparable to human annotators. Our code is publicly available at: https://github.com/mschwimmbeck/HOLa.

Problem

Research questions and friction points this paper is trying to address.

Augmented Reality

Medical Application

Object Tracking

Innovation

Methods, ideas, or system contributions that make the work stand out.

HoloLens 2

Segment Anything Model (SAM)

Automated Annotation

🔎 Similar Papers

HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction

2024-06-10arXiv.orgCitations: 0

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

2024-03-14arXiv.orgCitations: 16