Sparse Anatomical Prompt Semi-Supervised Learning with Masked Image Modeling for CBCT Tooth Segmentation

๐Ÿ“… 2024-02-07
๐Ÿ›๏ธ IEEE International Symposium on Biomedical Imaging
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
CBCT images exhibit high inter-tooth morphological similarity, dense spatial arrangement, and ill-defined boundaries, leading to heavy reliance on manual annotations and limited segmentation accuracy. To address this under extremely low labeling budgets (10% annotation rate), we propose a novel semi-supervised dental segmentation framework. First, we introduce Masked Autoencoders (MAE) for unsupervised representation pretrainingโ€”its first application in CBCT dental segmentation. Second, we design a sparse anatomical prompting mechanism based on Graph Attention Networks (GAT) to explicitly encode inter-tooth spatial topology and boundary structure. Third, we jointly optimize sparse prompt learning with consistency regularization to improve pseudo-label reliability. Experiments demonstrate a 5.2% Dice score improvement over state-of-the-art semi-supervised methods, approaching fully supervised performance while significantly reducing annotation effort and enhancing boundary delineation accuracy.

Technology Category

Application Category

๐Ÿ“ Abstract
Accurate tooth identification and segmentation in Cone Beam Computed Tomography (CBCT) dental images can significantly enhance the efficiency and precesion of manual diagnoses performed by dentists. However, existing segmentation methods are mainly developed based on large data volumes training, on which their annotations are extremely time-consuming. Meanwhile, the teeth of each class in CBCT dental images being closely positioned, coupled with subtle inter-class differences, gives rise to the challenge of indistinct boundaries when training model with limited data. To address these challenges, this study aims to propose a task-oriented Masked Auto-Encoder paradigm to effectively utilize large amounts of unlabeled data to achieve accurate tooth segmentation with limited labeled data. Specifically, we first construct a self-supervised pre-training framework of masked auto encoder to efficiently utilize unlabeled data to enhance the network performance. Subsequently, we introduce a sparse masked prompt mechanism based on graph attention to incorporate boundary information of the teeth, aiding the network in learning the anatomical structural features of teeth. To the best of our knowledge, we are pioneering the integration of the mask pre-training paradigm into the CBCT tooth segmentation task. Extensive experiments demonstrate both the feasibility of our proposed method and the potential of the boundary prompt mechanism.
Problem

Research questions and friction points this paper is trying to address.

CBCT image segmentation
dental identification
high similarity and close arrangement
Innovation

Methods, ideas, or system contributions that make the work stand out.

Masked Autoencoder
Anatomical Hints
Semi-supervised Learning
๐Ÿ”Ž Similar Papers
No similar papers found.
Pengyu Dai
Pengyu Dai
Institute of Science Tokyo
Self-supervised learningVLM
Yafei Ou
Yafei Ou
Tokyo Institute of Technology
Medical Image AnalysisMachine LearningComputer Vision
Y
Yang Liu
Stomatological Hospital of Chongqing Medical University, Chongqing, 401147, China
Y
Yue Zhao
School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China; School of Mechanical Engineering, Zhejiang University, Zhejiang, 310058, China