🤖 AI Summary
This work addresses the cumbersome and inefficient model selection and hyperparameter tuning processes that hinder practical deployment in 3D biomedical image analysis. We propose a two-stage Bayesian optimization–driven automated pipeline that jointly optimizes segmentation models, post-processing parameters, and classifier architectures along with pretraining strategies. A key innovation is the introduction of a segmentation-quality–oriented evaluation metric as the optimization objective, coupled with an auxiliary pseudo-labeling mechanism derived from segmentation outputs to substantially reduce manual annotation effort. By integrating domain-adapted synthetic benchmark data, encoder–classifier head architecture search, and prior knowledge–guided pretraining, our method efficiently identifies optimal configurations across four case studies, demonstrating both effectiveness and practical utility.
📝 Abstract
Deep learning-based segmentation and classification are crucial to large-scale biomedical imaging, particularly for 3D data, where manual analysis is impractical. Although many methods exist, selecting suitable models and tuning parameters remains a major bottleneck in practice. Hence, we introduce the 3D data Analysis Optimization Pipeline, a method designed to facilitate the design and parameterization of segmentation and classification using two Bayesian Optimization stages. First, the pipeline selects a segmentation model and optimizes postprocessing parameters using a domain-adapted syntactic benchmark dataset. To ensure a concise evaluation of segmentation performance, we introduce a segmentation quality metric that serves as the objective function. Second, the pipeline optimizes design choices of a classifier, such as encoder and classifier head architectures, incorporation of prior knowledge, and pretraining strategies. To reduce manual annotation effort, this stage includes an assisted class-annotation workflow that extracts predicted instances from the segmentation results and sequentially presents them to the operator, eliminating the need for manual tracking. In four case studies, the 3D data Analysis Optimization Pipeline efficiently identifies effective model and parameter configurations for individual datasets.