Register Anything: Estimating "Corresponding Prompts" for Segment Anything Model

📅 2025-08-03

📈 Citations: 0

✨ Influential: 0

career value

198K/year

🤖 AI Summary

To address the inefficiency and poor generalizability of conventional region-level image registration—whose two-stage pipeline (segmentation followed by correspondence matching) incurs high computational overhead and domain-specific dependency—this paper introduces a novel “correspondence prompting” paradigm coupled with an “inverse prompting” optimization strategy. Leveraging pre-trained promptable segmentation models (e.g., SAM), our method performs end-to-end, training-free, unsupervised registration by jointly marginalizing over spatial locations and prompt embeddings to identify semantically consistent cross-image correspondences in a single step. It enables multi-region joint optimization while preserving both structural coherence and fine-grained alignment fidelity. Evaluated on 3D medical imaging, 2D histopathology, and aerial imagery, our approach surpasses intensity-based and deep deformation-field learning methods, approaches the performance of weakly supervised alternatives, and operates entirely without annotations or task-specific training data.

Technology Category

Application Category

📝 Abstract

Establishing pixel/voxel-level or region-level correspondences is the core challenge in image registration. The latter, also known as region-based correspondence representation, leverages paired regions of interest (ROIs) to enable regional matching while preserving fine-grained capability at pixel/voxel level. Traditionally, this representation is implemented via two steps: segmenting ROIs in each image then matching them between the two images. In this paper, we simplify this into one step by directly "searching for corresponding prompts", using extensively pre-trained segmentation models (e.g., SAM) for a training-free registration approach, PromptReg. Firstly, we introduce the "corresponding prompt problem", which aims to identify a corresponding Prompt Y in Image Y for any given visual Prompt X in Image X, such that the two respectively prompt-conditioned segmentations are a pair of corresponding ROIs from the two images. Secondly, we present an "inverse prompt" solution that generates primary and optionally auxiliary prompts, inverting Prompt X into the prompt space of Image Y. Thirdly, we propose a novel registration algorithm that identifies multiple paired corresponding ROIs by marginalizing the inverted Prompt X across both prompt and spatial dimensions. Comprehensive experiments are conducted on five applications of registering 3D prostate MR, 3D abdomen MR, 3D lung CT, 2D histopathology and, as a non-medical example, 2D aerial images. Based on metrics including Dice and target registration errors on anatomical structures, the proposed registration outperforms both intensity-based iterative algorithms and learning-based DDF-predicting networks, even yielding competitive performance with weakly-supervised approaches that require fully-segmented training data.

Problem

Research questions and friction points this paper is trying to address.

Estimating corresponding prompts for Segment Anything Model

Simplifying image registration into one-step prompt search

Improving registration accuracy without training data

Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces corresponding prompt problem for registration

Proposes inverse prompt solution for training-free approach

Develops novel registration algorithm with marginalization

🔎 Similar Papers

EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model