Register Anything: Estimating "Corresponding Prompts" for Segment Anything Model

📅 2025-08-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the inefficiency and poor generalizability of conventional region-level image registration—whose two-stage pipeline (segmentation followed by correspondence matching) incurs high computational overhead and domain-specific dependency—this paper introduces a novel “correspondence prompting” paradigm coupled with an “inverse prompting” optimization strategy. Leveraging pre-trained promptable segmentation models (e.g., SAM), our method performs end-to-end, training-free, unsupervised registration by jointly marginalizing over spatial locations and prompt embeddings to identify semantically consistent cross-image correspondences in a single step. It enables multi-region joint optimization while preserving both structural coherence and fine-grained alignment fidelity. Evaluated on 3D medical imaging, 2D histopathology, and aerial imagery, our approach surpasses intensity-based and deep deformation-field learning methods, approaches the performance of weakly supervised alternatives, and operates entirely without annotations or task-specific training data.

Technology Category

Application Category

📝 Abstract
Establishing pixel/voxel-level or region-level correspondences is the core challenge in image registration. The latter, also known as region-based correspondence representation, leverages paired regions of interest (ROIs) to enable regional matching while preserving fine-grained capability at pixel/voxel level. Traditionally, this representation is implemented via two steps: segmenting ROIs in each image then matching them between the two images. In this paper, we simplify this into one step by directly "searching for corresponding prompts", using extensively pre-trained segmentation models (e.g., SAM) for a training-free registration approach, PromptReg. Firstly, we introduce the "corresponding prompt problem", which aims to identify a corresponding Prompt Y in Image Y for any given visual Prompt X in Image X, such that the two respectively prompt-conditioned segmentations are a pair of corresponding ROIs from the two images. Secondly, we present an "inverse prompt" solution that generates primary and optionally auxiliary prompts, inverting Prompt X into the prompt space of Image Y. Thirdly, we propose a novel registration algorithm that identifies multiple paired corresponding ROIs by marginalizing the inverted Prompt X across both prompt and spatial dimensions. Comprehensive experiments are conducted on five applications of registering 3D prostate MR, 3D abdomen MR, 3D lung CT, 2D histopathology and, as a non-medical example, 2D aerial images. Based on metrics including Dice and target registration errors on anatomical structures, the proposed registration outperforms both intensity-based iterative algorithms and learning-based DDF-predicting networks, even yielding competitive performance with weakly-supervised approaches that require fully-segmented training data.
Problem

Research questions and friction points this paper is trying to address.

Estimating corresponding prompts for Segment Anything Model
Simplifying image registration into one-step prompt search
Improving registration accuracy without training data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces corresponding prompt problem for registration
Proposes inverse prompt solution for training-free approach
Develops novel registration algorithm with marginalization
S
Shiqi Huang
Beijing Institute of Technology, Beijing, China
T
Tingfa Xu
Beijing Institute of Technology, Beijing, China
W
Wen Yan
University College London, London, UK
Dean Barratt
Dean Barratt
Professor of Medical Image Computing, University College London
Medical imagingultrasoundimage analysisprostate cancer
Y
Yipeng Hu
University College London, London, UK