Modification Takes Courage: Seamless Image Stitching via Reference-Driven Inpainting

πŸ“… 2024-11-15
πŸ›οΈ arXiv.org
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address prominent seam artifacts in image stitching caused by inconsistent color tones and large disparities, this paper proposes a reference-driven stitching rectification paradigm that unifies fusion and geometric correction as a reference-guided image inpainting taskβ€”thereby expanding the editable region and enhancing local adaptability. Methodologically, we introduce the first self-supervised fine-tuned text-to-image diffusion model for stitching (requiring no annotated data) and establish the first multimodal large language model (MLLM)-based metric for stitching quality assessment, jointly incorporating semantic understanding and visual fidelity. Experiments demonstrate that our approach significantly outperforms state-of-the-art methods in content consistency, transition naturalness, and zero-shot generalization, particularly under challenging scenarios. This work provides a novel, principled framework for high-fidelity image stitching.

Technology Category

Application Category

πŸ“ Abstract
Current image stitching methods often produce noticeable seams in challenging scenarios such as uneven hue and large parallax. To tackle this problem, we propose the Reference-Driven Inpainting Stitcher (RDIStitcher), which reformulates the image fusion and rectangling as a reference-based inpainting model, incorporating a larger modification fusion area and stronger modification intensity than previous methods. Furthermore, we introduce a self-supervised model training method, which enables the implementation of RDIStitcher without requiring labeled data by fine-tuning a Text-to-Image (T2I) diffusion model. Recognizing difficulties in assessing the quality of stitched images, we present the Multimodal Large Language Models (MLLMs)-based metrics, offering a new perspective on evaluating stitched image quality. Compared to the state-of-the-art (SOTA) method, extensive experiments demonstrate that our method significantly enhances content coherence and seamless transitions in the stitched images. Especially in the zero-shot experiments, our method exhibits strong generalization capabilities. Code: https://github.com/yayoyo66/RDIStitcher
Problem

Research questions and friction points this paper is trying to address.

Addresses noticeable seams in image stitching due to uneven hue and parallax.
Proposes a reference-based inpainting model for seamless image fusion.
Introduces self-supervised training and MLLM-based metrics for quality evaluation.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reference-Driven Inpainting for seamless stitching
Self-supervised training using Text-to-Image diffusion
MLLMs-based metrics for image quality evaluation
πŸ”Ž Similar Papers
No similar papers found.
Z
Ziqi Xie
Tongji University, Shanghai
X
Xiao Lai
Tongji University, Shanghai
Weidong Zhao
Weidong Zhao
Shandong University
Numerical analysis
X
Xianhui Liu
Tongji University, Shanghai
W
Wenlong Hou
Tongji University, Shanghai