POLARIS: Projection-Orthogonal Least Squares for Robust and Adaptive Inversion in Diffusion Models

📅 2025-11-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In diffusion model denoising, layer-wise accumulation of approximation errors in noise estimation degrades reconstruction quality. This work first characterizes the propagation mechanism of such errors throughout the reverse process and proposes a novel source-driven error control paradigm—replacing conventional cumulative compensation. We mathematically model the noise approximation bias at each denoising step and introduce a step-wise adaptable guidance scale ω, optimized per layer via projection-based orthogonal least squares. The method improves latent-space reconstruction fidelity with only a one-line code modification, significantly enhancing perceptual fidelity in image editing and inpainting tasks. It achieves state-of-the-art performance across multiple benchmarks without incurring additional computational overhead. The core innovation lies in the synergistic design of error溯源 modeling (i.e., tracing approximation errors to their origin) and lightweight, step-adaptive scale modulation.

Technology Category

Application Category

📝 Abstract
The Inversion-Denoising Paradigm, which is based on diffusion models, excels in diverse image editing and restoration tasks. We revisit its mechanism and reveal a critical, overlooked factor in reconstruction degradation: the approximate noise error. This error stems from approximating the noise at step t with the prediction at step t-1, resulting in severe error accumulation throughout the inversion process. We introduce Projection-Orthogonal Least Squares for Robust and Adaptive Inversion (POLARIS), which reformulates inversion from an error-compensation problem into an error-origin problem. Rather than optimizing embeddings or latent codes to offset accumulated drift, POLARIS treats the guidance scale ω as a step-wise variable and derives a mathematically grounded formula to minimize inversion error at each step. Remarkably, POLARIS improves inversion latent quality with just one line of code. With negligible performance overhead, it substantially mitigates noise approximation errors and consistently improves the accuracy of downstream tasks.
Problem

Research questions and friction points this paper is trying to address.

Addresses noise approximation error accumulation in diffusion model inversion
Reformulates inversion as error-origin problem to minimize step-wise errors
Improves inversion quality and downstream task accuracy with minimal overhead
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reformulates inversion as error-origin problem
Treats guidance scale as step-wise variable
Minimizes inversion error at each step
Wenshuo Chen
Wenshuo Chen
Shandong University undergraduate student
Generative ModelsXAI
H
Haosen Li
The Hong Kong University of Science and Technology (Guangzhou)
S
Shaofeng Liang
The Hong Kong University of Science and Technology (Guangzhou)
L
Lei Wang
Griffith University, Data61/CSIRO
H
Haozhe Jia
The Hong Kong University of Science and Technology (Guangzhou)
K
Kaishen Yuan
The Hong Kong University of Science and Technology (Guangzhou)
J
Jieming Wu
The Hong Kong University of Science and Technology (Guangzhou)
Bowen Tian
Bowen Tian
The Hong Kong University of Science and Technology (Guangzhou)
Model FusionNeural Network FunctionalsSemi-Supervised Learning
Y
Yutao Yue
The Hong Kong University of Science and Technology (Guangzhou)