ControlMambaIR: Conditional Controls with State-Space Model for Image Restoration

📅 2025-06-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the longstanding trade-off between perceptual quality and fidelity in image restoration, this paper proposes the first conditional control framework integrating the Mamba state-space model with diffusion models. Methodologically, we innovatively incorporate the Mamba architecture into the conditioning network of diffusion models—replacing conventional CNNs and attention mechanisms—and empirically demonstrate that direct noise prediction yields superior perceptual quality while better preserving structural details. Extensive experiments on benchmarks including Rain100H/L, GoPro, and SSID show that our approach achieves significantly lower LPIPS and FID scores than state-of-the-art methods, while maintaining competitive PSNR and SSIM values. This work is the first to validate the effectiveness of Mamba-driven diffusion models for fine-grained generative control and joint optimization of perceptual quality and fidelity.

Technology Category

Application Category

📝 Abstract
This paper proposes ControlMambaIR, a novel image restoration method designed to address perceptual challenges in image deraining, deblurring, and denoising tasks. By integrating the Mamba network architecture with the diffusion model, the condition network achieves refined conditional control, thereby enhancing the control and optimization of the image generation process. To evaluate the robustness and generalization capability of our method across various image degradation conditions, extensive experiments were conducted on several benchmark datasets, including Rain100H, Rain100L, GoPro, and SSID. The results demonstrate that our proposed approach consistently surpasses existing methods in perceptual quality metrics, such as LPIPS and FID, while maintaining comparable performance in image distortion metrics, including PSNR and SSIM, highlighting its effectiveness and adaptability. Notably, ablation experiments reveal that directly noise prediction in the diffusion process achieves better performance, effectively balancing noise suppression and detail preservation. Furthermore, the findings indicate that the Mamba architecture is particularly well-suited as a conditional control network for diffusion models, outperforming both CNN- and Attention-based approaches in this context. Overall, these results highlight the flexibility and effectiveness of ControlMambaIR in addressing a range of image restoration perceptual challenges.
Problem

Research questions and friction points this paper is trying to address.

Enhancing image restoration in deraining, deblurring, and denoising tasks
Improving conditional control in diffusion models using Mamba architecture
Balancing noise suppression and detail preservation in degraded images
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mamba network with diffusion model integration
Direct noise prediction in diffusion process
Mamba as superior conditional control network
🔎 Similar Papers
No similar papers found.
C
Cheng Yang
School of Mathematical Sciences, Dalian University of Technology, Dalian, 116024, China
L
Lijing Liang
School of Mathematical Sciences, Dalian University of Technology, Dalian, 116024, China
Zhixun Su
Zhixun Su
Dalian University of Technology
Computer visionGraphicsNumerical computingComputational Geometry