FlowIID: Single-Step Intrinsic Image Decomposition via Latent Flow Matching

📅 2026-01-18
🤖 AI Summary
This work addresses the high parameter counts of existing intrinsic image decomposition methods, which hinder deployment in resource-constrained or real-time settings. To this end, we introduce flow matching, a technique previously unexplored in this domain, and propose a novel single-step decomposition framework operating in a variational autoencoder (VAE)-guided latent space. By jointly optimizing the latent representation and the flow matching module, our method efficiently and stably disentangles albedo and shading in a single inference step. Extensive experiments on multiple benchmark datasets demonstrate that the proposed model achieves decomposition performance comparable to or better than state-of-the-art approaches while using significantly fewer parameters. This balance of efficiency and accuracy makes our approach well suited to real-time and lightweight applications.
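As a minimal illustration of what "disentangling albedo and shading" means, the sketch below assumes the common multiplicative image formation model, image = albedo * shading. The summary does not state which formation model FlowIID uses, so this model, the function names `compose` and `shading_from_albedo`, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np

def compose(albedo, shading):
    """Recombine components under the ASSUMED model image = albedo * shading."""
    return albedo * shading

def shading_from_albedo(image, albedo, eps=1e-6):
    """Recover shading per channel once albedo is known (assumed model only)."""
    return image / np.maximum(albedo, eps)

# Synthetic example: RGB albedo and single-channel (grayscale) shading.
rng = np.random.default_rng(0)
albedo = rng.uniform(0.2, 1.0, size=(4, 4, 3))
shading = rng.uniform(0.1, 1.0, size=(4, 4, 1))

image = compose(albedo, shading)               # broadcasts shading over RGB
recovered = shading_from_albedo(image, albedo)  # shape (4, 4, 3)
```

Under this assumption, predicting one component determines the other, which is why a decomposition model can be judged by how well the recombined components reproduce the input image.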

📝 Abstract
Intrinsic Image Decomposition (IID) separates an image into albedo and shading components. It is a core step in many real-world applications, such as relighting and material editing. Existing IID models achieve good results but often use a large number of parameters, which makes them costly to combine with other models in real-world settings. To address this problem, we propose FlowIID, a novel architecture based on latent flow matching. FlowIID combines a VAE-guided latent space with a flow matching module, enabling a stable decomposition of albedo and shading. FlowIID is not only parameter-efficient but also produces results in a single inference step. Despite its compact design, FlowIID delivers results that are competitive with or superior to those of existing models across various benchmarks. This makes it well suited for deployment in resource-constrained and real-time vision applications.
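To make "a single inference step" concrete, the sketch below shows generic flow matching sampling with an Euler integrator, where setting `num_steps=1` gives one-step inference. It is not the FlowIID implementation: the abstract does not describe the network, the conditioning, or the integrator, so `euler_sample` and `toy_velocity` are hypothetical names, and the toy velocity field stands in for a learned model by pointing along a straight path toward a known target.

```python
import numpy as np

def euler_sample(velocity_fn, z0, cond, num_steps=1):
    """Integrate dz/dt = v(z, t, cond) from t=0 to t=1 with Euler steps.

    With num_steps=1 this reduces to z1 = z0 + v(z0, 0, cond).
    """
    z = z0
    dt = 1.0 / num_steps
    for k in range(num_steps):
        t = k * dt  # start time of the current step, always < 1
        z = z + dt * velocity_fn(z, t, cond)
    return z

def toy_velocity(z, t, cond):
    """Hypothetical stand-in for a learned velocity network.

    For a straight-line path from z0 to a target, the velocity at time t
    is (target - z_t) / (1 - t). Here the target is simply `cond`.
    """
    return (cond - z) / (1.0 - t)

rng = np.random.default_rng(0)
z0 = rng.standard_normal((2, 8))   # initial latent (noise)
target = rng.standard_normal((2, 8))  # stands in for the albedo latent

z_one_step = euler_sample(toy_velocity, z0, target, num_steps=1)
z_four_steps = euler_sample(toy_velocity, z0, target, num_steps=4)
```

In this toy setting the path is exactly straight, so one Euler step and four Euler steps reach the same endpoint. A learned velocity field is only approximately straight, so one-step accuracy depends on how well training straightens the path.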
Problem

Research questions and friction points this paper is trying to address.

Intrinsic Image Decomposition
parameter efficiency
real-time vision
model deployment
albedo and shading
Innovation

Methods, ideas, or system contributions that make the work stand out.

Flow Matching
Intrinsic Image Decomposition
Latent Space
Parameter-Efficient
Single-Step Inference
Mithlesh Singla
Indian Institute of Technology Gandhinagar, India
Seema Kumari
Indian Institute of Technology Gandhinagar, India
Shanmuganathan Raman
Professor, EE and CSE, Indian Institute of Technology Gandhinagar
Computer Vision, Computer Graphics, Deep Learning, Artificial Intelligence, AI