HDRAgent: An Agentic Framework for Multi-Exposure HDR Imaging

📅 2026-06-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the ghosting artifacts commonly encountered in traditional multi-exposure HDR imaging of dynamic scenes, which stem from the use of fixed feedforward reconstruction paradigms. To overcome this limitation, the paper introduces an adaptive reconstruction framework that incorporates an agent-based mechanism into HDR imaging for the first time. The proposed approach leverages a multimodal large language model to enable scene understanding and dynamic region parsing, and integrates fine-grained contextual knowledge matching, a perception–distortion feedback loop, and an agent-guided generative alignment strategy to dynamically optimize the reconstruction process. This method significantly suppresses ghosting and local artifacts, achieving state-of-the-art or superior performance in both objective metrics and visual quality.
📝 Abstract
Most existing multi-exposure HDR methods follow a fixed feed-forward reconstruction paradigm, making them prone to ghosting artifacts in complex dynamic scenes. To address this issue, we propose HDRAgent, the first agent-driven framework for HDR imaging, which adaptively selects reconstruction strategies according to the current scene conditions. Specifically, to provide scene-specific prior knowledge, we introduce a fine-grained contextual knowledge matching (FCM) module. This module leverages multimodal large language model (MLLM)-derived scene perception to retrieve relevant historical cases and tool knowledge, organizing them into structured evidence for MLLM-based adaptive tool scheduling. In addition, we propose a perception--distortion feedback mechanism that transforms post-execution quality assessment and artifact diagnosis into structured feedback, which is accumulated in historical memory to help subsequent contextual knowledge refinement and strategy selection. Furthermore, considering that extreme motion can invalidate alignment methods, we design an agent-guided generative alignment strategy that uses MLLM-based dynamic-region parsing to reconstruct unreliable contents in non-reference frames under reference-frame guidance. Experiments demonstrate that HDRAgent effectively reduces ghosting and local artifacts while achieving competitive or superior objective performance and visual quality.
Problem

Research questions and friction points this paper is trying to address.

HDR imaging
ghosting artifacts
dynamic scenes
multi-exposure
reconstruction
Innovation

Methods, ideas, or system contributions that make the work stand out.

agentic framework
contextual knowledge matching
perception-distortion feedback
generative alignment
multi-exposure HDR
🔎 Similar Papers