Row–Column Separated Attention Based Low‐Light Image/Video Enhancement

📅 2024-08-29
🏛️ Computer graphics forum (Print)
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of excessive local noise, detail loss, and insufficient exploitation of global context in low-light image and video enhancement. To this end, we propose a lightweight Row-Column Separated Attention (RCSA) module, integrated into an enhanced U-Net architecture. RCSA efficiently captures global contextual information by aggregating row-wise and column-wise mean and maximum statistics from feature maps, thereby guiding localized enhancement while significantly reducing computational overhead. Furthermore, we extend this design to video enhancement for the first time and introduce two novel temporal consistency loss functions to ensure smooth inter-frame coherence. Extensive experiments on the LOL, MIT-Adobe FiveK, and SDSD datasets demonstrate state-of-the-art performance, and the code is publicly released.

Technology Category

Application Category

📝 Abstract
U‐Net structure is widely used for low‐light image/video enhancement. The enhanced images result in areas with large local noise and loss of more details without proper guidance for global information. Attention mechanisms can better focus on and use global information. However, attention to images could significantly increase the number of parameters and computations. We propose a Row–Column Separated Attention module (RCSA) inserted after an improved U‐Net. The RCSA module's input is the mean and maximum of the row and column of the feature map, which utilizes global information to guide local information with fewer parameters. We propose two temporal loss functions to apply the method to low‐light video enhancement and maintain temporal consistency. Extensive experiments on the LOL, MIT Adobe FiveK image, and SDSD video datasets demonstrate the effectiveness of our approach.
Problem

Research questions and friction points this paper is trying to address.

low-light enhancement
image/video enhancement
attention mechanism
global information
temporal consistency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Row-Column Separated Attention
Low-Light Enhancement
U-Net
Temporal Consistency
Global Information Guidance
🔎 Similar Papers
No similar papers found.
C
Chengqi Dong
School of Artificial Intelligence, Jilin University, China
Z
Zhiyuan Cao
College of Software, Jilin University, China
T
Tuoshi Qi
School of Artificial Intelligence, Jilin University, China
K
Kexin Wu
School of Artificial Intelligence, Jilin University, China
Yixing Gao
Yixing Gao
Jilin University
Assistive humanoid robotsComputer visionHuman-robot interaction
F
Fan Tang
Institute of Computing Technology, Chinese Academy of Sciences, China