Implicit Neural Representation for Video and Image Super-Resolution

📅 2025-03-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing video and image super-resolution methods suffer from poor detail consistency, temporal instability, and reliance on computationally expensive optical flow estimation. To address these issues, this paper proposes SR-INR—the first unified implicit neural representation (INR) framework for both image and video super-resolution. SR-INR conditions a lightweight, end-to-end differentiable MLP solely on low-resolution inputs and 3D spatiotemporal coordinates, directly regressing high-resolution pixel values without explicit motion modeling. Its core innovation is a joint spatiotemporal coordinate encoding scheme, which ensures inter-frame detail consistency and strong temporal stability while preserving architectural simplicity. Experiments demonstrate that SR-INR achieves or surpasses state-of-the-art performance across multiple benchmarks. Notably, it attains superior computational efficiency—featuring significantly fewer parameters, faster inference, and reduced computational overhead—while maintaining high-fidelity spatiotemporal coherence.

Technology Category

Application Category

📝 Abstract
We present a novel approach for super-resolution that utilizes implicit neural representation (INR) to effectively reconstruct and enhance low-resolution videos and images. By leveraging the capacity of neural networks to implicitly encode spatial and temporal features, our method facilitates high-resolution reconstruction using only low-resolution inputs and a 3D high-resolution grid. This results in an efficient solution for both image and video super-resolution. Our proposed method, SR-INR, maintains consistent details across frames and images, achieving impressive temporal stability without relying on the computationally intensive optical flow or motion estimation typically used in other video super-resolution techniques. The simplicity of our approach contrasts with the complexity of many existing methods, making it both effective and efficient. Experimental evaluations show that SR-INR delivers results on par with or superior to state-of-the-art super-resolution methods, while maintaining a more straightforward structure and reduced computational demands. These findings highlight the potential of implicit neural representations as a powerful tool for reconstructing high-quality, temporally consistent video and image signals from low-resolution data.
Problem

Research questions and friction points this paper is trying to address.

Enhances low-resolution videos and images using implicit neural representation.
Achieves high-resolution reconstruction without optical flow or motion estimation.
Provides efficient, temporally stable super-resolution with reduced computational demands.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses implicit neural representation for super-resolution
Leverages 3D grid for high-resolution reconstruction
Achieves temporal stability without optical flow
🔎 Similar Papers
No similar papers found.
M
M. Aiyetigbo
Clemson University
W
Wanqi Yuan
Clemson University
F
Feng Luo
Clemson University
Nianyi Li
Nianyi Li
Assistant Professor, Clemson University
Computer VisionComputational PhotographyMachine Learning