Exploring Real-Time Super-Resolution: Benchmarking and Fine-Tuning for Streaming Content

📅 2026-02-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitations of existing real-time super-resolution methods in compressed video streaming scenarios, where performance is often degraded and benchmarking resources are scarce. To bridge this gap, we introduce StreamSR, a new dataset derived from YouTube videos encompassing diverse content and resolutions, and present a systematic evaluation of eleven state-of-the-art real-time super-resolution models. Furthermore, we propose EfRLFN, an efficient model that integrates hyperbolic tangent activation with a lightweight channel attention mechanism, enhanced by a composite loss function and architectural refinements. EfRLFN achieves significant visual quality improvements while maintaining real-time inference speeds. Experimental results demonstrate that fine-tuning on StreamSR consistently yields notable performance gains across multiple benchmarks. The dataset, code, and evaluation framework are publicly released to support future research.

Technology Category

Application Category

📝 Abstract
Recent advancements in real-time super-resolution have enabled higher-quality video streaming, yet existing methods struggle with the unique challenges of compressed video content. Commonly used datasets do not accurately reflect the characteristics of streaming media, limiting the relevance of current benchmarks. To address this gap, we introduce a comprehensive dataset - StreamSR - sourced from YouTube, covering a wide range of video genres and resolutions representative of real-world streaming scenarios. We benchmark 11 state-of-the-art real-time super-resolution models to evaluate their performance for the streaming use-case. Furthermore, we propose EfRLFN, an efficient real-time model that integrates Efficient Channel Attention and a hyperbolic tangent activation function - a novel design choice in the context of real-time super-resolution. We extensively optimized the architecture to maximize efficiency and designed a composite loss function that improves training convergence. EfRLFN combines the strengths of existing architectures while improving both visual quality and runtime performance. Finally, we show that fine-tuning other models on our dataset results in significant performance gains that generalize well across various standard benchmarks. We made the dataset, the code, and the benchmark available at https://github.com/EvgeneyBogatyrev/EfRLFN.
Problem

Research questions and friction points this paper is trying to address.

real-time super-resolution
streaming content
compressed video
benchmarking
dataset
Innovation

Methods, ideas, or system contributions that make the work stand out.

real-time super-resolution
streaming content
efficient channel attention
composite loss function
dataset benchmarking
🔎 Similar Papers
No similar papers found.
E
Evgeney Bogatyrev
Lomonosov Moscow State University, 119991, Moscow, Russia; AI Center, Lomonosov Moscow State University; MSU Institute for Artificial Intelligence, Lomonosov Moscow State University
Khaled Abud
Khaled Abud
Researcher, Lomonosov Moscow State University, MSU Institute for Artificial Intelligence
Deep LearningImage Quality AssessmentImage GenerationDiffusion ModelsAdversarial Robustness
I
Ivan Molodetskikh
Lomonosov Moscow State University, 119991, Moscow, Russia; AI Center, Lomonosov Moscow State University
N
Nikita Alutis
Lomonosov Moscow State University, 119991, Moscow, Russia; AI Center, Lomonosov Moscow State University
D
Dmitry Vatolin
Lomonosov Moscow State University, 119991, Moscow, Russia; AI Center, Lomonosov Moscow State University; MSU Institute for Artificial Intelligence, Lomonosov Moscow State University