🤖 AI Summary
To address the issues of poor local minima susceptibility and limited interpretability arising from random initialization in conventional deep neural networks (DNNs) for image interpolation, this paper proposes a learnable graph filtering framework grounded in the graph shift total variation (GSTV) prior. Methodologically, we first establish a theoretical mapping from pseudo-linear interpolation operators to directed graph filtering structures, enabling deterministic, performance-guaranteed initialization of the adjacency matrix. Subsequently, we unroll the Douglas–Rachford splitting algorithm into a lightweight, fully interpretable neural architecture. Our approach significantly reduces model parameter count while achieving state-of-the-art interpolation performance across multiple benchmark datasets. The contributions include: (i) a theoretically principled initialization strategy rooted in graph signal processing; (ii) an intrinsically interpretable network design derived from proximal optimization; and (iii) superior efficiency and accuracy trade-offs, jointly ensuring rigorous mathematical foundations, structural transparency, and computational efficacy.
📝 Abstract
Conventional deep neural nets (DNNs) initialize network parameters at random and then optimize each one via stochastic gradient descent (SGD), resulting in substantial risk of poor-performing local minima.Focusing on the image interpolation problem and leveraging a recent theorem that maps a (pseudo-)linear interpolator Θ to a directed graph filter that is a solution to a MAP problem regularized with a graph shift variation (GSV) prior, we first initialize a directed graph adjacency matrix A based on a known interpolator Θ, establishing a baseline performance.Then, towards further gain, we learn perturbation matrices P and P(2) from data to augment A, whose restoration effects are implemented via Douglas-Rachford (DR) iterations, which we unroll into a lightweight interpretable neural net.Experimental results demonstrate state-of-the-art image interpolation results, while drastically reducing network parameters.