Theoretical guarantees for stochastic gradient sampling methods via Gaussian convolution inequalities

📅 2026-04-27

📈 Citations: 0

✨ Influential: 0

career value

244K/year

🤖 AI Summary

This work addresses the lack of sharp non-asymptotic theoretical guarantees for the invariant measure bias in stochastic gradient Markov Chain Monte Carlo (SG-MCMC) methods. Under mild assumptions, it investigates the accuracy of the invariant measure for stochastic gradient kinetic Langevin dynamics. By introducing a novel Gaussian convolution inequality, the paper establishes, for the first time, a tight first-order Wasserstein distance bound that scales linearly with the step size. This result resolves a long-standing open problem in the field, significantly strengthening the theoretical foundations of SG-MCMC algorithms. Moreover, it provides a quantitative characterization of the invariant measure bias, and the newly developed inequality is of independent interest with potential applications beyond this context.

Technology Category

Application Category

📝 Abstract

We derive first-order (in the stepsize) bounds on the bias in Wasserstein distances of the invariant measure of stochastic gradient kinetic Langevin dynamics with minimal assumptions on the stochastic gradient noise. These bounds sharpen existing non-asymptotic guarantees for stochastic-gradient MCMC methods and provide a quantitative resolution of a previously open problem on invariant measure accuracy. The main technical ingredients are new Gaussian convolution inequalities controlling the Wasserstein-$p$ distance between a Gaussian convolved with a mean-zero perturbation and the Gaussian itself. We anticipate that these inequalities will be of independent interest beyond the present application.

Problem

Research questions and friction points this paper is trying to address.

stochastic gradient sampling

invariant measure accuracy

Wasserstein distance

non-asymptotic guarantees

Gaussian convolution inequalities

Innovation

Methods, ideas, or system contributions that make the work stand out.

Gaussian convolution inequalities

Wasserstein distance

stochastic gradient MCMC