🤖 AI Summary
Diffusion models are commonly assumed to rely solely on Gaussian priors, raising fundamental questions about whether higher-order cumulants (i.e., n-point correlation functions) beyond Gaussian statistics are erased during the forward process and whether they can be recovered in the reverse process.
Method: The authors combine cumulant generating functional analysis, score estimation theory, analytic derivation, and exactly solvable toy models, complemented by numerical validation in scalar lattice field theory.
Contribution/Results: For variance-increasing forward processes, the authors provide the first rigorous proof that all higher-order cumulants are conserved; their information is fully encoded in the score function and precisely reconstructed by the reverse sampling process. Crucially, even when the forward process endpoint is approximately Gaussian, higher-order cumulants remain non-vanishing and are accurately recovered—challenging the conventional view that diffusion models fundamentally depend only on Gaussian assumptions. This establishes a theoretical foundation for modeling non-Gaussian structure in diffusion-based generative modeling.
📝 Abstract
To analyse how diffusion models learn correlations beyond Gaussian ones, we study the behaviour of higher-order cumulants, or connected n-point functions, under both the forward and backward process. We derive explicit expressions for the moment- and cumulant-generating functionals, in terms of the distribution of the initial data and properties of forward process. It is shown analytically that during the forward process higher-order cumulants are conserved in models without a drift, such as the variance-expanding scheme, and that therefore the endpoint of the forward process maintains nontrivial correlations. We demonstrate that since these correlations are encoded in the score function, higher-order cumulants are learnt in the backward process, also when starting from a normal prior. We confirm our analytical results in an exactly solvable toy model with nonzero cumulants and in scalar lattice field theory.