🤖 AI Summary
Diffusion models achieve high-fidelity generation but suffer from substantial computational overhead due to multi-step ODE sampling. While existing gradient-based solvers reduce function evaluations, they rely on intricate training strategies and often fail to preserve fine-grained details. To address this, we propose the Generalized Adversarial Solver (GAS), which simplifies the parameterization of ODE solvers, incorporates a distillation loss, and introduces discriminator-guided adversarial training—without requiring auxiliary training techniques. GAS effectively suppresses artifacts and enhances detail recovery. Experiments demonstrate that, under comparable computational budgets, GAS achieves superior generation quality over state-of-the-art solver training methods using fewer sampling steps—particularly improving texture sharpness and structural fidelity.
📝 Abstract
While diffusion models achieve state-of-the-art generation quality, they still suffer from computationally expensive sampling. Recent works address this issue with gradient-based optimization methods that distill a few-step ODE diffusion solver from the full sampling process, reducing the number of function evaluations from dozens to just a few. However, these approaches often rely on intricate training techniques and do not explicitly focus on preserving fine-grained details. In this paper, we introduce the Generalized Solver: a simple parameterization of the ODE sampler that does not require additional training tricks and improves quality over existing approaches. We further combine the original distillation loss with adversarial training, which mitigates artifacts and enhances detail fidelity. We call the resulting method the Generalized Adversarial Solver and demonstrate its superior performance compared to existing solver training methods under similar resource constraints. Code is available at https://github.com/3145tttt/GAS.