๐ค AI Summary
Precipitation nowcasting faces dual challenges: deterministic models suffer from inherent blur, while generative models exhibit limited accuracy. To address this, we propose a two-stage short-term forecasting framework. In the first stage, a spatiotemporal conditional network generates high-accuracy deterministic predictions. In the second stage, a latent diffusion model (LDM) is jointly optimized with a variational autoencoder (VAE) in latent spaceโenabling end-to-end stochastic modeling and, for the first time, co-training of LDM, VAE, and the conditional network. This design decouples deterministic forecasting from uncertainty quantification, balancing structural fidelity and sample diversity. Evaluated on multiple radar echo datasets, our method significantly outperforms state-of-the-art approaches in both forecast accuracy and fine-grained detail preservation, while reducing inference computational overhead.
๐ Abstract
Precipitation nowcasting is a critical spatio-temporal prediction task for society to prevent severe damage owing to extreme weather events. Despite the advances in this field, the complex and stochastic nature of this task still poses challenges to existing approaches. Specifically, deterministic models tend to produce blurry predictions while generative models often struggle with poor accuracy. In this paper, we present a simple yet effective model architecture termed STLDM, a diffusion-based model that learns the latent representation from end to end alongside both the Variational Autoencoder and the conditioning network. STLDM decomposes this task into two stages: a deterministic forecasting stage handled by the conditioning network, and an enhancement stage performed by the latent diffusion model. Experimental results on multiple radar datasets demonstrate that STLDM achieves superior performance compared to the state of the art, while also improving inference efficiency. The code is available in https://github.com/sqfoo/stldm_official.