🤖 AI Summary
This study addresses the limited generalizability of the meteorological foundation model Aurora to hydrological variables unseen during pretraining. To overcome this, we propose an efficient adaptation strategy that freezes Aurora’s backbone and appends a lightweight feed-forward decoder. Unlike full-model fine-tuning, our method trains only the shallow decoder, leveraging latent representations extracted by the frozen backbone for cross-variable transfer—demonstrating, for the first time, that Aurora’s latent space encodes physically consistent, transferable information. Experiments across multiple novel hydrological variables show that our approach achieves prediction accuracy comparable to full fine-tuning while reducing training time by 50% and GPU memory consumption by 35%, all without compromising autoregressive stability. This work establishes a new paradigm for the resource-efficient extension of foundation models in computationally constrained settings.
📝 Abstract
Recent advances in AI weather forecasting have led to the emergence of so-called "foundation models", typically defined by expensive pretraining and minimal fine-tuning for downstream tasks. However, in the natural sciences, a desirable foundation model should also encode meaningful statistical relationships between the underlying physical variables. This study evaluates the performance of the state-of-the-art Aurora foundation model in predicting hydrological variables, which were not considered during pretraining. We introduce a lightweight approach using shallow decoders trained on the latent representations of the pretrained model to predict these new variables. As a baseline, we compare this to fine-tuning the full model, which allows further optimization of the latent space while incorporating new variables into both inputs and outputs. The decoder-based approach requires 50% less training time and 35% less memory, while achieving strong accuracy across various hydrological variables and preserving desirable properties of the foundation model, such as autoregressive stability. Notably, decoder accuracy depends on the physical correlation between the new variables and those used during pretraining, indicating that Aurora's latent space captures meaningful physical relationships. In this sense, we argue that an important quality metric for foundation models in Earth sciences is their ability to be extended to new variables without full fine-tuning. This provides a new perspective for making foundation models more accessible to communities with limited computational resources, while supporting broader adoption in Earth sciences.
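The adaptation recipe described above — freeze the pretrained backbone, then train only a shallow feed-forward decoder on its latent representations — can be sketched in PyTorch as follows. This is a minimal illustration under stated assumptions: the small `backbone` module, the layer sizes, and the synthetic inputs and targets are hypothetical stand-ins, not Aurora's actual architecture or data pipeline.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for Aurora's pretrained backbone: any module that
# maps input fields to latent representations (shapes are illustrative).
backbone = nn.Sequential(nn.Linear(16, 64), nn.GELU(), nn.Linear(64, 32))

# Freeze every backbone parameter so only the decoder is optimized.
for p in backbone.parameters():
    p.requires_grad = False
backbone.eval()

# Lightweight feed-forward decoder for a new (e.g. hydrological) variable.
decoder = nn.Sequential(nn.Linear(32, 64), nn.GELU(), nn.Linear(64, 1))

optimizer = torch.optim.Adam(decoder.parameters(), lr=1e-2)
loss_fn = nn.MSELoss()

# Synthetic data standing in for atmospheric inputs and the new target.
torch.manual_seed(0)
x = torch.randn(256, 16)
y = torch.randn(256, 1)

losses = []
for _ in range(100):
    with torch.no_grad():          # backbone features are fixed
        z = backbone(x)
    pred = decoder(z)              # only the decoder sees gradients
    loss = loss_fn(pred, y)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    losses.append(loss.item())

print(f"decoder loss: {losses[0]:.3f} -> {losses[-1]:.3f}")
```

Because gradients never flow into the backbone, no optimizer state or activation memory is kept for it, which is the source of the training-time and memory savings the paper reports relative to full fine-tuning.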