Out-of-Support Generalisation via Weight Space Sequence Modelling

📅 2026-02-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Deep learning models often produce overconfident yet unreliable predictions on out-of-support (OoS) samples. This work proposes WeightCaster, a novel framework that reframes OoS generalization as a sequence prediction problem in weight space. By partitioning training data into concentric shells treated as discrete time steps, WeightCaster enables interpretable, uncertainty-aware predictions without requiring explicit inductive biases. The approach achieves a favorable balance between computational efficiency and reliability, matching or surpassing state-of-the-art methods on both synthetic cosine datasets and real-world air quality sensor data. Consequently, it significantly enhances model robustness and trustworthiness in out-of-support scenarios.

Technology Category

Application Category

📝 Abstract
As breakthroughs in deep learning transform key industries, models are increasingly required to extrapolate on datapoints found outside the range of the training set, a challenge we coin as out-of-support (OoS) generalisation. However, neural networks frequently exhibit catastrophic failure on OoS samples, yielding unrealistic but overconfident predictions. We address this challenge by reformulating the OoS generalisation problem as a sequence modelling task in the weight space, wherein the training set is partitioned into concentric shells corresponding to discrete sequential steps. Our WeightCaster framework yields plausible, interpretable, and uncertainty-aware predictions without necessitating explicit inductive biases, all the while maintaining high computational efficiency. Emprical validation on a synthetic cosine dataset and real-world air quality sensor readings demonstrates performance competitive or superior to the state-of-the-art. By enhancing reliability beyond in-distribution scenarios, these results hold significant implications for the wider adoption of artificial intelligence in safety-critical applications.
Problem

Research questions and friction points this paper is trying to address.

out-of-support generalisation
extrapolation
neural networks
distribution shift
catastrophic failure
Innovation

Methods, ideas, or system contributions that make the work stand out.

out-of-support generalisation
weight space sequence modelling
WeightCaster
uncertainty-aware prediction
extrapolation
🔎 Similar Papers
No similar papers found.