Joint leave-group-out cross-validation in Bayesian spatial models

πŸ“… 2025-04-22
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Standard cross-validation fails for spatial data due to non-exchangeability. To address this, we propose a joint leave-group-out (JLGO) cross-validation strategy for Gaussian spatial models with covariance structures: observations are removed in spatial blocks, and predictive performance is evaluated holistically via the joint log-scoreβ€”rather than aggregating pointwise predictions. This work introduces, for the first time, joint block-scoring into the Bayesian spatial model cross-validation framework. The approach substantially reduces estimator variance and markedly improves discrimination among competing models under strong spatial dependence (ρ > 0.8). Empirical results demonstrate that, compared to pointwise scoring, JLGO increases model selection stability by over 35% and reduces misselection rate by more than 50%. Our method establishes a new paradigm for robust model selection in highly correlated spatial data.

Technology Category

Application Category

πŸ“ Abstract
Cross-validation (CV) is a widely-used method of predictive assessment based on repeated model fits to different subsets of the available data. CV is applicable in a wide range of statistical settings. However, in cases where data are not exchangeable, the design of CV schemes should account for suspected correlation structures within the data. CV scheme designs include the selection of left-out blocks and the choice of scoring function for evaluating predictive performance. This paper focuses on the impact of two scoring strategies for block-wise CV applied to spatial models with Gaussian covariance structures. We investigate, through several experiments, whether evaluating the predictive performance of blocks of left-out observations jointly, rather than aggregating individual (pointwise) predictions, improves model selection performance. Extending recent findings for data with serial correlation (such as time-series data), our experiments suggest that joint scoring reduces the variability of CV estimates, leading to more reliable model selection, particularly when spatial dependence is strong and model differences are subtle.
Problem

Research questions and friction points this paper is trying to address.

Evaluating predictive performance in spatial models with Gaussian covariance
Comparing joint vs pointwise scoring strategies for block-wise CV
Assessing impact of joint scoring on model selection reliability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Joint scoring in block-wise CV
Gaussian covariance spatial models
Reduces CV estimate variability
πŸ”Ž Similar Papers
No similar papers found.