🤖 AI Summary
To address the failure of cross-domain predictive generalization under distributional shift caused by latent confounding, this paper proposes the first Bayesian Generative Invariance (BGI) framework. Unlike frequentist generative invariance (GI) methods—which cannot identify causal structures and lack uncertainty quantification—BGI endows the number of observable environments with an asymptotic prior interpretation, enabling identifiable causal discovery and well-calibrated out-of-distribution prediction without hyperparameter tuning or prior knowledge of distribution shifts. The method integrates multi-environment joint Bayesian modeling with the principle of generative invariance, supporting robust high-dimensional inference. Experiments demonstrate that BGI achieves empirical coverage rates close to nominal levels in low- to medium-dimensional settings, significantly outperforming conservative regularization and existing frequentist approaches.
📝 Abstract
Predicting outcomes in external domains is challenging due to hidden confounders that influence both predictors and outcomes, complicating generalization under distribution shifts. Traditional methods often rely on stringent assumptions or overly conservative regularization, compromising estimation and predictive accuracy. Generative Invariance (GI) is a novel framework that facilitates predictions in unseen domains without requiring hyperparameter tuning or knowledge of specific distribution shifts. However, the available frequentist version of GI does not always enable identification and lacks uncertainty quantification for its predictions. This paper develops a Bayesian formulation that extends GI with well-calibrated external predictions and facilitates causal discovery. We present theoretical guarantees showing that prior distributions assign asymptotic meaning to the number of distinct datasets that could be observed. Simulations and an application case highlight the remarkable empirical coverage behavior of our approach, nearly unchanged when transitioning from low- to moderate-dimensional settings.