Identifying Causal Direction via Variational Bayesian Compression

📅 2025-05-12

📈 Citations: 0

✨ Influential: 0

career value

222K/year

🤖 AI Summary

This paper addresses causal direction identification between two variables from purely observational data. We propose a novel causal discovery method grounded in the algorithmic Markov condition, which—uniquely—models the variational Bayesian neural network (VBNN) learning process directly as an estimator of encoding length under causal factorization, thereby integrating the minimum description length principle with the strong representational capacity of neural networks. Unlike conventional Gaussian process-based approaches, our framework overcomes their computational complexity and expressiveness limitations by jointly optimizing variational inference, parameter compression, and causal structure. Extensive experiments on both synthetic and real-world benchmarks demonstrate that our method significantly outperforms state-of-the-art complexity-driven and structural causal model regression approaches, advancing bivariate causal discovery to a new performance frontier.

Technology Category

Application Category

📝 Abstract

Telling apart the cause and effect between two random variables with purely observational data is a challenging problem that finds applications in various scientific disciplines. A key principle utilized in this task is the algorithmic Markov condition, which postulates that the joint distribution, when factorized according to the causal direction, yields a more succinct codelength compared to the anti-causal direction. Previous approaches approximate these codelengths by relying on simple functions or Gaussian processes (GPs) with easily evaluable complexity, compromising between model fitness and computational complexity. To overcome these limitations, we propose leveraging the variational Bayesian learning of neural networks as an interpretation of the codelengths. Consequently, we can enhance the model fitness while promoting the succinctness of the codelengths, while avoiding the significant computational complexity of the GP-based approaches. Extensive experiments on both synthetic and real-world benchmarks in cause-effect identification demonstrate the effectiveness of our proposed method, surpassing the overall performance of related complexity-based and structural causal model regression-based approaches.

Problem

Research questions and friction points this paper is trying to address.

Identifying causal direction between two variables using observational data

Improving codelength succinctness via variational Bayesian neural networks

Outperforming existing complexity-based and regression-based causal methods

Innovation

Methods, ideas, or system contributions that make the work stand out.

Variational Bayesian learning for codelength interpretation

Neural networks enhance model fitness and succinctness

Avoids high computational complexity of GP-based methods

🔎 Similar Papers

The Causal Information Bottleneck and Optimal Causal Variable Abstractions