🤖 AI Summary
This work addresses causal graph learning from observational data. We propose DrBO, the first framework to integrate Bayesian optimization (BO) into directed acyclic graph (DAG) structure search for efficient approximation of the optimal causal graph. To overcome the scalability limitations of Gaussian processes in high-dimensional graph spaces—traditional BO’s bottleneck—we innovatively employ a continually trained dropout neural network as the surrogate model, jointly achieving reliable uncertainty estimation, strong generalization, and computational efficiency. DrBO jointly optimizes both the DAG constraint and the causal scoring function. Empirical evaluation on synthetic and real-world benchmarks demonstrates that DrBO significantly outperforms state-of-the-art methods: it recovers higher-fidelity causal structures with fewer iterations and substantially reduced runtime. The implementation is publicly available.
📝 Abstract
Existing score-based methods for directed acyclic graph (DAG) learning from observational data struggle to recover the causal graph accurately and sample-efficiently. To overcome this, in this study, we propose DrBO (DAG recovery via Bayesian Optimization)-a novel DAG learning framework leveraging Bayesian optimization (BO) to find high-scoring DAGs. We show that, by sophisticatedly choosing the promising DAGs to explore, we can find higher-scoring ones much more efficiently. To address the scalability issues of conventional BO in DAG learning, we replace Gaussian Processes commonly employed in BO with dropout neural networks, trained in a continual manner, which allows for (i) flexibly modeling the DAG scores without overfitting, (ii) incorporation of uncertainty into the estimated scores, and (iii) scaling with the number of evaluations. As a result, DrBO is computationally efficient and can find the accurate DAG in fewer trials and less time than existing state-of-the-art methods. This is demonstrated through an extensive set of empirical evaluations on many challenging settings with both synthetic and real data. Our implementation is available at https://github.com/baosws/DrBO.