Nonlinear Causal Discovery with Confounders

📅 2023-02-07
🏛️ Journal of the American Statistical Association
📈 Citations: 17
Influential: 1
📄 PDF
🤖 AI Summary
This paper addresses causal structure learning for nonlinear directed acyclic graphs (DAGs) under latent confounding with Gaussian noise. We propose DeFuSE: a two-stage method that first performs deconfounding correction to mitigate latent-variable bias, then jointly estimates the causal ordering and nonlinear functional relationships via a sequential neural network architecture. Key contributions include: (i) the first identifiability result for latent confounding in nonlinear DAG models; (ii) introduction of a sublinear growth assumption to ensure model identifiability; and (iii) formulation of a strong causal minimality condition, with theoretical proof of estimation consistency. Implemented using feedforward neural networks, DeFuSE balances scalability with rigorous statistical guarantees. Experiments demonstrate substantial improvements over state-of-the-art methods that either ignore confounding or assume linearity, on both synthetic benchmarks and real-world gene regulatory network inference. An open-source Python implementation is publicly available.
📝 Abstract
Abstract This article introduces a causal discovery method to learn nonlinear relationships in a directed acyclic graph with correlated Gaussian errors due to confounding. First, we derive model identifiability under the sublinear growth assumption. Then, we propose a novel method, named the Deconfounded Functional Structure Estimation (DeFuSE), consisting of a deconfounding adjustment to remove the confounding effects and a sequential procedure to estimate the causal order of variables. We implement DeFuSE via feedforward neural networks for scalable computation. Moreover, we establish the consistency of DeFuSE under an assumption called the strong causal minimality. In simulations, DeFuSE compares favorably against state-of-the-art competitors that ignore confounding or nonlinearity. Finally, we demonstrate the utility and effectiveness of the proposed approach with an application to gene regulatory network analysis. The Python implementation is available at https://github.com/chunlinli/defuse. Supplementary materials for this article are available online.
Problem

Research questions and friction points this paper is trying to address.

Identifying nonlinear causal relationships with confounding effects
Proposing DeFuSE for deconfounding and causal order estimation
Validating method via gene regulatory network analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deconfounding adjustment for removing confounding effects
Sequential procedure for estimating causal order
Feedforward neural networks for scalable computation
🔎 Similar Papers
No similar papers found.
C
Chunlin Li
School of Statistics, University of Minnesota
Xiaotong Shen
Xiaotong Shen
Singapore-MIT Alliance for Research and Technology
RoboticsV2V communicationCooperative Perception & Planning
W
W. Pan
Division of Biostatistics, University of Minnesota