Multilinear and Linear Programs for Partially Identifiable Queries in Quasi-Markovian Structural Causal Models

📅 2025-09-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the problem of computing tight bounds for partially identifiable probabilistic queries in quasi-Markov structural causal models (SCMs), where endogenous variables are observable and their distributions known, but exogenous confounders remain incompletely specified. To solve this, we propose a novel column-generation-based algorithm that reformulates the original multilinear program into a sequence of tractable auxiliary linear integer programs. Theoretically, we prove that, under single interventions, an equivalent representation of the exogenous variables admits a polynomial-size characterization—overcoming the traditional exponential-complexity barrier. Our method integrates Bayesian network modeling, causal inference principles, and optimization techniques. Experiments demonstrate that our approach significantly outperforms state-of-the-art methods in computational efficiency, bound tightness, and model parsimony.

Technology Category

Application Category

📝 Abstract
We investigate partially identifiable queries in a class of causal models. We focus on acyclic Structural Causal Models that are quasi-Markovian (that is, each endogenous variable is connected with at most one exogenous confounder). We look into scenarios where endogenous variables are observed (and a distribution over them is known), while exogenous variables are not fully specified. This leads to a representation that is in essence a Bayesian network where the distribution of root variables is not uniquely determined. In such circumstances, it may not be possible to precisely compute a probability value of interest. We thus study the computation of tight probability bounds, a problem that has been solved by multilinear programming in general, and by linear programming when a single confounded component is intervened upon. We present a new algorithm to simplify the construction of such programs by exploiting input probabilities over endogenous variables. For scenarios with a single intervention, we apply column generation to compute a probability bound through a sequence of auxiliary linear integer programs, thus showing that a representation with polynomial cardinality for exogenous variables is possible. Experiments show column generation techniques to be superior to existing methods.
Problem

Research questions and friction points this paper is trying to address.

Computing tight probability bounds in quasi-Markovian causal models
Handling partial identifiability when exogenous variables are unspecified
Developing efficient algorithms for causal inference with confounded components
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multilinear programming for probability bounds
Column generation with linear integer programs
Polynomial cardinality for exogenous variables
🔎 Similar Papers
No similar papers found.
J
João P. Arroyo
Universidade de São Paulo, São Paulo, Brazil
J
João G. Rodrigues
Universidade de São Paulo, São Paulo, Brazil
D
Daniel Lawand
Universidade de São Paulo, São Paulo, Brazil
D
Denis D. Mauá
Universidade de São Paulo, São Paulo, Brazil
Junkyu Lee
Junkyu Lee
IBM
Artificial IntelligenceGraphical ModelsHeuristic SearchPlanning
Radu Marinescu
Radu Marinescu
IBM Research
Artificial Intelligence
A
Alex Gray
Centaur Institute, USA
E
Eduardo R. Laurentino
Instituto de Ciência e Tecnologia Itaú, São Paulo, Brazil
F
Fabio G. Cozman
Universidade de São Paulo, São Paulo, Brazil