🤖 AI Summary
This paper addresses the challenge of causal inference under multivalued treatments—binary, categorical, and continuous—as well as their interaction effects, a gap unaddressed by existing methods predominantly focused on binary interventions and neglecting concurrent, multiple treatments. We propose the first general-purpose framework based on double machine learning (DML), integrating Neyman orthogonality and cross-fitting to accommodate heterogeneous treatment types and complex confounding structures. The framework employs machine learning–driven propensity score and outcome models (e.g., random forests, Lasso). We establish asymptotic normality of the proposed estimator and derive a robust variance estimator. Extensive simulations demonstrate high accuracy and robustness across diverse data-generating processes. Empirically, we apply the method to a cohort of 2,455 HIV patients in Nigeria, providing the first quantitative assessment of the joint causal effect of three distinct treatments on HIV-associated nephropathy.
📝 Abstract
Causal inference literature has extensively focused on binary treatments, with relatively fewer methods developed for multi-valued treatments. In particular, methods for multiple simultaneously assigned treatments remain understudied despite their practical importance. This paper introduces two settings: (1) estimating the effects of multiple treatments of different types (binary, categorical, and continuous) and the effects of treatment interactions, and (2) estimating the average treatment effect across categories of multi-valued regimens. To obtain robust estimates for both settings, we propose a class of methods based on the Double Machine Learning (DML) framework. Our methods are well-suited for complex settings of multiple treatments/regimens, using machine learning to model confounding relationships while overcoming regularization and overfitting biases through Neyman orthogonality and cross-fitting. To our knowledge, this work is the first to apply machine learning for robust estimation of interaction effects in the presence of multiple treatments. We further establish the asymptotic distribution of our estimators and derive variance estimators for statistical inference. Extensive simulations demonstrate the performance of our methods. Finally, we apply the methods to study the effect of three treatments on HIV-associated kidney disease in an adult HIV cohort of 2455 participants in Nigeria.