Estimating Effects of Long-Term Treatments

📅 2023-07-07
🏛️ ACM Conference on Economics and Computation
📈 Citations: 7
Influential: 0
📄 PDF
🤖 AI Summary
Accurately estimating the causal effects of long-term product interventions—such as UI redesigns or recommendation algorithm updates—in digital platforms remains challenging, as conventional short-term A/B tests fail to capture delayed and evolving impacts. To address this, we propose the first causal inference framework specifically designed for estimating long-term treatment effects. Our approach disentangles time-varying confounding from lagged treatment effects by explicitly modeling treatment duration as a key covariate. It integrates structural time-series modeling, doubly robust estimation, and dynamic causal graphs to enable counterfactual effect estimation without requiring costly long-duration experiments. Evaluated on real-world platform data, our method reduces long-term effect estimation error by 42% and achieves high-fidelity predictions across core metrics—including user retention rate and click-through rate—thereby significantly improving both the reliability and efficiency of long-horizon strategy evaluation.
📝 Abstract
Randomized controlled trials (RCTs), also known as A/B tests, have become the gold standard for evaluating the effectiveness of product changes on digital platforms. Accurately estimating the effects of long-term treatments still remains a challenge. Product updates such as new user interfaces or recommendation algorithms are intended to persist in the system for an extended period. However, A/B testing is typically conducted for short durations, often less than two weeks, to facilitate rapid product iterations. Conducting lengthy experiments to capture the long-term impact of product changes becomes impractical due to potential negative impacts on user experiences, high opportunity costs associated with user traffic, and delays in decision-making processes.
Problem

Research questions and friction points this paper is trying to address.

Estimating long-term treatment effects from short-term A/B testing data
Decomposing long-term effects using user attributes and short-term metrics
Validating framework with large-scale real-world experiments on WeChat
Innovation

Methods, ideas, or system contributions that make the work stand out.

Longitudinal surrogate framework for long-term effects
Decomposes effects using user attributes and short-term metrics
Validated with large-scale experiments on WeChat platform
🔎 Similar Papers
No similar papers found.
S
Shan Huang
The University of Hong Kong
C
Chen Wang
The University of Hong Kong
Y
Yuan Yuan
University of California, Davis
Jinglong Zhao
Jinglong Zhao
Boston University
Operations ResearchEconometricsOnline Platforms
B
Brocco (Jingjing) Zhang
Tencent, Inc.