🤖 AI Summary
Spatial dependence in individual economic outcomes across geographic units biases conventional standard error corrections—such as clustering or distance-based decay—that rely on pre-specified administrative boundaries or parametric functional forms. To address this, we propose Thresholding Multiple Outcomes (TMO): a method that jointly estimates the cross-sectional correlation structure shared across multiple outcome variables observed at the same unit, via multivariate covariance thresholding, nonparametric correlation matrix modeling, and robust reweighting—obviating the need for ex ante geographic partitioning. Monte Carlo simulations and county-level regressions in the U.S. demonstrate that TMO substantially reduces standard error bias. Applied to nine recent empirical studies, TMO alters statistical significance for over 60% of estimated effects, underscoring its substantial improvement in the robustness of causal inference under spatial dependence.
📝 Abstract
Empirical research in economics often examines the behavior of agents located in a geographic space. In such cases, statistical inference is complicated by the interdependence of economic outcomes across locations. A common approach to account for this dependence is to cluster standard errors based on a predefined geographic partition. A second strategy is to model dependence in terms of the distance between units. Dependence, however, does not necessarily stop at borders and is typically not determined by distance alone. This paper introduces a method that leverages observations of multiple outcomes to adjust standard errors for cross-sectional dependence. Specifically, a researcher, while interested in a particular outcome variable, often observes dozens of other variables for the same units. We show that these outcomes can be used to estimate dependence under the assumption that the cross-sectional correlation structure is shared across outcomes. We develop a procedure, which we call Thresholding Multiple Outcomes (TMO), that uses this estimate to adjust standard errors in a given regression setting. We show that adjustments of this form can lead to sizable reductions in the bias of standard errors in calibrated U.S. county-level regressions. Re-analyzing nine recent papers, we find that the proposed correction can make a substantial difference in practice.