On the consistent and scalable detection of spatial patterns.

📅 2026-02-02
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing methods for spatial pattern detection lack statistical consistency and struggle to scale to large-scale spatial omics data. This work unifies prevailing approaches within a quadratic form statistical framework, establishing—for the first time—a theoretical foundation for consistent spatial pattern detection. We derive general conditions under which such methods achieve consistency and propose a scalable, corrected algorithm that efficiently and robustly handles datasets with millions of spatial locations. The method’s effectiveness is validated on real single-cell lineage tracing data, demonstrating substantial improvements in both reliability and scalability for large-scale spatial omics analysis.

Technology Category

Application Category

📝 Abstract
Detecting spatial patterns is fundamental to scientific discovery, yet current methods lack statistical consensus and face computational barriers when applied to large-scale spatial omics datasets. We unify major approaches through a single quadratic form and derive general consistency conditions. We reveal that several widely used methods, including Moran's I, are inconsistent, and propose scalable corrections. The resulting test enables robust pattern detection across millions of spatial locations and single-cell lineage-tracing datasets.
Problem

Research questions and friction points this paper is trying to address.

spatial patterns
statistical consistency
scalability
spatial omics
computational barriers
Innovation

Methods, ideas, or system contributions that make the work stand out.

spatial pattern detection
statistical consistency
quadratic form
scalable correction
spatial omics
J
Jiayu Su
Program for Mathematical Genomics, Columbia University, New York, NY, USA
J
Jun Hou Fung
Program for Mathematical Genomics, Columbia University, New York, NY, USA
Haoyu Wang
Haoyu Wang
Department of Mathematics, Yale University
ProbabilityHigh-Dimensional StatisticsRandom Matrix Theory
D
Dian Yang
Department of Systems Biology, Columbia University, New York, NY, USA
David A. Knowles
David A. Knowles
Columbia University & New York Genome Center
Machine learningComputational genomicsRNA splicing
Raul Rabadan
Raul Rabadan
Professor, Columbia University
Mathematical BiologyGenomicsTheoretical Physics