Composite method for fast computation of individual level spatial epidemic models

๐Ÿ“… 2025-09-04
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Individual-level spatial epidemic models (ILMs) incur prohibitive computational costs for large populations and high-dimensional covariate settings. To address this, we propose the Composite Individual-Level Model (C-ILM): first, individuals are clustered into low-interference subpopulations using a Dirichlet Process Mixture Model (DPMM), which better captures heterogeneous spatial infection structures than conventional clustering methods (e.g., K-means); second, inter-cluster transmission is explicitly modeled via four โ€œspike functionsโ€, enabling parallel computation and composite likelihood inference. Evaluations on simulated data and the 2001 UK foot-and-mouth disease outbreak demonstrate that C-ILM achieves several-fold speedup over standard ILMs while improving model fit (e.g., WAIC) and posterior predictive accuracy. Thus, C-ILM establishes a scalable, high-fidelity paradigm for large-scale spatial epidemic modeling.

Technology Category

Application Category

๐Ÿ“ Abstract
Individual-level models, also known as ILMs, are commonly used in epidemics modelling, as they can flexibly incorporate individual-level covariates that influence susceptibility and transmissibility upon infection. However, inference for ILMs is computationally intensive, especially as the total population size increases and additional covariates are incorporated. We propose a composite method, the composite ILM (C-ILM), that clusters the population into minimally-interfered subpopulations, with between-cluster infections enabled through a ``spark function.'' This approach allows for parallel computation of subsets before aggregation. Focusing on C-ILM, we consider four ``spark functions'', and introduce a Dirichlet process mixture modelling (DPMM) algorithm for clustering. Simulation results indicate that, in addition to faster computation, C-ILM performs well in parameter estimation and posterior predictions. Furthermore, within C-ILM framework, DPMM algorithm demonstrates superior performance compared to the conventional $K$-means algorithm. We apply the methods to data from the 2001 UK foot-and-mouth disease outbreak. The results provide evidence that C-ILM is not only computationally efficient but also achieves a better model fit compared to the basic spatial ILM.
Problem

Research questions and friction points this paper is trying to address.

Accelerates computation of individual-level epidemic models
Enables parallel processing via population clustering
Improves parameter estimation and model fitting efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Composite ILM clusters population for parallel computation
Uses spark function for between-cluster infection modeling
Dirichlet process mixture algorithm improves clustering performance
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yirao Zhang
Department of Mathematics and Statistics, University of Calgary
Rob Deardon
Rob Deardon
University of Calgary
L
Lorna Deeth
Department of Mathematics and Statistics, University of Guelph