Collaborative estimation and evaluation of SARS-CoV-2 variant nowcasting in the United States

πŸ“… 2026-06-05
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Existing platforms struggle to collaboratively forecast the dynamic trajectories of SARS-CoV-2 variants across U.S. states. This work proposes the first state-level nowcasting Hub specifically designed for viral variants, integrating a multi-model framework grounded in genomic sequencing data to estimate the current relative abundance of designated variants and incorporating a scoring system to evaluate model performance. The Hub addresses a critical gap in real-time variant surveillance by enabling localized modeling alongside national benchmark comparisons. Results demonstrate that nationally aggregated benchmark models exhibit overall robustness, while state-specific local models generally outperform national onesβ€”except in regions with low sequencing volumes, where prediction stability markedly declines.
πŸ“ Abstract
The ability to estimate and predict pathogen variant dynamics can inform public health responses, including planning for increased transmission or severity, shifts in population immunity, or changes to vaccine or therapeutic effectiveness. The COVID-19 pandemic demonstrated the importance of monitoring SARS-CoV-2 variant evolution through viral genome sequencing, enabling predictive models to estimate variant frequencies in the recent past, present, and short-term future. Collaborative forecasting Hubs provided a valuable way to centralize predictive modeling of epidemiological indicators such as cases, hospitalizations, and deaths during the pandemic; however, none existed for variant dynamics. Here, we discuss the creation of the United States SARS-CoV-2 Variant Nowcast Hub, designed to solicit estimates of the relative abundance of a specified set of SARS-CoV-2 variants at the U.S. state level. We discuss the design decisions and challenges in building the Hub and its scoring procedures. Using submissions from the Hub's first respiratory virus season (nowcast dates October 9th, 2024 to June 4th, 2025), we evaluate five individual models and a baseline model. We found that the baseline model, which pools sequences across the U.S., performs well overall, with most individual models performing similarly or slightly worse. Locations with lower sequencing volumes exhibited greater variability in model performance. Models submitted for a single location outperformed those submitted for all locations, potentially due to greater timeliness and magnitude of local data. Much remains to be investigated regarding relative model performance across different phases of variant emergence, and we conclude by proposing future directions within and beyond this Hub.
Problem

Research questions and friction points this paper is trying to address.

SARS-CoV-2 variant
nowcasting
collaborative forecasting
variant dynamics
epidemiological modeling
Innovation

Methods, ideas, or system contributions that make the work stand out.

variant nowcasting
collaborative forecasting
SARS-CoV-2 surveillance
model evaluation
genomic epidemiology
I
Isaac MacArthur
Department of Mathematics and Statistics, University of Massachusetts, Amherst, United States of America
T
Thomas Robacker
Department of Mathematics and Statistics, University of Massachusetts, Amherst, United States of America
B
Bren Case
Epidemiology & Biostatistics, University of Georgia, Athens, GA
S
Spencer J. Fox
School of Informatics, Computing, and Cybersystems, Northern Arizona University, Flagstaff, Arizona
D
Dylan H. Morris
The Center for Forecasting and Outbreak Analytics, Centers for Disease Control and Prevention
E
Evan L. Ray
Department of Mathematics and Statistics, University of Massachusetts, Amherst, United States of America
B
Benjamin Rogers
Department of Mathematics and Statistics, University of Massachusetts, Amherst, United States of America
B
Becky Sweger
Department of Mathematics and Statistics, University of Massachusetts, Amherst, United States of America
N
Natalie M. Linton
California Department of Public Health
J
John L. Huddleston
Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, Washington
A
Andrew Magee
The Center for Forecasting and Outbreak Analytics, Centers for Disease Control and Prevention
Z
Zachary Susswein
The Center for Forecasting and Outbreak Analytics, Centers for Disease Control and Prevention
J
Jover Lee
Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, Washington
Trevor Bedford
Trevor Bedford
Professor, Fred Hutchinson Cancer Center
evolutionphylogeneticsinfectious diseasevisualization
M
Marlin D. Figgins
Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, Washington
E
Ehsan Suez
Institute of Bioinformatics, University of Georgia, Athens, Georgia
R
Rajath Prabhakar
School of Informatics, Computing, and Cybersystems, Northern Arizona University, Flagstaff, Arizona
T
Tomas Leon
California Department of Public Health
B
Brent Siegel
California Department of Public Health
M
Mugdha Thakur
California Department of Public Health
C
Christopher M. Hoover
California Department of Public Health
R
Rahil Ryder
California Department of Public Health
J
Jesse Elder
California Department of Public Health
M
Michael Kupperman
Theoretical Biology and Biophysics, Los Alamos National Laboratory, New Mexico
R
Ruian Ke
Theoretical Biology and Biophysics, Los Alamos National Laboratory, New Mexico