Reproducible workflow for online AI in digital health

📅 2025-09-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In digital health interventions, online AI systems must balance dynamic adaptability with scientific reproducibility—yet current practices fail to ensure cross-iteration data traceability, algorithmic auditability, and result comparability. This paper introduces the first end-to-end reproducible workflow for online AI, spanning its entire lifecycle. It integrates streaming data processing, versioned data storage, algorithmic audit logging, containerized deployment, and automated experiment tracking. Validated through multiple real-world deployments, the workflow enables complete, time-stamped recording and retrospective analysis of both data and model behavior across iterative updates. It significantly enhances algorithmic transparency, experimental reproducibility, and regulatory compliance. The core contribution lies in bridging the gap between adaptive learning and scientific rigor, establishing a methodological foundation and engineering paradigm for trustworthy, evolution-aware online AI in digital health.

Technology Category

Application Category

📝 Abstract
Online artificial intelligence (AI) algorithms are an important component of digital health interventions. These online algorithms are designed to continually learn and improve their performance as streaming data is collected on individuals. Deploying online AI presents a key challenge: balancing adaptability of online AI with reproducibility. Online AI in digital interventions is a rapidly evolving area, driven by advances in algorithms, sensors, software, and devices. Digital health intervention development and deployment is a continuous process, where implementation - including the AI decision-making algorithm - is interspersed with cycles of re-development and optimization. Each deployment informs the next, making iterative deployment a defining characteristic of this field. This iterative nature underscores the importance of reproducibility: data collected across deployments must be accurately stored to have scientific utility, algorithm behavior must be auditable, and results must be comparable over time to facilitate scientific discovery and trustworthy refinement. This paper proposes a reproducible scientific workflow for developing, deploying, and analyzing online AI decision-making algorithms in digital health interventions. Grounded in practical experience from multiple real-world deployments, this workflow addresses key challenges to reproducibility across all phases of the online AI algorithm development life-cycle.
Problem

Research questions and friction points this paper is trying to address.

Balancing adaptability with reproducibility in online AI
Ensuring algorithm auditability and data comparability over time
Developing a reproducible workflow for digital health AI
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reproducible workflow for online AI
Continuous learning with streaming data
Life-cycle development and deployment phases
🔎 Similar Papers
No similar papers found.
Susobhan Ghosh
Susobhan Ghosh
Harvard University
Reinforcement LearningMobile Health
B
Bhanu T. Gulapalli
Department of Computer Science, Harvard University, Cambridge, MA, USA, 02135
D
Daiqi Gao
Department of Statistics, Harvard University, Cambridge, MA, USA, 02135
A
Asim Gazi
Department of Computer Science, Harvard University, Cambridge, MA, USA, 02135
A
Anna Trella
Department of Computer Science, Harvard University, Cambridge, MA, USA, 02135
Ziping Xu
Ziping Xu
Postdoc Fellow at Harvard University
Statistical Reinforcement learningMachine Learning TheoryMobile Health
K
Kelly Zhang
Department of Mathematics, Imperial College London, London, UK
S
Susan A. Murphy
Department of Computer Science, Harvard University, Cambridge, MA, USA, 02135