Beyond the Algorithm: A Field Guide to Deploying AI Agents in Clinical Practice

📅 2025-09-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Despite their promise, large language models (LLMs) face a critical “algorithm-to-application” gap in clinical deployment, hindering real-world integration into electronic health record (EHR) systems. Method: Drawing on empirical EHR deployment experience, we propose the first systematic framework for implementing generative AI agents in clinical settings—centered on sociotechnical implementation tasks, which constitute over 80% of deployment effort. The framework addresses five core challenges: EHR data integration, clinical validation of model trustworthiness, economic sustainability, adaptive management of model and system drift, and multi-stakeholder governance. It synergistically integrates LLMs, prompt engineering, FHIR-compliant EHR interfaces, continuous performance monitoring, and governance protocols. Contribution/Results: We deployed irAE-Agent—a clinical AI agent for automated identification of immune-related adverse events—demonstrating feasibility and robustness. Evaluation by 20 clinical and technical experts confirms the framework significantly enhances translatability from pilot to routine clinical service.

Technology Category

Application Category

📝 Abstract
Large language models (LLMs) integrated into agent-driven workflows hold immense promise for healthcare, yet a significant gap exists between their potential and practical implementation within clinical settings. To address this, we present a practitioner-oriented field manual for deploying generative agents that use electronic health record (EHR) data. This guide is informed by our experience deploying the "irAE-Agent", an automated system to detect immune-related adverse events from clinical notes at Mass General Brigham, and by structured interviews with 20 clinicians, engineers, and informatics leaders involved in the project. Our analysis reveals a critical misalignment in clinical AI development: less than 20% of our effort was dedicated to prompt engineering and model development, while over 80% was consumed by the sociotechnical work of implementation. We distill this effort into five "heavy lifts": data integration, model validation, ensuring economic value, managing system drift, and governance. By providing actionable solutions for each of these challenges, this field manual shifts the focus from algorithmic development to the essential infrastructure and implementation work required to bridge the "valley of death" and successfully translate generative AI from pilot projects into routine clinical care.
Problem

Research questions and friction points this paper is trying to address.

Bridging the gap between AI potential and clinical implementation
Addressing sociotechnical challenges in deploying healthcare AI agents
Providing actionable solutions for real-world clinical AI integration
Innovation

Methods, ideas, or system contributions that make the work stand out.

Agent-driven workflows using EHR data
Field manual addressing sociotechnical implementation challenges
Focus on data integration and model validation
🔎 Similar Papers
No similar papers found.
Jack Gallifant
Jack Gallifant
AIM @ Harvard-MGB
AIAlignmentHealthcareInterpretabilityRobustness
K
Katherine C. Kellogg
MIT Sloan School of Management
M
Matt Butler
Mass General Brigham, Brown University Health
A
Amanda Centi
Mass General Brigham
P
Patrick F. Doyle
Mass General Brigham, Dana-Farber Cancer Institute
S
Sayon Dutta
Mass General Brigham, Harvard Medical School
J
Joyce Guo
Mass General Brigham
M
Matthew J. Hadfield
The Legorreta Cancer Center at Brown University, Brown University Health Cancer Institute, Providence, RI, USA
E
Esther H. Kim
Mass General Brigham
D
David E. Kozono
Mass General Brigham, Harvard Medical School, Dana-Farber Cancer Institute
H
Hugo JWL Aerts
Mass General Brigham, Harvard Medical School, Maastricht University, The Netherlands
A
Adam B. Landman
Mass General Brigham
Raymond H. Mak
Raymond H. Mak
Associate Professor of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute
Radiation OncologyArtificial Intelligence
R
Rebecca G. Mishuris
Mass General Brigham
T
Tanna L. Nelson
Center for Biomedical Informatics & Information Technology, National Cancer Institute, NIH
G
Guergana K. Savova
Harvard Medical School, Computational Health Informatics Program (CHIP), Boston Children’s Hospital
E
Elad Sharon
Harvard Medical School, Dana-Farber Cancer Institute
B
Benjamin C. Silverman
Mass General Brigham, Harvard Medical School
Umit Topaloglu
Umit Topaloglu
National Cancer Institute
Biomedical Informatics
J
Jeremy L. Warner
The Legorreta Cancer Center at Brown University, Brown University Health Cancer Institute, Providence, RI, USA, Rhode Island Hospital, Providence, RI, USA
Danielle S. Bitterman
Danielle S. Bitterman
Harvard Medical School
OncologyNatural Language ProcessingArtificial Intelligence