🤖 AI Summary
Public health has long lacked standardized, machine-readable formats for case and syndrome definitions, severely hindering cross-system interoperability, epidemiological analysis, qualitative data sharing, and AI modeling. To address this, we propose OpenSyndrome Schema—the first open, structured specification for syndrome definitions—and construct the first comprehensive, standardized dataset of case definitions covering multiple diseases. We further develop an NLP-based automated conversion pipeline and a collaborative platform that maps human-readable definitions to executable logical representations, enabling consistent expression, version-controlled management, and cross-system deployment. All artifacts are released as open-source software and data at https://opensyndrome.org. This work significantly enhances public health data integration efficiency and AI readiness, establishing foundational infrastructure for global disease surveillance and intelligent analytics.
📝 Abstract
Case definitions are essential for effectively communicating public health threats. However, the absence of a standardized, machine-readable format poses significant challenges to interoperability, epidemiological research, the exchange of qualitative data, and the effective application of computational analysis methods, including artificial intelligence (AI). This complicates comparisons and collaborations across organizations and regions, limits data integration, and hinders technological innovation in public health. To address these issues, we propose the first open, machine-readable format for representing case and syndrome definitions. Additionally, we introduce the first comprehensive dataset of standardized case definitions and tools to convert existing human-readable definitions into machine-readable formats. We also provide an accessible online platform for browsing, analyzing, and contributing new definitions, available at https://opensyndrome.org. The Open Syndrome Definition format enables consistent, scalable use of case definitions across systems, unlocking AI's potential to strengthen public health preparedness and response. The source code for the format can be found at https://github.com/OpenSyndrome/schema under the MIT license.