🤖 AI Summary
To address the challenges of delayed deterioration detection in emergency department (ED) patients and weak multimodal fusion of physiological waveforms and clinical text, this study introduces the first standardized multimodal ED benchmark—comprising synchronized ECG, PPG, and ABP waveforms paired with diagnostic and intervention annotations. We propose a waveform-aware cross-modal alignment architecture integrating temporal convolutional networks (TCNs) and Transformer encoders, enhanced by contrastive learning to achieve fine-grained waveform representation learning and precise clinical semantic alignment. Evaluated on five acute condition classification tasks, our method achieves an average 12.3% F1-score improvement over baselines, with inference latency under 200 ms. The system has been successfully deployed and validated across three tertiary hospitals’ EDs, demonstrating significant enhancement in real-time clinical decision support capabilities.