🤖 AI Summary
This study addresses the unmet need for non-invasive, early screening of respiratory insufficiency (hypoxemia) by proposing the first end-to-end speech-driven intelligent detection system. Methodologically, it integrates deep learning modeling, multi-strategy speech data augmentation, cross-speaker transfer learning, and model lightweighting for efficient inference—ensuring both high accuracy and deployability in resource-constrained clinical settings. Key contributions include: (1) establishing a clinically viable speech-to-physiological-state mapping paradigm; (2) systematically documenting critical engineering insights on real-world data acquisition, noise-robust training, and edge-device deployment; and (3) achieving high sensitivity and specificity in empirical validation. The work delivers a reproducible technical pathway and an open-source implementation framework for speech-based respiratory health monitoring.
📝 Abstract
Respiratory insufficiency is a medic symptom in which a person gets a reduced amount of oxygen in the blood. This paper reports the experience of building SPIRA: an intelligent system for detecting respiratory insufficiency from voice. It compiles challenges faced in two succeeding implementations of the same architecture, summarizing lessons learned on data collection, training, and inference for future projects in similar systems.