π€ AI Summary
This study addresses the challenge of anomaly detection in physiological sensor networks, where missing data or device failures often lead to false alarms, necessitating methods that simultaneously achieve high accuracy and clinical interpretability. The authors propose DEM, a three-stage βglass-boxβ framework: it first employs XGBoost as a high-performance expert model, then models its residuals and distills them into an interpretable decision tree, ensuring that predictions are inherently explainable. A novel distillation fidelity metric is introduced to enable user-controlled trade-offs between accuracy and interpretability. Evaluated on four physiological datasets, DEM achieves AUC scores of 0.9964 for clinical anomalies and 0.9047 for stress events, with inference speeds 1,235 times faster than SHAP, demonstrating its suitability for real-time monitoring applications.
π Abstract
Anomaly detection in physiological sensor data from Wireless Body Area Networks (WBANs) can be caused by sensor faults, network disruptions, or missing data, leading to false alarms. Hence, it demands both high predictive accuracy and clinically interpretable explanations. Existing approaches rely either on black-box models that achieve strong performance but offer no transparency, or on post-prediction explanation methods such as SHAP and LIME. In this paper, we propose the Distilled Explanation Model (DEM), a three-stage glass-box framework that distills the non-linear knowledge of a gradient boosting expert into an interpretable decision tree operating on residuals relative to a linear baseline, so that the explanation is not an approximation but the prediction itself. DEM introduces a novel distillation fidelity metric that quantifies how faithfully the explanation tree captures the expert model's non-linear contribution, providing a principled measure of explanation trustworthiness absent from prior interpretable models. Evaluated across four physiological datasets, including MIMIC-IV, WESAD, eICU, and an in-house SmartNet WBAN corpus, DEM achieves an AUC of 0.9964 on clinical contextual anomaly detection and 0.9047 on wearable stress detection while producing human-readable if-then rules at a controllable depth. Inference requires 0.17ms per 1000 samples, rendering DEM 1235x faster than SHAP-based post-hoc explanation and suitable for real-time physiological monitoring. Ablation studies confirm that the XGBoost distillation step provides measurable gains over naive residual fitting, and depth-sensitivity analysis demonstrates an explicit, user-controlled accuracy-interpretability trade-off unique to DEM among existing intrinsically interpretable models.