Deep Learning for Metabolic Rate Estimation from Biosignals: A Comparative Study of Architectures and Signal Selection

📅 2025-11-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the problem of accurate metabolic equivalent of task (MET) estimation from wearable physiological signals—namely heart rate, respiration, and acceleration. We systematically decouple and evaluate the independent effects of neural architecture design and input signal selection. We propose the first adaptive modeling framework that jointly accounts for activity intensity and inter-individual variability. Our analysis identifies minute ventilation as the optimal single-signal feature. Among architectures—including Transformer, CNN, ResNet, and attention-augmented variants—the Transformer achieves the lowest overall RMSE (0.87 W/kg) and exceptional accuracy during light activities (0.29 W/kg; NRMSE = 0.04). Furthermore, multi-signal fusion substantially enhances performance of lightweight models. All code and pre-trained models are publicly released.

Technology Category

Application Category

📝 Abstract
Energy expenditure estimation aims to infer human metabolic rate from physiological signals such as heart rate, respiration, or accelerometer data, and has been studied primarily with classical regression methods. The few existing deep learning approaches rarely disentangle the role of neural architecture from that of signal choice. In this work, we systematically evaluate both aspects. We compare classical baselines with newer neural architectures across single signals, signal pairs, and grouped sensor inputs for diverse physical activities. Our results show that minute ventilation is the most predictive individual signal, with a transformer model achieving the lowest root mean square error (RMSE) of 0.87 W/kg across all activities. Paired and grouped signals, such as those from the Hexoskin smart shirt (five signals), offer good alternatives for faster models like CNN and ResNet with attention. Per-activity evaluation revealed mixed outcomes: notably better results in low-intensity activities (RMSE down to 0.29 W/kg; NRMSE = 0.04), while higher-intensity tasks showed larger RMSE but more comparable normalized errors. Finally, subject-level analysis highlights strong inter-individual variability, motivating the need for adaptive modeling strategies. Our code and models will be publicly available at https://github.com/Sarvibabakhani/deeplearning-biosignals-ee .
Problem

Research questions and friction points this paper is trying to address.

Systematically evaluates neural architectures and signal selection for metabolic rate estimation
Compares deep learning models across single/paired/grouped biosignals during physical activities
Analyzes performance variations across activity intensities and individual subject differences
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer model achieves best accuracy with minute ventilation
CNN and ResNet with attention work well with grouped signals
Study reveals need for adaptive modeling across individuals
🔎 Similar Papers
No similar papers found.
S
Sarvenaz Babakhani
Institute for Artificial Intelligence, University of Stuttgart, Stuttgart, Germany
D
David Remy
Institute for Adaptive Mechanical Systems, University of Stuttgart, Stuttgart, Germany
Alina Roitberg
Alina Roitberg
University of Stuttgart
Deep LearningHuman Activity RecognitionMultimodal RecognitionAssistive Robotics