Effect of Activation Function and Model Optimizer on the Performance of Human Activity Recognition System Using Various Deep Learning Models

📅 2025-12-23
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Prior work on human activity recognition (HAR) in medical settings emphasizes model architecture while neglecting critical interactions among activation functions and optimizers. Method: This study systematically investigates the coupling effects of six activation–optimizer combinations (ReLU/Sigmoid/Tanh × SGD/Adam/RMSprop/Adagrad) on HAR performance across six activity classes, using BiLSTM and ConvLSTM architectures evaluated on HMDB51 and UCF101 subsets via cross-dataset experiments. Results: ConvLSTM with Adam or RMSprop achieves 99.00% accuracy with consistent performance across both datasets; BiLSTM attains 98.00% on UCF101 but drops sharply to 60.00% on HMDB51, demonstrating ConvLSTM’s superior robustness to activation–optimizer pairings. This work is the first to uncover the tripartite synergy among architecture, activation function, and optimizer in HAR, establishing a reproducible hyperparameter configuration paradigm for clinical deployment.

Technology Category

Application Category

📝 Abstract
Human Activity Recognition (HAR) plays a vital role in healthcare, surveillance, and innovative environments, where reliable action recognition supports timely decision-making and automation. Although deep learning-based HAR systems are widely adopted, the impact of Activation Functions (AFs) and Model Optimizers (MOs) on performance has not been sufficiently analyzed, particularly regarding how their combinations influence model behavior in practical scenarios. Most existing studies focus on architecture design, while the interaction between AF and MO choices remains relatively unexplored. In this work, we investigate the effect of three commonly used activation functions (ReLU, Sigmoid, and Tanh) combined with four optimization algorithms (SGD, Adam, RMSprop, and Adagrad) using two recurrent deep learning architectures, namely BiLSTM and ConvLSTM. Experiments are conducted on six medically relevant activity classes selected from the HMDB51 and UCF101 datasets, considering their suitability for healthcare-oriented HAR applications. Our experimental results show that ConvLSTM consistently outperforms BiLSTM across both datasets. ConvLSTM, combined with Adam or RMSprop, achieves an accuracy of up to 99.00%, demonstrating strong spatio-temporal learning capabilities and stable performance. While BiLSTM performs reasonably well on UCF101, with accuracy approaching 98.00%, its performance drops to approximately 60.00% on HMDB51, indicating limited robustness across datasets and weaker sensitivity to AF and MO variations. This study provides practical insights for optimizing HAR systems, particularly for real-world healthcare environments where fast and precise activity detection is critical.
Problem

Research questions and friction points this paper is trying to address.

Investigates activation functions and optimizers' impact on HAR performance.
Evaluates deep learning models for healthcare-oriented activity recognition tasks.
Analyzes combinations of AFs and MOs to optimize model behavior.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Evaluating activation functions and optimizers for deep learning models
Using ConvLSTM with Adam/RMSprop for high-accuracy activity recognition
Testing on medical activity datasets to optimize healthcare HAR systems
🔎 Similar Papers
No similar papers found.
S
Subrata Kumer Paula
Department of Computer Science and Engineering, Bangladesh Army University of Engineering&Technology (BAUET), Qadirabad, Dayarampur, Natore-6431, Bangladesh
D
Dewan Nafiul Islam Noora
Department of Computer Science and Engineering, University of Rajshahi, Rajshahi-6205, Bangladesh
R
Rakhi Rani Paula
Department of Computer Science and Engineering, Bangladesh Army University of Engineering&Technology (BAUET), Qadirabad, Dayarampur, Natore-6431, Bangladesh
M
Md. Ekramul Hamid
Department of Computer Science and Engineering, University of Rajshahi, Rajshahi-6205, Bangladesh
F
Fahmid Al Farid
Faculty of Computer Science and Informatics, Berlin School of Business and Innovation, Karl-Marx-Straße 97-99, Berlin, 12043, Germany
Hezerul Abdul Karim
Hezerul Abdul Karim
Professor, Faculty of Engineering, Multimedia University, Cyberjaya, Selangor, Malaysia
3D image and video codingvideo transmission over cognitive radioerror resiliencetelemetry
M
Md. Maruf Al Hossain Prince
Department of Computer Science and Engineering, Bangladesh Army University of Science and Technology (BAUST), Saidpur-5220, Bangladesh
A
Abu Saleh Musa Miah
Department of Computer Science and Engineering, University of Rajshahi, Rajshahi-6205, Bangladesh