A Fast and Minimal System to Identify Depression Using Smartphones: Explainable Machine Learning-Based Approach

📅 2025-08-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing depression detection systems require prolonged data collection, hindering timely early screening. To address this challenge—particularly in resource-constrained settings—we propose a minimalist, rapid depression screening system tailored for students. The system leverages a smartphone application to capture one second of behavioral data reflecting the past seven days. It employs multi-strategy feature selection (including Boruta) and LightGBM modeling, augmented with SHAP-based interpretability analysis to identify salient depression-associated behavioral markers. The resulting lightweight model achieves a balanced accuracy of 77.9% using only a small set of stable features; the base LightGBM model attains an F1-score of 78.5% and sensitivity of 82.4%. This approach significantly improves screening speed and deployability while ensuring transparency and low infrastructure requirements—enabling efficient, interpretable, and accessible early intervention for at-risk student populations.

Technology Category

Application Category

📝 Abstract
Background: Existing robust, pervasive device-based systems developed in recent years to detect depression require data collected over a long period and may not be effective in cases where early detection is crucial. Objective: Our main objective was to develop a minimalistic system to identify depression using data retrieved in the fastest possible time. Methods: We developed a fast tool that retrieves the past 7 days' app usage data in 1 second (mean 0.31, SD 1.10 seconds). A total of 100 students from Bangladesh participated in our study, and our tool collected their app usage data. To identify depressed and nondepressed students, we developed a diverse set of ML models. We selected important features using the stable approach, along with 3 main types of feature selection (FS) approaches. Results: Leveraging only the app usage data retrieved in 1 second, our light gradient boosting machine model used the important features selected by the stable FS approach and correctly identified 82.4% (n=42) of depressed students (precision=75%, F1-score=78.5%). Moreover, after comprehensive exploration, we presented a parsimonious stacking model where around 5 features selected by the all-relevant FS approach Boruta were used in each iteration of validation and showed a maximum precision of 77.4% (balanced accuracy=77.9%). A SHAP analysis of our best models presented behavioral markers that were related to depression. Conclusions: Due to our system's fast and minimalistic nature, it may make a worthwhile contribution to identifying depression in underdeveloped and developing regions. In addition, our detailed discussion about the implication of our findings can facilitate the development of less resource-intensive systems to better understand students who are depressed.
Problem

Research questions and friction points this paper is trying to address.

Develops minimal system for rapid depression detection
Uses smartphone app data collected within seconds
Applies explainable ML models to identify depression
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses app usage data for depression detection
Employs explainable machine learning models
Leverages fast one-second data retrieval
🔎 Similar Papers
No similar papers found.
M
Md Sabbir Ahmed
Design Inclusion and Access Lab, North South University, Dhaka, Bangladesh
Nova Ahmed
Nova Ahmed
DIAL, North South University, Bangladesh, https://orcid.org/0000-0002-7715-1742
ICTDLow cost IoTFeminist HCISocial Justice