Few-Shot Optimization for Sensor Data Using Large Language Models: A Case Study on Fatigue Detection

📅 2025-05-24
🏛️ Italian National Conference on Sensors
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
For sensor-based few-shot classification tasks—such as fatigue detection—that exhibit high inter-subject variability and low signal-to-noise ratios, conventional example selection methods struggle to identify representative samples. To address this, we propose HED-LM: a hybrid example selection framework that integrates Euclidean-distance-based preliminary filtering with temporal feature embedding, and—novelty introduced herein—leverages a large language model (GPT-4) for context-aware semantic relevance re-ranking of candidate examples. This paradigm synergistically combines numerical similarity and semantic consistency. Evaluated on accelerometer-based fatigue detection, HED-LM achieves a macro-F1 score of 69.13±10.71%, outperforming random selection and distance-only baselines by 16.6% and 2.3%, respectively. The method significantly enhances prompt quality in few-shot settings, thereby improving robustness and generalization under limited labeled data.

Technology Category

Application Category

📝 Abstract
In this paper, we propose a novel few-shot optimization with Hybrid Euclidean Distance with Large Language Models (HED-LM) to improve example selection for sensor-based classification tasks. While few-shot prompting enables efficient inference with limited labeled data, its performance largely depends on the quality of selected examples. HED-LM addresses this challenge through a hybrid selection pipeline that filters candidate examples based on Euclidean distance and re-ranks them using contextual relevance scored by large language models (LLMs). To validate its effectiveness, we apply HED-LM to a fatigue detection task using accelerometer data characterized by overlapping patterns and high inter-subject variability. Unlike simpler tasks such as activity recognition, fatigue detection demands more nuanced example selection due to subtle differences in physiological signals. Our experiments show that HED-LM achieves a mean macro F1-score of 69.13 ± 10.71%, outperforming both random selection (59.30 ± 10.13%) and distance-only filtering (67.61 ± 11.39%). These represent relative improvements of 16.6% and 2.3%, respectively. The results confirm that combining numerical similarity with contextual relevance improves the robustness of few-shot prompting. Overall, HED-LM offers a practical solution to improve performance in real-world sensor-based learning tasks and shows potential for broader applications in healthcare monitoring, human activity recognition, and industrial safety scenarios.
Problem

Research questions and friction points this paper is trying to address.

Improving few-shot example selection for sensor-based classification
Enhancing fatigue detection using hybrid distance-contextual filtering
Optimizing sensor data performance with LLM-augmented selection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid Euclidean Distance with LLMs
Filters and re-ranks examples contextually
Improves few-shot sensor data classification
🔎 Similar Papers
No similar papers found.
E
Elsen Ronando
Graduate School of Life Science and Systems Engineering, Kyushu Institute of Technology, 2-4 Hibikino, Wakamatsu Ward, Kitakyushu 808-0135, Japan; Department of Informatics, Universitas 17 Agustus 1945 Surabaya, Semolowaru No. 45, Kota Surabaya 60118, Indonesia
Sozo Inoue
Sozo Inoue
Kyushu Institute of Technology, Japan
Ubiquitous computing / pervasive healthcare / activity recognition / smart life care