BdSL-SPOTER: A Transformer-Based Framework for Bengali Sign Language Recognition with Cultural Adaptation

📅 2025-11-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of strong cultural specificity and scarce annotated data in Bangla Sign Language (BdSL) recognition, this paper proposes a lightweight pose-driven Transformer framework. Methodologically: (i) a culturally adaptive gesture preprocessing module is introduced to capture region-specific BdSL characteristics; (ii) an optimized learnable positional encoding scheme is designed to enhance spatiotemporal modeling of skeletal keypoints; and (iii) a curriculum learning strategy is integrated to improve generalization and convergence speed under low-data regimes. The architecture employs only four compact Transformer encoder layers, significantly reducing parameter count and FLOPs. Evaluated on the BdSLW60 benchmark, the model achieves 97.92% Top-1 accuracy while accelerating inference by 32% over prior approaches. This work delivers an efficient, deployable solution for low-resource sign language recognition, balancing accuracy, computational efficiency, and cultural adaptability.

Technology Category

Application Category

📝 Abstract
We introduce BdSL-SPOTER, a pose-based transformer framework for accurate and efficient recognition of Bengali Sign Language (BdSL). BdSL-SPOTER extends the SPOTER paradigm with cultural specific preprocessing and a compact four-layer transformer encoder featuring optimized learnable positional encodings, while employing curriculum learning to enhance generalization on limited data and accelerate convergence. On the BdSLW60 benchmark, it achieves 97.92% Top-1 validation accuracy, representing a 22.82% improvement over the Bi-LSTM baseline, all while keeping computational costs low. With its reduced number of parameters, lower FLOPs, and higher FPS, BdSL-SPOTER provides a practical framework for real-world accessibility applications and serves as a scalable model for other low-resource regional sign languages.
Problem

Research questions and friction points this paper is trying to address.

Recognizing Bengali Sign Language with cultural adaptation
Improving accuracy on limited data using curriculum learning
Providing efficient framework for low-resource sign languages
Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer-based pose framework for sign language recognition
Cultural preprocessing and compact four-layer encoder design
Curriculum learning enhances generalization on limited data
🔎 Similar Papers
No similar papers found.
S
Sayad Ibna Azad
Department of Computer Science and Engineering, Islamic University of Technology, Board Bazar, Gazipur-1704, Bangladesh
Md. Atiqur Rahman
Md. Atiqur Rahman
Lecturer, Islamic University of Technology
Computer VisionFew Shot LearningFedearated LearningSelf Supervised Learning