đ¤ AI Summary
This study addresses the challenge of inferring athletesâ action intentions from video data. We propose a pose-driven weakly supervised learning framework for racket-sport intent recognitionâspecifically, distinguishing offensive versus defensive stroke stylesâand fatigue state assessment. To mitigate difficulties in annotating sensitive human motion videos and comply with privacy constraints, our method jointly leverages single-frame pose estimation and temporal modeling of pose sequences, exploiting naturally diverse, emotion-conditioned pose trajectories in sports scenariosâwithout requiring fine-grained action labels. Experiments on real-world match videosâcharacterized by high noise and variabilityâdemonstrate strong robustness: the model achieves an F1 score of 75.3% and an AUC-ROC of 82.1%, significantly outperforming fully supervised baselines. To our knowledge, this is the first work to systematically apply weakly supervised temporal pose modeling to competitive intent understanding, establishing a novel paradigm for non-intrusive, interpretable sports behavior analysis.
đ Abstract
Posture-based mental state inference has significant potential in diagnosing fatigue, preventing injury, and enhancing performance across various domains. Such tools must be research-validated with large datasets before being translated into practice. Unfortunately, such vision diagnosis faces serious challenges due to the sensitivity of human subject data. To address this, we identify sports settings as a viable alternative for accumulating data from human subjects experiencing diverse emotional states. We test our hypothesis in the game of cricket and present a posture-based solution to identify human intent from activity videos. Our method achieves over 75% F1 score and over 80% AUC-ROC in discriminating aggressive and defensive shot intent through motion analysis. These findings indicate that posture leaks out strong signals for intent inference, even with inherent noise in the data pipeline. Furthermore, we utilize existing data statistics as weak supervision to validate our findings, offering a potential solution for overcoming data labelling limitations. This research contributes to generalizable techniques for sports analytics and also opens possibilities for applying human behavior analysis across various fields.