🤖 AI Summary
This work addresses two key challenges in embodied emotion recognition—namely, the difficulty of identifying emotions grounded in bodily sensations and physical experiences, and the limitation of existing datasets to coarse-grained categories (e.g., Ekman’s six basic emotions). To this end, we introduce CHEER-Ekman, the first fine-grained embodied emotion classification dataset. Methodologically, we extend embodied emotion modeling to a six-class framework for the first time and propose a large language model (LLM)-based Best-Worst Scaling (BWS) annotation paradigm. We further demonstrate that simplified prompting and chain-of-thought (CoT) reasoning significantly enhance small-model performance. Experiments show that CHEER-Ekman effectively supports embodied emotion modeling; our approach outperforms supervised learning baselines on the new benchmark; and optimized small models achieve accuracy comparable to LLMs—offering an efficient, resource-conscious alternative for deployment-constrained scenarios.
📝 Abstract
Emotions manifest through physical experiences and bodily reactions, yet identifying such embodied emotions in text remains understudied. We present an embodied emotion classification dataset, CHEER-Ekman, extending the existing binary embodied emotion dataset with Ekman's six basic emotion categories. Using automatic best-worst scaling with large language models, we achieve performance superior to supervised approaches on our new dataset. Our investigation reveals that simplified prompting instructions and chain-of-thought reasoning significantly improve emotion recognition accuracy, enabling smaller models to achieve competitive performance with larger ones.