Cracking the Code: Enhancing Implicit Hate Speech Detection through Coding Classification

2025-06-05
Proceedings of the 5th Workshop on Trustworthy NLP (TrustNLP 2025)
Citations: 0
Influential: 0
AI Summary
Implicit hate speech (im-HS) poses significant challenges for automated detection due to its semantic opacity and strong contextual dependency. To address this, we propose the first six-dimensional codetype encoding taxonomy, systematically modeling both semantic and rhetorical characteristics of im-HS. Crucially, we integrate this encoding strategy deeply into large language model (LLM) prompt engineering and semantic embedding, marking the first such fusion of codetype guidance with LLM-based inference. We develop a cross-lingual, prompt-driven encoding detection framework grounded in this principle. Evaluated on benchmark Chinese and English im-HS datasets, our method achieves absolute F1-score improvements of 4.2 to 7.8 percentage points over state-of-the-art baselines. These results robustly demonstrate the effectiveness and cross-lingual generalizability of codetype-guided LLMs for implicit hate speech identification.

๐Ÿ“ Abstract
The internet has become a hotspot for hate speech (HS), threatening societal harmony and individual well-being. While automatic detection methods perform well in identifying explicit hate speech (ex-HS), they struggle with more subtle forms, such as implicit hate speech (im-HS). We tackle this problem by introducing a new taxonomy for im-HS detection, defining six encoding strategies named codetypes. We present two methods for integrating codetypes into im-HS detection: 1) prompting large language models (LLMs) directly to classify sentences based on generated responses, and 2) using LLMs as encoders with codetypes embedded during the encoding process. Experiments show that the use of codetypes improves im-HS detection in both Chinese and English datasets, validating the effectiveness of our approach across different languages.
Problem

Research questions and friction points this paper is trying to address.

Improving detection of subtle implicit hate speech online
Introducing a new taxonomy with six coding strategies
Enhancing cross-language performance in hate speech detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Introducing a new taxonomy for im-HS detection
Prompting LLMs to classify sentences using codetypes
Embedding codetypes during LLM encoding process
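
The first integration method, prompting an LLM with codetypes and classifying from its generated response, might be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names `build_codetype_prompt` and `parse_label`, the prompt wording, and the yes/no answer format are all assumptions. The six codetypes are not named in this summary, so placeholder entries stand in for them.

```python
from typing import Mapping


def build_codetype_prompt(sentence: str, codetypes: Mapping[str, str]) -> str:
    """Build a classification prompt that lists codetype definitions before the sentence.

    The wording is hypothetical; the paper's actual prompt is not given in this summary.
    """
    lines = [
        "The following encoding strategies (codetypes) are used in implicit hate speech:"
    ]
    for name, description in codetypes.items():
        lines.append(f"- {name}: {description}")
    lines.append("Decide whether the sentence below is implicit hate speech.")
    lines.append("Answer with exactly one word: yes or no.")
    lines.append(f"Sentence: {sentence}")
    return "\n".join(lines)


def parse_label(response: str) -> bool:
    """Map a free-text model response to a binary label (True means im-HS)."""
    return response.strip().lower().startswith("yes")


# Placeholder codetypes: the paper defines six, but their names and
# definitions do not appear in this summary, so none are invented here.
PLACEHOLDER_CODETYPES = {
    f"codetype_{i}": f"<definition of codetype {i}>" for i in range(1, 7)
}

prompt = build_codetype_prompt("example sentence", PLACEHOLDER_CODETYPES)
```

The resulting `prompt` string would be sent to whichever LLM is being evaluated, and `parse_label` applied to the text it returns. The call to the model itself is left out because the summary does not say which models or APIs were used.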
Lu Wei
The University of Osaka, Osaka, Japan
Liangzhi Li
The University of Osaka, Osaka, Japan
Tong Xiang
Osaka University
Natural Language Processing, Computational Social Science
Xiao Liu
Meetyou AI Lab, Xiamen, China
Noa Garcia
Osaka University
computer vision, machine learning