SVC 2025: the First Multimodal Deception Detection Challenge

📅 2025-08-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing deepfake detection models exhibit strong performance within single domains but suffer from severe cross-domain generalization deficits. To address the challenge of multimodal (audio-visual-text) cross-domain deepfake detection, this work establishes the first benchmark challenge tailored to heterogeneous datasets and proposes a novel paradigm integrating deep multimodal fusion with rigorous cross-domain generalization evaluation. Innovatively, we introduce a distributionally robust evaluation framework that prioritizes model interpretability and deployability on unseen target domains. The challenge attracted 21 international teams, significantly advancing research on real-world generalization of multimodal deepfake detection. It provides a new benchmark and methodological foundation for trustworthy AI security verification.

Technology Category

Application Category

📝 Abstract
Deception detection is a critical task in real-world applications such as security screening, fraud prevention, and credibility assessment. While deep learning methods have shown promise in surpassing human-level performance, their effectiveness often depends on the availability of high-quality and diverse deception samples. Existing research predominantly focuses on single-domain scenarios, overlooking the significant performance degradation caused by domain shifts. To address this gap, we present the SVC 2025 Multimodal Deception Detection Challenge, a new benchmark designed to evaluate cross-domain generalization in audio-visual deception detection. Participants are required to develop models that not only perform well within individual domains but also generalize across multiple heterogeneous datasets. By leveraging multimodal data, including audio, video, and text, this challenge encourages the design of models capable of capturing subtle and implicit deceptive cues. Through this benchmark, we aim to foster the development of more adaptable, explainable, and practically deployable deception detection systems, advancing the broader field of multimodal learning. By the conclusion of the workshop competition, a total of 21 teams had submitted their final results. https://sites.google.com/view/svc-mm25 for more information.
Problem

Research questions and friction points this paper is trying to address.

Addressing cross-domain generalization in deception detection
Overcoming performance degradation from domain shifts
Leveraging multimodal data for subtle deceptive cues
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multimodal data integration for deception detection
Cross-domain generalization in audio-visual analysis
Adaptable models for diverse deception scenarios
🔎 Similar Papers
No similar papers found.
Xun Lin
Xun Lin
Postdoc, CUHK; PhD, Beihang University
Subtle Visual ComputingMedia Security
Xiaobao Guo
Xiaobao Guo
Nanyang Technological University
Taorui Wang
Taorui Wang
Harbin Institute of Technology(Shenzhen), Great Bay University
Computer VisionMLLM
Y
Yingjie Ma
Great Bay University, Dongguan, Guangdong, China
J
Jiajian Huang
Great Bay University, Dongguan, Guangdong, China
J
Jiayu Zhang
Great Bay University, Dongguan, Guangdong, China
J
Junzhe Cao
Great Bay University, Dongguan, Guangdong, China
Zitong Yu
Zitong Yu
U.S. Food and Drug Administration
Medical imagingDeep learningMachine learningImage reconstruction