ForensicConcept: Transferable Forensic Concepts for AIGI Detection

📅 2026-06-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Current AI-generated image detectors suffer from poor generalization on out-of-distribution data and lack interpretability. This work proposes the first transferable and interpretable forensic concept framework: it localizes critical image patches via Transformer attribution, constructs a concept codebook through clustering, and achieves cross-architecture concept alignment and transfer by integrating generation artifact features extracted by CleanDIFT with neighborhood structural consistency measured by CKNNA. The method consistently outperforms existing approaches on GenImage, GAN-family, and Chameleon benchmarks. Notably, the CKNNA alignment score effectively predicts transfer performance, providing auditable evidence to support detection decisions.
📝 Abstract
AI-generated image detectors achieve high accuracy on in-distribution data but often fail on unseen generators. A key obstacle to understanding this failure is the black-box nature of current detectors: they do not reveal which evidence drives their decisions. We propose ForensicConcept, a framework that extracts explicit forensic concepts from detectors and enables their transfer across backbones. Our method localizes decision-critical patches via Transformer attribution, clusters them into a compact concept codebook, and uses a concept-aligned projection to produce auditable evidence readouts. Motivated by prior studies showing that DINO representations can guide diffusion generation and exhibit concept-level correspondence with diffusion features, we introduce a generation-trace reference based on CleanDIFT diffusion features and quantify backbone-trace alignment via neighborhood-structure consistency (CKNNA). We further propose concept codebook injection to transfer diffusion-derived concepts into target backbones. Experiments on GenImage, GAN-family, and Chameleon benchmarks show consistent improvements over prior methods. We also find that CKNNA alignment predicts transfer effectiveness, providing a principled explanation for why some backbones yield more transferable forensic evidence than others.
Problem

Research questions and friction points this paper is trying to address.

AIGI detection
forensic concepts
transferability
black-box detectors
unseen generators
Innovation

Methods, ideas, or system contributions that make the work stand out.

ForensicConcept
concept transfer
diffusion features
CKNNA alignment
auditable detection
🔎 Similar Papers
2024-06-21Journal of Artificial Intelligence ResearchCitations: 6
M
Menyanshu Zhou
Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, School of Informatics, Xiamen University, Xiamen 361005, P.R.China
Z
Ziyin Zhou
Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, School of Informatics, Xiamen University, Xiamen 361005, P.R.China
Ke Sun
Ke Sun
Xiamen University
Computer VisionMachine Learning
Yunpeng Luo
Yunpeng Luo
Bytedance Inc.
Jiayi Ji
Jiayi Ji
Rutgers University
X
Xiaoshuai Sun
Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, School of Informatics, Xiamen University, Xiamen 361005, P.R.China; Sino-Russian Research Center for Digital Economy
R
Rongrong Ji
Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, School of Informatics, Xiamen University, Xiamen 361005, P.R.China