🤖 AI Summary
Visual anomaly detection suffers from the scarcity of real anomalous samples, while existing synthesis methods often lack realism or rely heavily on annotated data. This paper proposes AnomalyAny, the first framework enabling zero-shot generation of diverse, high-fidelity, cross-category unseen anomalous images at test time, requiring only a single normal image and a textual description. Its core innovations are an attention-guided anomaly optimization mechanism and a prompt-driven anomaly refinement strategy, which jointly integrate Stable Diffusion, text-conditional generation, and gradient-based optimization. Evaluated on MVTec AD and VisA, the synthesized anomalies significantly improve downstream detection performance while exhibiting high visual realism and discriminative difficulty. AnomalyAny establishes a novel paradigm for few-shot and zero-shot anomaly detection.
📄 Abstract
Visual anomaly detection (AD) presents significant challenges due to the scarcity of anomalous data samples. While numerous works have been proposed to synthesize anomalous samples, these synthetic anomalies often lack authenticity or require extensive training data, limiting their applicability in real-world scenarios. In this work, we propose Anomaly Anything (AnomalyAny), a novel framework that leverages Stable Diffusion (SD)'s image generation capabilities to generate diverse and realistic unseen anomalies. By conditioning on a single normal sample during test time, AnomalyAny is able to generate unseen anomalies for arbitrary object types with text descriptions. Within AnomalyAny, we propose attention-guided anomaly optimization to direct SD attention toward generating hard anomaly concepts. Additionally, we introduce prompt-guided anomaly refinement, incorporating detailed descriptions to further improve generation quality. Extensive experiments on the MVTec AD and VisA datasets demonstrate AnomalyAny's ability to generate high-quality unseen anomalies and its effectiveness in enhancing downstream AD performance.
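The core idea of attention-guided anomaly optimization, steering generation by gradient updates that increase cross-attention on the anomaly concept, can be illustrated with a minimal toy sketch. This is not the paper's implementation (the actual method operates on Stable Diffusion's cross-attention maps during denoising); here a latent vector, a small set of hypothetical text-token keys, and the chosen anomaly-token index are all illustrative assumptions, and the gradient is estimated by finite differences for self-containment:

```python
import numpy as np

def softmax(x):
    # numerically stable softmax
    e = np.exp(x - x.max())
    return e / e.sum()

def attention_to_token(z, K, idx):
    # toy cross-attention: weight of latent query z on text-token key idx
    return softmax(K @ z)[idx]

def optimize_latent(z, K, idx, lr=0.5, steps=30, eps=1e-4):
    # gradient ascent on attention to the target (anomaly) token,
    # with the gradient estimated by central finite differences
    z = z.copy()
    for _ in range(steps):
        g = np.zeros_like(z)
        for i in range(z.size):
            zp, zm = z.copy(), z.copy()
            zp[i] += eps
            zm[i] -= eps
            g[i] = (attention_to_token(zp, K, idx)
                    - attention_to_token(zm, K, idx)) / (2 * eps)
        z += lr * g
    return z

rng = np.random.default_rng(0)
K = rng.normal(size=(5, 8))     # 5 hypothetical text tokens, dim 8
z0 = rng.normal(size=8)         # initial latent
anomaly_idx = 3                 # token describing the anomaly concept
z1 = optimize_latent(z0, K, anomaly_idx)
print(attention_to_token(z0, K, anomaly_idx),
      attention_to_token(z1, K, anomaly_idx))
```

After optimization, the attention weight on the anomaly token is strictly higher than at initialization, mirroring how the framework directs SD's attention toward hard anomaly concepts during test-time generation.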