SynSpill: Improved Industrial Spill Detection With Synthetic Data

📅 2025-08-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In industrial leak detection, the scarcity of real annotated data—exacerbated by the rarity and sensitivity of leakage events—severely degrades zero-shot performance of vision-language models (VLMs), while conventional fine-tuning remains infeasible due to data constraints. To address this, we propose SynSpill, a scalable, high-fidelity synthetic data generation framework tailored for safety-critical scenarios, coupled with parameter-efficient fine-tuning (PEFT) strategies adaptable to diverse architectures—including YOLO, DETR, and large-scale VLMs. SynSpill effectively bridges domain gaps, significantly enhancing both zero-shot and few-shot generalization without requiring real annotations. Extensive experiments demonstrate consistent and comparable performance gains across all evaluated detectors and VLMs, validating the method’s effectiveness, robustness, and cross-architectural scalability.

Technology Category

Application Category

📝 Abstract
Large-scale Vision-Language Models (VLMs) have transformed general-purpose visual recognition through strong zero-shot capabilities. However, their performance degrades significantly in niche, safety-critical domains such as industrial spill detection, where hazardous events are rare, sensitive, and difficult to annotate. This scarcity -- driven by privacy concerns, data sensitivity, and the infrequency of real incidents -- renders conventional fine-tuning of detectors infeasible for most industrial settings. We address this challenge by introducing a scalable framework centered on a high-quality synthetic data generation pipeline. We demonstrate that this synthetic corpus enables effective Parameter-Efficient Fine-Tuning (PEFT) of VLMs and substantially boosts the performance of state-of-the-art object detectors such as YOLO and DETR. Notably, in the absence of synthetic data (SynSpill dataset), VLMs still generalize better to unseen spill scenarios than these detectors. When SynSpill is used, both VLMs and detectors achieve marked improvements, with their performance becoming comparable. Our results underscore that high-fidelity synthetic data is a powerful means to bridge the domain gap in safety-critical applications. The combination of synthetic generation and lightweight adaptation offers a cost-effective, scalable pathway for deploying vision systems in industrial environments where real data is scarce/impractical to obtain. Project Page: https://synspill.vercel.app
Problem

Research questions and friction points this paper is trying to address.

Detecting rare industrial spills with limited real data
Improving spill detection using synthetic data generation
Enhancing vision models for safety-critical industrial applications
Innovation

Methods, ideas, or system contributions that make the work stand out.

Synthetic data generation for spill detection
Parameter-Efficient Fine-Tuning of VLMs
Lightweight adaptation for industrial vision systems
🔎 Similar Papers
No similar papers found.