Semi-supervised CAPP Transformer Learning via Pseudo-labeling

📅 2026-02-01

📈 Citations: 0

✨ Influential: 0

career value

125K/year

🤖 AI Summary

This study addresses the challenge of data scarcity in high-level computer-aided process planning (CAPP), where limited annotated industrial data hampers model generalization. To overcome this limitation, the authors propose a semi-supervised learning approach that introduces, for the first time in CAPP, an Oracle-guided pseudo-labeling mechanism. Specifically, an Oracle model trained on existing behavioral data evaluates the confidence of Transformer-based predictions on unlabeled parts, selectively retaining high-confidence pseudo-labels for a single round of model retraining. The method is evaluated on a small-scale simulated dataset designed to cover the full data distribution, demonstrating significant performance gains over baseline models. This approach effectively mitigates the data scarcity problem without requiring additional manual annotation, thereby enhancing the practical applicability of CAPP systems in real-world industrial settings.

Technology Category

Application Category

📝 Abstract

High-level Computer-Aided Process Planning (CAPP) generates manufacturing process plans from part specifications. It suffers from limited dataset availability in industry, reducing model generalization. We propose a semi-supervised learning approach to improve transformer-based CAPP transformer models without manual labeling. An oracle, trained on available transformer behaviour data, filters correct predictions from unseen parts, which are then used for one-shot retraining. Experiments on small-scale datasets with simulated ground truth across the full data distribution show consistent accuracy gains over baselines, demonstrating the method's effectiveness in data-scarce manufacturing environments.

Problem

Research questions and friction points this paper is trying to address.

Computer-Aided Process Planning

semi-supervised learning

data scarcity

model generalization

limited dataset

Innovation

Methods, ideas, or system contributions that make the work stand out.

semi-supervised learning

pseudo-labeling

CAPP