StyleAutoEncoder for Manipulating Image Attributes Using Pre-trained StyleGAN

📅 2024-12-28
🏛️ Pacific-Asia Conference on Knowledge Discovery and Data Mining
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address fine-grained style editing of pre-trained StyleGAN images, this work proposes a fine-tuning-free, unsupervised, single-shot-driven disentangled encoding framework. Methodologically, it achieves the first unsupervised linear separation and interpretable semantic mapping of attributes in the StyleGAN latent space; this is accomplished via feature distillation, contrastive latent-code clustering, an attribute-aware dual-path autoencoder, and gradient-guided optimization of semantic directions—enabling high-precision attribute localization and disentanglement. Evaluated on FFHQ and AFHQ, the method attains 92.6% attribute editing accuracy, improves fidelity by 37%, and processes images at 50 ms per image. Its core contributions are: (i) eliminating reliance on manual annotations or model fine-tuning; and (ii) establishing the first interpretable, composable, and lightweight unsupervised editing paradigm for StyleGAN.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Image Style Transfer
Computational Efficiency
Feature Adaptation
Innovation

Methods, ideas, or system contributions that make the work stand out.

StyleAutoEncoder
efficient feature tuning
flexible neural network design
🔎 Similar Papers
2024-07-01arXiv.orgCitations: 3