PCA-VAE: Differentiable Subspace Quantization without Codebook Collapse

📅 2026-02-21

📈 Citations: 0

✨ Influential: 0

career value

197K/year

🤖 AI Summary

This work addresses key limitations of vector-quantized variational autoencoders—namely, the non-differentiability of quantizers, reliance on straight-through estimators, and codebook collapse—by introducing PCA-VAE. The proposed method replaces conventional vector quantization with a differentiable PCA bottleneck trained online via Oja’s rule, thereby eliminating the need for a discrete codebook, commitment loss, or lookup noise. This approach learns orthogonal latent representations ordered by variance and, to our knowledge, is the first to integrate differentiable PCA into generative modeling as a quantization alternative. On CelebA-HQ, PCA-VAE achieves superior reconstruction quality compared to VQ-GAN and SimVQ while using only 1/10 to 1/100 of the latent bit budget, and naturally yields semantically interpretable latent dimensions corresponding to attributes such as pose, lighting, and gender.

Technology Category

Application Category

📝 Abstract

Vector-quantized autoencoders deliver high-fidelity latents but suffer inherent flaws: the quantizer is non-differentiable, requires straight-through hacks, and is prone to collapse. We address these issues at the root by replacing VQ with a simple, principled, and fully differentiable alternative: an online PCA bottleneck trained via Oja's rule. The resulting model, PCA-VAE, learns an orthogonal, variance-ordered latent basis without codebooks, commitment losses, or lookup noise. Despite its simplicity, PCA-VAE exceeds VQ-GAN and SimVQ in reconstruction quality on CelebAHQ while using 10-100x fewer latent bits. It also produces naturally interpretable dimensions (e.g., pose, lighting, gender cues) without adversarial regularization or disentanglement objectives. These results suggest that PCA is a viable replacement for VQ: mathematically grounded, stable, bit-efficient, and semantically structured, offering a new direction for generative models beyond vector quantization.

Problem

Research questions and friction points this paper is trying to address.

vector quantization

codebook collapse

non-differentiable quantizer

straight-through estimator

Innovation

Methods, ideas, or system contributions that make the work stand out.

PCA-VAE

differentiable quantization

codebook collapse

online PCA