Wukong's 72 Transformations: High-fidelity Textured 3D Morphing via Flow Models

📅 2025-11-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of high-fidelity, training-free 3D shape-and-texture morphing—bypassing conventional reliance on manual correspondence establishment and explicit deformation trajectory estimation. The proposed method formulates morphing as an optimal transport–based barycentric interpolation problem. It integrates a streaming Transformer to generate geometric priors, similarity-guided semantic consistency constraints, and a progressive sequence initialization strategy. Given only source/target text or image prompts, it achieves joint, smooth geometric and textural transitions without registration or supervision, effectively suppressing oversmoothing artifacts. Qualitative and quantitative evaluations across diverse shape-texture co-morphing tasks demonstrate superior performance over existing unsupervised and weakly supervised approaches. To our knowledge, this is the first method enabling purely prompt-driven, end-to-end consistent, high-quality 3D semantic morphing.

Technology Category

Application Category

📝 Abstract
We present WUKONG, a novel training-free framework for high-fidelity textured 3D morphing that takes a pair of source and target prompts (image or text) as input. Unlike conventional methods -- which rely on manual correspondence matching and deformation trajectory estimation (limiting generalization and requiring costly preprocessing) -- WUKONG leverages the generative prior of flow-based transformers to produce high-fidelity 3D transitions with rich texture details. To ensure smooth shape transitions, we exploit the inherent continuity of flow-based generative processes and formulate morphing as an optimal transport barycenter problem. We further introduce a sequential initialization strategy to prevent abrupt geometric distortions and preserve identity coherence. For faithful texture preservation, we propose a similarity-guided semantic consistency mechanism that selectively retains high-frequency details and enables precise control over blending dynamics. This avoids common artifacts like oversmoothing while maintaining semantic fidelity. Extensive quantitative and qualitative evaluations demonstrate that WUKONG significantly outperforms state-of-the-art methods, achieving superior results across diverse geometry and texture variations.
Problem

Research questions and friction points this paper is trying to address.

Generates high-fidelity textured 3D morphing from prompts without training
Ensures smooth shape transitions by formulating morphing as optimal transport
Preserves texture details and semantic fidelity to avoid artifacts
Innovation

Methods, ideas, or system contributions that make the work stand out.

Training-free framework using flow-based transformers
Optimal transport barycenter for smooth shape transitions
Similarity-guided semantic consistency for texture preservation
🔎 Similar Papers
No similar papers found.
M
Minghao Yin
Visual AI Lab, The University of Hong Kong
Yukang Cao
Yukang Cao
Research Fellow, Nanyang Technological University
3D computer vision
K
Kai Han
Visual AI Lab, The University of Hong Kong