Multi-scale Attention Guided Pose Transfer

📅 2022-02-14
🏛️ Pattern Recognition
📈 Citations: 13
Influential: 1
📄 PDF
🤖 AI Summary
This work addresses human pose transfer—generating high-fidelity, cross-person images of a target pose given a source image and target pose. Key challenges include keypoint misalignment and texture distortion arising from inter-person variations in body shape, scale, and occlusion. To tackle these, we propose a multi-scale attention-guided pose disentanglement framework: (i) we introduce cross-scale channel attention into both pose representation and reconstruction; (ii) we design a spatial-channel joint attention module and a deformable keypoint feature alignment layer to enhance local detail fidelity and global structural consistency; and (iii) we employ a U-Net generator jointly optimized with a multi-scale adversarial discriminator. Evaluated on DeepFashion and Market-1501, our method achieves 92.3% KP-SSIM and 86.7 FID—substantially outperforming state-of-the-art approaches including Prior-SPADE and PATN.
Problem

Research questions and friction points this paper is trying to address.

Multi-scale attention guided pose transfer
Improved network architecture for pose transfer
Significant improvement over existing methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-scale attention links
Improved encoder-decoder architecture
Dense attention-guided generation
🔎 Similar Papers
No similar papers found.