Pose Prior Learner: Unsupervised Categorical Prior Learning for Pose Estimation

📅 2024-10-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the absence of generalizable pose priors in unsupervised category-level pose estimation by proposing the first fully unsupervised pose prior learning paradigm. Methodologically, it introduces a hierarchical memory network that stores and distills prototypical pose parts in a disentangled manner; joint optimization of template transformation and image reconstruction enables iterative prototype regression, substantially improving occlusion robustness. Key contributions include: (1) the first formal framework for unsupervised category-level pose prior learning; and (2) a hierarchical memory-driven mechanism for prototype disentanglement and dynamic regression. Extensive evaluation on human and animal pose datasets demonstrates consistent superiority over state-of-the-art unsupervised baselines—particularly under severe occlusion, where pose estimation accuracy improves markedly. The entire pipeline operates without any manual annotations or supervised pose priors.

Technology Category

Application Category

📝 Abstract
A prior represents a set of beliefs or assumptions about a system, aiding inference and decision-making. In this paper, we introduce the challenge of unsupervised categorical prior learning in pose estimation, where AI models learn a general pose prior for an object category from images in a self-supervised manner. Although priors are effective in estimating pose, acquiring them can be difficult. We propose a novel method, named Pose Prior Learner (PPL), to learn a general pose prior for any object category. PPL uses a hierarchical memory to store compositional parts of prototypical poses, from which we distill a general pose prior. This prior improves pose estimation accuracy through template transformation and image reconstruction. PPL learns meaningful pose priors without any additional human annotations or interventions, outperforming competitive baselines on both human and animal pose estimation datasets. Notably, our experimental results reveal the effectiveness of PPL using learned prototypical poses for pose estimation on occluded images. Through iterative inference, PPL leverages the pose prior to refine estimated poses, regressing them to any prototypical poses stored in memory. Our code, model, and data will be publicly available.
Problem

Research questions and friction points this paper is trying to address.

Unsupervised categorical prior learning
Pose estimation from images
Improving accuracy without human annotations
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unsupervised categorical prior learning
Hierarchical memory for pose storage
Self-supervised pose estimation enhancement
🔎 Similar Papers
No similar papers found.
Z
Ziyu Wang
Show Lab, National University of Singapore, Singapore; Deep NeuroCognition Lab, CFAR and I2R, Agency for Science, Technology and Research, Singapore; Nanyang Technological University, Singapore
S
Shuangpeng Han
Deep NeuroCognition Lab, CFAR and I2R, Agency for Science, Technology and Research, Singapore; Nanyang Technological University, Singapore
Mengmi Zhang
Mengmi Zhang
Assistant professor and PI of Deep NeuroCognition Lab, Nanyang Technological University, Singapore
neuroscience-inspired AIcomputer visioncomputational neurosciencecognitive science