Pose Splatter: A 3D Gaussian Splatting Model for Quantifying Animal Pose and Appearance

📅 2025-05-23
📈 Citations: 0
Influential: 0
🤖 AI Summary
Existing 3D animal pose estimation methods rely on geometric priors, labor-intensive keypoint annotations, or per-frame optimization, which limits representational detail and scalability. This paper introduces Pose Splatter, a framework for joint 3D pose and appearance modeling that requires no prior knowledge of animal geometry, no per-frame optimization, and no manual annotations. The approach combines shape carving with 3D Gaussian splatting and adds a rotation-invariant visual embedding designed as a plug-in replacement for 3D keypoint data in downstream behavioral analyses. Evaluated on datasets of mice, rats, and zebra finches, it learns accurate 3D geometries, yields low-dimensional pose embeddings that human evaluators rate above the state of the art, and generalizes to unseen data. By removing the annotation and per-frame optimization bottlenecks, the method makes large-scale, longitudinal behavioral analysis practical.

📝 Abstract
Accurate and scalable quantification of animal pose and appearance is crucial for studying behavior. Current 3D pose estimation techniques, such as keypoint- and mesh-based techniques, often face challenges including limited representational detail, labor-intensive annotation requirements, and expensive per-frame optimization. These limitations hinder the study of subtle movements and can make large-scale analyses impractical. We propose Pose Splatter, a novel framework leveraging shape carving and 3D Gaussian splatting to model the complete pose and appearance of laboratory animals without prior knowledge of animal geometry, per-frame optimization, or manual annotations. We also propose a novel rotation-invariant visual embedding technique for encoding pose and appearance, designed to be a plug-in replacement for 3D keypoint data in downstream behavioral analyses. Experiments on datasets of mice, rats, and zebra finches show Pose Splatter learns accurate 3D animal geometries. Notably, Pose Splatter represents subtle variations in pose, provides better low-dimensional pose embeddings over state-of-the-art as evaluated by humans, and generalizes to unseen data. By eliminating annotation and per-frame optimization bottlenecks, Pose Splatter enables analysis of large-scale, longitudinal behavior needed to map genotype, neural activity, and micro-behavior at unprecedented resolution.
Problem

Research questions and friction points this paper is trying to address.

Accurate 3D animal pose and appearance modeling without manual annotations
Overcoming the limited representational detail, annotation burden, and per-frame optimization cost of keypoint- and mesh-based 3D pose estimation
Enabling large-scale behavioral analysis with high-resolution detail
Innovation

Methods, ideas, or system contributions that make the work stand out.

Shape carving combined with 3D Gaussian splatting for animal pose and appearance modeling
Rotation-invariant visual embedding technique
No prior geometry knowledge or annotations needed
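
For readers unfamiliar with shape carving, the sketch below shows the generic idea (a visual hull): a candidate 3D point is kept only if it projects inside the animal's silhouette in every camera view. This is a textbook illustration, not the Pose Splatter implementation. The function name `carve_visual_hull`, the toy projection matrices, and the 8×8 masks are all assumptions made for the example.

```python
import numpy as np

def carve_visual_hull(masks, projections, grid_points):
    """Return a boolean array marking grid points inside every silhouette.

    masks:        list of (H, W) boolean arrays, True where the subject is visible
    projections:  list of (3, 4) projection matrices (hypothetical calibration)
    grid_points:  (N, 3) array of candidate 3D points
    """
    n = len(grid_points)
    homog = np.hstack([grid_points, np.ones((n, 1))])  # homogeneous coords, (N, 4)
    keep = np.ones(n, dtype=bool)
    for mask, P in zip(masks, projections):
        uvw = homog @ P.T                               # projected coords, (N, 3)
        in_front = uvw[:, 2] > 0                        # ignore points behind the camera
        w = np.where(in_front, uvw[:, 2], 1.0)          # avoid dividing by zero
        u = np.round(uvw[:, 0] / w).astype(int)         # pixel column
        v = np.round(uvw[:, 1] / w).astype(int)         # pixel row
        height, width = mask.shape
        inside = in_front & (u >= 0) & (u < width) & (v >= 0) & (v < height)
        hit = np.zeros(n, dtype=bool)
        hit[inside] = mask[v[inside], u[inside]]
        keep &= hit                                     # carve away points outside any view
    return keep

# Toy setup (assumed): an 8x8x8 grid seen by two axis-aligned orthographic cameras.
axis = np.arange(8)
grid = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), axis=-1)
grid = grid.reshape(-1, 3).astype(float)

P_xy = np.array([[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1]], dtype=float)  # u = x, v = y
P_yz = np.array([[0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]], dtype=float)  # u = y, v = z

mask_xy = np.zeros((8, 8), dtype=bool)
mask_xy[2:5, 2:5] = True   # silhouette covers y in 2..4 (rows), x in 2..4 (cols)
mask_yz = np.zeros((8, 8), dtype=bool)
mask_yz[3:6, 2:5] = True   # silhouette covers z in 3..5 (rows), y in 2..4 (cols)

occupied = carve_visual_hull([mask_xy, mask_yz], [P_xy, P_yz], grid)
carved_points = grid[occupied]
```

In this toy case the carved region is the box where both silhouettes agree. A coarse volume of this kind is the sort of geometry that a splatting stage could then refine, though how Pose Splatter connects the two steps is not specified in the abstract.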
Jack Goffinet
Department of Computer Science, Duke University
Youngjo Min
Department of Computer Science, Duke University
Carlo Tomasi
Department of Computer Science, Duke University
David E. Carlson
Associate Professor, Duke University
Machine Learning, Deep Learning, Data Science, Environmental Health, Brain Models