๐ค AI Summary
Existing Mamba-based 3D human pose lifting methods flatten joint sequences into 1D token sequences, failing to preserve the intrinsic skeletal topology and neglecting the heterogeneous motion patterns across joints. To address these limitations, we propose a structural-aware and motion-adaptive Mamba framework. First, we design a structural-aware state fusion module that explicitly encodes kinematic dependencies among joints via graph-structured priors. Second, we introduce a motion-adaptive modulation mechanism that jointly performs joint-specific dynamic modeling at both feature and state levels. By decoupling representation learning from strict sequential ordering, our approach mitigates over-homogenization of motion dynamics. Extensive experiments demonstrate state-of-the-art performance on Human3.6M and MPI-INF-3DHP benchmarks, achieving superior accuracy while reducing computational overhead by up to 32% compared to prior Mamba variantsโthus striking an effective balance between precision and efficiency.
๐ Abstract
Recent Mamba-based methods for the pose-lifting task tend to model joint dependencies by 2D-to-1D mapping with diverse scanning strategies. Though effective, they struggle to model intricate joint connections and uniformly process all joint motion trajectories while neglecting the intrinsic differences across motion characteristics. In this work, we propose a structure-aware and motion-adaptive framework to capture spatial joint topology along with diverse motion dynamics independently, named as SAMA. Specifically, SAMA consists of a Structure-aware State Integrator (SSI) and a Motion-adaptive State Modulator (MSM). The Structure-aware State Integrator is tasked with leveraging dynamic joint relationships to fuse information at both the joint feature and state levels in the state space, based on pose topology rather than sequential state transitions. The Motion-adaptive State Modulator is responsible for joint-specific motion characteristics recognition, thus applying tailored adjustments to diverse motion patterns across different joints. Through the above key modules, our algorithm enables structure-aware and motion-adaptive pose lifting. Extensive experiments across multiple benchmarks demonstrate that our algorithm achieves advanced results with fewer computational costs.