🤖 AI Summary
Existing evaluation metrics—primarily mean squared error (MSE)—for marker reconstruction in optical motion capture exhibit severe misalignment with human perceptual quality. Method: This paper first systematically identifies the fundamental perceptual misalignment of conventional metrics; then, grounded in perceptual psychology principles, it designs and validates a novel set of perception-aware evaluation metrics; finally, it establishes a rigorous comparative evaluation framework across multiple models and datasets via large-scale human subjective rating experiments, analyzed using Spearman rank correlation and Pearson linear correlation coefficient (PLCC). Contribution/Results: The proposed metrics achieve significantly improved consistency with human judgment, yielding an average PLCC improvement of 0.42. This work establishes the first reliable, perceptually aligned benchmark for motion capture reconstruction evaluation.
📝 Abstract
Animation data is often obtained through optical motion capture systems, which utilize a multitude of cameras to establish the position of optical markers. However, system errors or occlusions can result in missing markers, the manual cleaning of which can be time-consuming. This has sparked interest in machine learning-based solutions for missing marker reconstruction in the academic community. Most academic papers utilize a simplistic mean square error as the main metric. In this paper, we show that this metric does not correlate with subjective perception of the fill quality. Additionally, we introduce and evaluate a set of better-correlated metrics that can drive progress in the field.