🤖 AI Summary
This study addresses the need for natural human–handover interactions with hip-mounted supernumerary robotic limbs (SRLs) in intimate personal spaces. To this end, we introduce 3HANDS—the first dedicated dataset for SRL–human handover interactions. Motivated by three key human behavioral characteristics—primary task execution, implicit coordination, and personal space constraints—we propose the first unified generative framework integrating trajectory generation, endpoint selection, and timing prediction. The framework leverages multimodal motion-capture data from real human collaborative handovers and combines conditional generative models (e.g., conditional VAEs and diffusion models) with temporal behavior prediction techniques. A user study (N=10) demonstrates that our generated interactions significantly improve perceived naturalness and comfort while reducing physical workload, outperforming heuristic baselines across all metrics.
📝 Abstract
Supernumerary robotic limbs (SRLs) are robotic structures integrated closely with the user's body, which augment human physical capabilities and necessitate seamless, naturalistic human-machine interaction. For effective assistance in physical tasks, enabling SRLs to hand over objects to humans is crucial. Yet, designing heuristic-based policies for robots is time-consuming, difficult to generalize across tasks, and results in less human-like motion. When trained with proper datasets, generative models are powerful alternatives for creating naturalistic handover motions. We introduce 3HANDS, a novel dataset of object handover interactions between a participant performing a daily activity and another participant enacting a hip-mounted SRL in a naturalistic manner. 3HANDS captures the unique characteristics of SRL interactions: operating in intimate personal space with asymmetric object origins, implicit motion synchronization, and the user's engagement in a primary task during the handover. To demonstrate the effectiveness of our dataset, we present three models: one that generates naturalistic handover trajectories, another that determines the appropriate handover endpoints, and a third that predicts the moment to initiate a handover. In a user study (N=10), we compare the handover interaction performed with our method compared to a baseline. The findings show that our method was perceived as significantly more natural, less physically demanding, and more comfortable.