🤖 AI Summary
This work addresses the limited sample efficiency and success rates of low-level motion planning in complex robotic manipulation tasks by proposing the iCEM+TL framework, which integrates zero-shot transfer learning with an improved Cross-Entropy Method (iCEM) and introduces task-decomposition-driven reward redesign (RR). By transferring key parameters from simpler upstream tasks to more complex downstream ones without additional training, the approach substantially enhances planning performance. In simulated stacking, sliding, and shelf-placing tasks, the method achieves up to a 23% improvement in success rate and demonstrates effective real-world transferability, as validated on a Franka Emika robot for the stacking task.
📝 Abstract
As robotic systems become more sophisticated, the growing complexity of their motion planning models and the longer training times pose substantial challenges. Evolutionary algorithms such as the Sample-efficient Cross-Entropy Method (iCEM) have recently demonstrated promising potential for low-level real-time planning by leveraging efficient knowledge reuse strategies to improve performance. Although effective in many control tasks, iCEM's performance can be constrained in more complex scenarios, particularly those requiring stacking, sliding, and shelf placement. In this work, we propose a novel iCEM+TL framework that explicitly leverages Transfer Learning (TL), where key iCEM parameters are transferred from simpler upstream tasks to guide more complex downstream tasks. Additionally, we applied Reward Redesign (RR) through task decomposition for stacking objects and shelf placement to optimize task-specific performance. Results from the simulation show that our framework achieves success rate improvements of up to 23%. The framework is further validated on a real Franka Emika robot in a stacking task, demonstrating its practical feasibility for real-world deployment.