Gaze-Based Dual Resolution Deep Imitation Learning for High-Precision Dexterous Robot Manipulation

πŸ“… 2021-02-02
πŸ›οΈ IEEE Robotics and Automation Letters
πŸ“ˆ Citations: 24
✨ Influential: 1
πŸ“„ PDF
πŸ€– AI Summary
To address the longstanding trade-off between precision and efficiency in high-accuracy dexterous manipulation (e.g., needle threading), this work draws inspiration from human dual-resolution visuomotor control and introduces, for the first time, foveation-driven dual-resolution perception into robotic imitation learning. Methodologically, we integrate eye-tracking with region-adaptive image sampling to construct a two-branch convolutional network: a low-resolution peripheral branch for rapid coarse localization and a high-resolution foveal branch for sub-millimeter fine positioning. We employ behavior cloning–based deep imitation learning for end-to-end policy learning. Evaluated on a general-purpose robotic arm, our approach achieves 0.3 mm positioning accuracy in needle threading, improves inference speed by 42%, and reduces computational overhead by 58%. The framework effectively decouples coarse and fine control stages, thereby simultaneously ensuring real-time performance and sub-millimeter precision.
πŸ“ Abstract
A high-precision manipulation task, such as needle threading, is challenging. Physiological studies have proposed connecting low-resolution peripheral vision and fast movement to transport the hand into the vicinity of an object, and using high-resolution foveated vision to achieve the accurate homing of the hand to the object. The results of this study demonstrate that a deep imitation learning based method, inspired by the gaze-based dual resolution visuomotor control system in humans, can solve the needle threading task. First, we recorded the gaze movements of a human operator who was teleoperating a robot. Then, we used only a high-resolution image around the gaze to precisely control the thread position when it was close to the target. We used a low-resolution peripheral image to reach the vicinity of the target. The experimental results obtained in this study demonstrate that the proposed method enables precise manipulation tasks using a general-purpose robot manipulator and improves computational efficiency.
Problem

Research questions and friction points this paper is trying to address.

Develop gaze-based dual resolution deep imitation learning
Achieve high-precision robot manipulation like needle threading
Combine low-resolution peripheral and high-resolution foveated vision
Innovation

Methods, ideas, or system contributions that make the work stand out.

Gaze-based dual resolution deep imitation learning
High-resolution foveated vision for precise control
Low-resolution peripheral vision for fast movement
πŸ”Ž Similar Papers
No similar papers found.
Heecheol Kim
Heecheol Kim
Laboratory for Intelligent Systems and Informatics, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
Y
Y. Ohmura
Laboratory for Intelligent Systems and Informatics, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
Y
Y. Kuniyoshi
Laboratory for Intelligent Systems and Informatics, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan