🤖 AI Summary
This work addresses the pressing need for low-cost, reproducible hardware-software co-design solutions in real-world embodied AI, where existing systems are often proprietary and expensive. We present OpenEAI, a fully open-source full-stack platform comprising the 6+1 degree-of-freedom robotic arm OpenEAI-Arm and the vision-language-action (VLA) model OpenEAI-VLA based on Qwen3-VL-4B. By integrating compliance-aware control to enhance hardware precision and employing a two-stage training strategy using only open-source data, our system enables efficient policy learning. Experiments demonstrate that OpenEAI-Arm outperforms two commercial robotic arms across four real-world tasks, while OpenEAI-VLA achieves success rates comparable to pi0 with significantly less training data. To our knowledge, this is the first end-to-end open-source framework spanning robotic hardware design to VLA training.
📝 Abstract
Embodied AI in the real world requires both accurate hardware and robust vision-language-action (VLA) policies. We present OpenEAI-Platform, a fully open-source platform that integrates a low-cost 6+1 degree-of-freedom (dof) robotic arm (OpenEAI-Arm) and a reproducible VLA model (OpenEAI-VLA). OpenEAI-Arm provides open-source mechanical designs for low manufacturing cost and compliant control methods for higher accuracy. OpenEAI-VLA builds on Qwen3-VL-4B and uses a Diffusion Transformer action head, and is trained in two stages with only open-source robot and multimodal datasets. Across four real-world manipulation tasks, OpenEAI-Arm outperforms two commercial 6+1-dof arms under the same policy, and OpenEAI-VLA achieves success rates comparable to the large-scale pretrained pi0 baseline with only limited pretraining data. We will release the full hardware designs, drivers, models, and training/data pipelines to support reproducible research and scalable data collection. Our codes, layouts, and models will be released after the paper is accepted.