RAI: Flexible Agent Framework for Embodied AI

📅 2025-05-12
📈 Citations: 0
Influential: 0
🤖 AI Summary
To address challenges in embodied AI—including difficult multi-agent system development, poor cross-platform portability, and misalignment between large language models (LLMs) and physical execution—this paper proposes RAI, a flexible multi-agent framework. Methodologically, RAI introduces: (1) a unified agent encapsulation mechanism tailored for embodied intelligence, enabling seamless integration of LLMs, ROS 2 robotics middleware, and diverse simulation environments (e.g., dual-arm manipulators, agricultural machinery, and ROSBot XL digital twins); (2) an LLM interface abstraction layer, a digital twin synchronization mechanism, and a lightweight multi-agent communication protocol; and (3) the first simulation-based embodied multi-task evaluation benchmark. Experimental results demonstrate RAI’s effectiveness across real robots and two major simulation platforms, supporting high-precision motion control, real-time perception–action response, and collaborative decision-making. Crucially, RAI identifies and mitigates key LLM limitations in embodied reasoning, temporal planning, and action grounding.

📝 Abstract
With an increase in the capabilities of generative language models, a growing interest in embodied AI has followed. This contribution introduces RAI - a framework for creating embodied Multi Agent Systems for robotics. The proposed framework implements tools for Agents' integration with robotic stacks, Large Language Models, and simulations. It provides out-of-the-box integration with state-of-the-art systems like ROS 2. It also comes with dedicated mechanisms for the embodiment of Agents. These mechanisms have been tested on a physical robot, Husarion ROSBot XL, which was coupled with its digital twin, for rapid prototyping. Furthermore, these mechanisms have been deployed in two simulations: (1) robot arm manipulator and (2) tractor controller. All of these deployments have been evaluated in terms of their control capabilities, effectiveness of embodiment, and perception ability. The proposed framework has been used successfully to build systems with multiple agents. It has demonstrated effectiveness in all the aforementioned tasks. It also enabled identifying and addressing the shortcomings of the generative models used for embodied AI.
Problem

Research questions and friction points this paper is trying to address.

Develops a framework for embodied Multi Agent Systems in robotics
Integrates Agents with robotic stacks, LLMs, and simulations
Tests embodiment mechanisms on physical and simulated robots
Innovation

Methods, ideas, or system contributions that make the work stand out.

Flexible framework for embodied Multi Agent Systems
Integration with ROS 2 and Large Language Models
Digital twin and simulation for rapid prototyping
Kajetan Rachwal
Robotec.AI, Warsaw, Poland
Maciej Majek
Robotec.AI, Warsaw, Poland
Bartlomiej Boczek
Robotec.AI, Warsaw, Poland
Kacper Dąbrowski
Robotec.AI, Warsaw, Poland
Pawel Liberadzki
Robotec.AI, Warsaw, Poland
Adam Dąbrowski
Robotec.AI, Warsaw, Poland
Maria Ganzha
Associate Professor, Warsaw University of Technology
Agent-based computing, Multiagent system, distributed system, Ontology, Semantic Data Processing