Mixed-Reality Digital Twins: Leveraging the Physical and Virtual Worlds for Hybrid Sim2Real Transition of Multi-Agent Reinforcement Learning Policies

📅 2024-03-16

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

To address the long training times, high real-world deployment costs, and poor sim-to-real transfer of multi-agent reinforcement learning (MARL) in cyber-physical vehicle systems, this paper proposes a mixed-reality digital twin framework. The framework selectively scales parallelized simulation workloads on demand and applies systematic domain randomization to improve policy generalization, enabling zero-shot sim-to-real transfer. Evaluation on two use cases, one cooperative and one competitive, shows a training-time reduction of up to 76.3% and a sim-to-real gap as low as 2.9%. The framework thus balances computational scalability with physical deployability.

📝 Abstract

Multi-agent reinforcement learning (MARL) for cyber-physical vehicle systems usually requires significantly long training times due to the inherent complexity of these systems. Furthermore, deploying the trained policies in the real world demands a feature-rich environment along with multiple physical embodied agents, which may not be feasible due to monetary, physical, energy, or safety constraints. This work seeks to address these pain points by presenting a mixed-reality digital twin framework capable of: (i) selectively scaling parallelized workloads on demand, and (ii) evaluating the trained policies across simulation-to-reality (sim2real) experiments. The viability and performance of the proposed framework are highlighted through two representative use cases, which cover cooperative as well as competitive classes of MARL problems. We study the effect of: (i) agent and environment parallelization on training time, and (ii) systematic domain randomization on zero-shot sim2real transfer across both case studies. Results indicate up to a 76.3% reduction in training time with the proposed parallelization scheme and a sim2real gap as low as 2.9% using the proposed deployment method.
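To make the two headline figures easier to interpret, the sketch below shows one plausible way such percentages could be computed. The function names and both definitions (relative reduction against a non-parallel baseline, and relative difference against the simulated metric) are assumptions for illustration. The abstract does not state how the paper defines either quantity, and the input numbers are hypothetical.

```python
def training_time_reduction(baseline_time: float, parallel_time: float) -> float:
    """Percentage reduction in training time relative to a baseline run.

    Assumed definition: 100 * (baseline - parallel) / baseline.
    """
    if baseline_time <= 0:
        raise ValueError("baseline_time must be positive")
    return 100.0 * (baseline_time - parallel_time) / baseline_time


def sim2real_gap(sim_metric: float, real_metric: float) -> float:
    """Percentage gap between a metric measured in simulation and in reality.

    Assumed definition: 100 * |sim - real| / |sim|.
    """
    if sim_metric == 0:
        raise ValueError("sim_metric must be non-zero")
    return 100.0 * abs(sim_metric - real_metric) / abs(sim_metric)


if __name__ == "__main__":
    # Hypothetical values, not taken from the paper.
    print(f"reduction: {training_time_reduction(120.0, 30.0):.1f}%")
    print(f"gap: {sim2real_gap(0.80, 0.76):.1f}%")
```

Under these assumed definitions, "up to 76.3%" would mean the best parallel configuration needed a little under a quarter of the baseline training time.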

Problem

Research questions and friction points this paper is trying to address.

Training MARL policies for cyber-physical vehicle systems takes a significantly long time.

Real-world deployment of trained policies requires a feature-rich environment and multiple physical agents, which monetary, physical, energy, or safety constraints may rule out.

Policies trained in simulation must transfer to reality zero-shot with a small sim2real gap.

Innovation

Methods, ideas, or system contributions that make the work stand out.

Mixed-reality digital twin framework for MARL

Selective scaling of parallelized workloads

Systematic domain randomization for sim2real transfer
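As a rough illustration of the domain randomization idea listed above, the following minimal sketch resamples a few simulation parameters at the start of every training episode, so the policy never sees exactly the same environment twice. The parameter names (`friction`, `sensor_noise_std`, `actuation_delay_s`), their ranges, and the `sample_episode_params` helper are hypothetical; the summary does not say which quantities the paper randomizes or how.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class EpisodeParams:
    """Simulation parameters drawn once per episode (all hypothetical)."""
    friction: float
    sensor_noise_std: float
    actuation_delay_s: float


# Assumed randomization ranges as (low, high) pairs; illustrative only.
RANGES = {
    "friction": (0.6, 1.0),
    "sensor_noise_std": (0.0, 0.05),
    "actuation_delay_s": (0.0, 0.1),
}


def sample_episode_params(rng: random.Random) -> EpisodeParams:
    """Draw each parameter uniformly from its configured range."""
    return EpisodeParams(
        **{name: rng.uniform(low, high) for name, (low, high) in RANGES.items()}
    )


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so runs are reproducible
    for episode in range(3):
        params = sample_episode_params(rng)
        # A real training loop would reset the simulator with `params` here.
        print(episode, params)
```

Keeping the ranges in one table makes it straightforward to widen or narrow the randomization systematically and compare the resulting sim2real gap.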
