RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward

📅 2025-05-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Joint optimization of robot morphology and control policies has long been constrained by fixed reward functions, leading to suboptimal convergence and limited emergence of morphology-adapted diverse locomotion behaviors. This paper introduces the first large language model (LLM)-driven morphology–reward co-optimization framework, eliminating task-specific prompts and predefined templates. It achieves end-to-end joint optimization via a two-stage mechanism: (i) LLM-based generation of high-diversity, high-quality morphology–reward pairs; and (ii) gradient-guided alternating fine-tuning. The method integrates LLM reasoning, reward shaping, parametric morphological modeling, and differentiable optimization. Evaluated on eight canonical locomotion tasks, our framework consistently outperforms human-designed solutions and state-of-the-art methods, autonomously synthesizing high-performance morphologies alongside their dedicated control policies.

Technology Category

Application Category

📝 Abstract
Robot co-design, jointly optimizing morphology and control policy, remains a longstanding challenge in the robotics community, where many promising robots have been developed. However, a key limitation lies in its tendency to converge to sub-optimal designs due to the use of fixed reward functions, which fail to explore the diverse motion modes suitable for different morphologies. Here we propose RoboMoRe, a large language model (LLM)-driven framework that integrates morphology and reward shaping for co-optimization within the robot co-design loop. RoboMoRe performs a dual-stage optimization: in the coarse optimization stage, an LLM-based diversity reflection mechanism generates both diverse and high-quality morphology-reward pairs and efficiently explores their distribution. In the fine optimization stage, top candidates are iteratively refined through alternating LLM-guided reward and morphology gradient updates. RoboMoRe can optimize both efficient robot morphologies and their suited motion behaviors through reward shaping. Results demonstrate that without any task-specific prompting or predefined reward/morphology templates, RoboMoRe significantly outperforms human-engineered designs and competing methods across eight different tasks.
Problem

Research questions and friction points this paper is trying to address.

Joint optimization of robot morphology and control policy
Overcoming sub-optimal designs from fixed reward functions
Exploring diverse motion modes for different morphologies
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-driven joint optimization of morphology and reward
Dual-stage optimization with diversity reflection
Alternating LLM-guided reward and morphology updates
🔎 Similar Papers
No similar papers found.
J
Jiawei Fang
Department of Mechanical Engineering, University of California, Berkeley
Y
Yuxuan Sun
Department of Mechanical Engineering, University of California, Berkeley
C
Chengtian Ma
Department of Mechanical Engineering, University of California, Berkeley
Q
Qiuyu Lu
Department of Mechanical Engineering, University of California, Berkeley
Lining Yao
Lining Yao
UC Berkeley
Programming MaterialAdaptive and Morphing MatterDigital FabricationHuman Computer Interaction