Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs

πŸ“… 2025-07-21
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address the challenges of long-horizon planning, high uncertainty, and weak cross-location coordination in generalized multi-object collection tasks for domestic robots operating over large-scale scene graphs, this paper proposes Inter-LLM: an interleaved LLM–motion-planning framework. It integrates multimodal action-cost similarity functions and explicitly models historical states alongside future predictions, enabling joint task-level semantic reasoning and motion-level path optimization. The method achieves a 30% improvement in task completion rate in simulation, significantly enhances multi-turn human-robot instruction execution success, and reduces overall task cost. By unifying high-level symbolic reasoning with low-level geometric control, Inter-LLM provides a scalable, human-like intelligence framework for open-set object manipulation and large-scale environment navigation.

Technology Category

Application Category

πŸ“ Abstract
Household robots have been a longstanding research topic, but they still lack human-like intelligence, particularly in manipulating open-set objects and navigating large environments efficiently and accurately. To push this boundary, we consider a generalized multi-object collection problem in large scene graphs, where the robot needs to pick up and place multiple objects across multiple locations in a long mission of multiple human commands. This problem is extremely challenging since it requires long-horizon planning in a vast action-state space under high uncertainties. To this end, we propose a novel interleaved LLM and motion planning algorithm Inter-LLM. By designing a multimodal action cost similarity function, our algorithm can both reflect the history and look into the future to optimize plans, striking a good balance of quality and efficiency. Simulation experiments demonstrate that compared with latest works, our algorithm improves the overall mission performance by 30% in terms of fulfilling human commands, maximizing mission success rates, and minimizing mission costs.
Problem

Research questions and friction points this paper is trying to address.

Generalized multi-object collection in large scene graphs
Long-horizon planning under high uncertainties
Balancing quality and efficiency in robot missions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Interleaved LLM and motion planning algorithm
Multimodal action cost similarity function
Balances plan quality and efficiency
πŸ”Ž Similar Papers
No similar papers found.