LLM3: Bridging Symbolic Task Planning and Continuous Motion Generation with LLMs

In an article recently submitted to the arXiv* server, researchers introduced LLM3, a novel task and motion planning (TAMP) framework that leverages large language models (LLMs) to bridge symbolic task planning and continuous motion generation.

Study: LLM3: Bridging Symbolic Task Planning and Continuous Motion Generation with LLMs. Image credit: sdecoret/Shutterstock

*Important notice: arXiv publishes preliminary scientific reports that are not peer-reviewed and, therefore, should not be regarded as definitive, used to guide development decisions, or treated as established information in the field of artificial intelligence research.

LLM3 incorporated motion planning feedback iteratively to refine action sequences, reducing the need for manually designed, domain-specific interfaces. Through simulations and physical experiments, LLM3 demonstrated effectiveness in solving TAMP problems and efficiently selecting action parameters, with motion failure reasoning contributing significantly to its success.

Background

TAMP is critical for autonomous robots to effectively navigate complex environments and accomplish diverse tasks. TAMP divides planning into symbolic task planning and low-level motion planning stages. Traditional TAMP methods often rely on manually designed interfaces between symbolic and continuous domains, leading to domain-specific solutions and limited generalizability.

Previous approaches have attempted to address this challenge by incorporating data-driven heuristics or designing specialized communication modules. However, these methods lack generalizability across domains and require substantial manual effort.

The paper introduced LLM3, a novel TAMP framework that leverages pre-trained LLMs to bridge the gap between symbolic and continuous planning domains. LLM3 used LLMs to propose symbolic action sequences and generate continuous action parameters, benefiting from the implicit heuristics encoded in the LLM. It also reasoned over motion planning feedback to refine action sequences and parameters iteratively. By employing LLMs as both task planners and informed parameter samplers, LLM3 offered a domain-independent approach to TAMP, eliminating the need for manually designed symbolic domain files.
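To make this dual role concrete, the sketch below (in Python) shows one plausible way to represent a plan step that pairs a symbolic action with LLM-proposed continuous parameters. The class, field names, and example values are illustrative assumptions, not data structures taken from the paper.

    from dataclasses import dataclass, field
    from typing import List

    @dataclass
    class HybridAction:
        """One plan step: a symbolic action plus the continuous parameters
        an LLM might propose for it (all fields are illustrative)."""
        name: str                 # e.g., "pick" or "place"
        target_object: str        # e.g., "box_1"
        parameters: List[float] = field(default_factory=list)  # e.g., [x, y, yaw]

    # A toy plan of the kind a task and motion planner would verify step by step.
    plan = [
        HybridAction("pick", "box_1", [0.42, 0.10, 1.57]),
        HybridAction("place", "box_1", [0.55, 0.20, 0.00]),
    ]
    for action in plan:
        print(action)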

Methods

The researchers introduced LLM3, a TAMP framework leveraging pre-trained LLMs to reason on motion failure and generate refined action sequences. LLM3 iteratively refined symbolic actions and continuous parameters by integrating motion planning feedback. The framework alternated between reasoning with the LLM and verifying action feasibility with a motion planner, aiming to solve TAMP problems efficiently.
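A minimal Python sketch of this alternation is given below, with stand-in functions in place of the actual LLM and motion planner; the function names, the fixed action set, and the simulated failure rate are assumptions made purely for illustration.

    import random
    from typing import List, Optional, Tuple

    def propose_plan(feedback_trace: List[str]) -> List[str]:
        """Stand-in for an LLM call: in LLM3 the feedback trace would inform the
        next plan; here a fixed action set is shuffled to mimic refinement."""
        actions = ["pick box_1", "place box_1", "pick box_2", "place box_2"]
        random.shuffle(actions)
        return actions

    def check_motion(action: str) -> Tuple[bool, Optional[str]]:
        """Stand-in for a motion planner that occasionally reports failure."""
        if random.random() < 0.3:
            return False, f"{action}: motion planning failed"
        return True, None

    def plan_with_feedback(max_iterations: int = 10) -> Optional[List[str]]:
        feedback_trace: List[str] = []
        for _ in range(max_iterations):
            plan = propose_plan(feedback_trace)        # reason with the LLM
            failures = [msg for a in plan
                        for ok, msg in [check_motion(a)] if not ok]
            if not failures:
                return plan                            # every action is feasible
            feedback_trace.extend(failures)            # refine on the next round
        return None

    print(plan_with_feedback())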

Each planning iteration involved generating action sequences guided by reasoning over previous motion failures and updating the motion planning feedback trace. LLM3 aimed to improve action sequence quality incrementally, benefiting from the LLM's intrinsic heuristics and insights from earlier failures. Given a system message and a task description as the prompt, the LLM generated reasoning and action sequences autonomously, facilitating domain-independent planning.
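Assuming a chat-style message format, a prompt of this kind could be assembled roughly as follows; the wording of the system message and the feedback strings are hypothetical, not taken from the paper.

    def build_prompt(task_description: str, feedback_trace: list) -> list:
        """Assemble the messages an LLM planner might receive: a fixed system
        message, the task description, and any motion-failure feedback so far."""
        messages = [
            {"role": "system",
             "content": "You are a task and motion planner. Propose a symbolic "
                        "action sequence with continuous parameters, and revise "
                        "it when motion planning feedback reports failures."},
            {"role": "user", "content": task_description},
        ]
        for failure in feedback_trace:
            messages.append({"role": "user",
                             "content": f"Motion planning feedback: {failure}"})
        return messages

    print(build_prompt("Pack three boxes into the basket.",
                       ["place box_2 at (0.55, 0.20): collision with box_1"]))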

Two strategies, backtracking and planning from scratch, were employed to generate new action sequences, enabling the LLM to refine its outputs based on previous failures. Motion planning feedback was synthesized to provide meaningful insights into motion failures, aiding LLM3 in improving high-level planning effectively. Feedback included collision and unreachability categorizations, enhancing the LLM's understanding of failure causes. The researchers conducted simulations in a box-packing domain to quantify LLM3's effectiveness and efficiency, demonstrating its superiority over unguided planners.
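The sketch below illustrates the two refinement strategies and the categorized feedback in a simplified form; the failure labels, function names, and the assumption that backtracking keeps the feasible prefix of the plan are illustrative choices rather than the paper's exact implementation.

    from typing import List

    def summarize_failure(kind: str, action: str, detail: str) -> str:
        """Turn a raw motion-planner failure into categorized, readable feedback."""
        if kind == "collision":
            return f"{action} failed: collision with {detail}"
        if kind == "unreachable":
            return f"{action} failed: target pose {detail} is unreachable"
        return f"{action} failed: {kind} ({detail})"

    def refine(plan: List[str], failed_index: int, strategy: str) -> List[str]:
        """'backtrack' keeps the feasible prefix for the LLM to extend;
        'from_scratch' discards the plan so the LLM regenerates it entirely."""
        return plan[:failed_index] if strategy == "backtrack" else []

    plan = ["pick box_1", "place box_1", "pick box_2", "place box_2"]
    print(summarize_failure("collision", "place box_1", "the basket wall"))
    print(refine(plan, failed_index=1, strategy="backtrack"))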

Ablation studies underscored the importance of motion failure reasoning in LLM3's success. Additionally, qualitative experiments on a physical manipulator showcased the practical applicability of LLM3 in real-world settings. Overall, LLM3 represented a significant advancement in TAMP, offering a domain-independent interface and leveraging pre-trained LLMs for efficient and effective task planning with motion feedback integration.

Simulation and experiment

Through simulations and experiments, the effectiveness and efficiency of LLM3 were demonstrated, highlighting its potential for real-world applications. In simulations, LLM3 was evaluated in two settings: one with increasing object sizes and a constant basket size, and another with increasing basket sizes. LLM3's success rate (%SR), the number of LLM calls (#LM), and the number of motion planner calls (#MP) were quantitatively assessed.
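As a simple illustration of how these metrics could be computed from per-trial logs, consider the sketch below; the trial records are fabricated placeholders for illustration only and are not results from the study.

    from statistics import mean

    # Hypothetical per-trial records: whether the trial succeeded and how many
    # LLM and motion-planner calls it consumed (values are made up).
    trials = [
        {"success": True,  "llm_calls": 3, "mp_calls": 12},
        {"success": True,  "llm_calls": 5, "mp_calls": 20},
        {"success": False, "llm_calls": 8, "mp_calls": 35},
    ]

    success_rate = 100.0 * mean(t["success"] for t in trials)   # %SR
    avg_llm_calls = mean(t["llm_calls"] for t in trials)        # #LM
    avg_mp_calls = mean(t["mp_calls"] for t in trials)          # #MP

    print(f"%SR = {success_rate:.1f}, #LM = {avg_llm_calls:.1f}, #MP = {avg_mp_calls:.1f}")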

Results showed that integrating motion planning feedback improved %SR while reducing #LM and #MP. Surprisingly, no clear advantage was observed between the backtracking and planning-from-scratch strategies. Additionally, an ablation study compared LLM3 with baseline methods, revealing the framework's superiority in %SR and efficiency. LLM3 significantly reduced the number of iterations and motion planner calls required to achieve feasible action sequences compared to random sampling.

Furthermore, LLM3's ability to act as an informed action parameter sampler was investigated. Results indicated that leveraging LLMs for action parameter selection substantially reduced the sampling iterations and motion planner calls needed to generate feasible action sequences, with further improvements observed when incorporating motion planning feedback. In a real-world experiment with a physical robot manipulator, LLM3 successfully performed a box-packing task despite uncertainties in perception and execution. The robot accurately identified and manipulated objects, demonstrating the practicality and robustness of LLM3 in real-world scenarios.
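The toy comparison below sketches why an informed sampler can reduce motion-planner calls: a stand-in "LLM" that proposes placements near the feasible region typically succeeds in far fewer attempts than uniform random sampling. The feasibility region, sampling distributions, and function names are all illustrative assumptions.

    import random
    from typing import Callable, Tuple

    def placement_is_feasible(x: float, y: float) -> bool:
        """Toy feasibility check standing in for a motion planner: the placement
        must fall inside a small free region of the basket."""
        return 0.4 <= x <= 0.6 and 0.1 <= y <= 0.3

    def random_sampler() -> Tuple[float, float]:
        return random.uniform(0.0, 1.0), random.uniform(0.0, 1.0)

    def informed_sampler() -> Tuple[float, float]:
        """Stand-in for an LLM that has read the scene description and proposes
        placements near the basket's free region."""
        return random.gauss(0.5, 0.05), random.gauss(0.2, 0.05)

    def calls_until_feasible(sampler: Callable[[], Tuple[float, float]]) -> int:
        calls = 0
        while True:
            calls += 1
            if placement_is_feasible(*sampler()):
                return calls

    print("random sampler:  ", calls_until_feasible(random_sampler), "motion-planner calls")
    print("informed sampler:", calls_until_feasible(informed_sampler), "motion-planner calls")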

Conclusion

In conclusion, LLM3 represented a significant advancement in TAMP by leveraging pre-trained LLMs to bridge symbolic task planning and continuous motion generation. Through simulations and physical experiments, LLM3 demonstrated its effectiveness in solving TAMP problems efficiently, with motion failure reasoning playing a crucial role in refining action sequences. The framework's ability to integrate motion planning feedback and its practical applicability in real-world settings showcased its potential for autonomous robotic manipulation tasks.



Written by

Soham Nandi

Soham Nandi is a technical writer based in Memari, India. His academic background is in Computer Science Engineering, specializing in Artificial Intelligence and Machine Learning. He has extensive experience in Data Analytics, Machine Learning, and Python, and has worked on group projects involving Computer Vision, Image Classification, and App Development.

