Large Language Models as Planning Domain Generators

James Oswald; Kavitha Srinivas; Harsha Kokel; Junkyu Lee; Michael Katz; Shirin Sohrabi

doi:10.1609/icaps.v34i1.31502

Authors

James Oswald Rensselaer Polytechnic Institute
Kavitha Srinivas IBM Research
Harsha Kokel IBM Research
Junkyu Lee IBM Research
Michael Katz IBM Research
Shirin Sohrabi IBM Research

DOI:

https://doi.org/10.1609/icaps.v34i1.31502

Abstract

Developing domain models is one of the few remaining places that require manual human labor in AI planning. Thus, in order to make planning more accessible, it is desirable to automate the process of domain model generation. To this end, we investigate if large language models (LLMs) can be used to generate planning domain models from simple textual descriptions. Specifically, we introduce a framework for automated evaluation of LLM-generated domains by comparing the sets of plans for domain instances. Finally, we perform an empirical analysis of 7 large language models, including coding and chat models across 9 different planning domains, and under three classes of natural language domain descriptions. Our results indicate that LLMs, particularly those with high parameter counts, exhibit a moderate level of proficiency in generating correct planning domains from natural language descriptions. Our code is available at https://github.com/IBM/NL2PDDL.

Large Language Models as Planning Domain Generators

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information