Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention
DOI:
https://doi.org/10.1609/aaai.v38i9.28903Keywords:
ROB: Motion and Path Planning, ROB: Learning & Optimization for ROB, PRS: Learning for Planning and Scheduling, PRS: Planning/Scheduling and LearningAbstract
This paper addresses the problem of inventing and using hierarchical representations for stochastic robot-planning problems. Rather than using hand-coded state or action representations as input, it presents new methods for learning how to create a high-level action representation for long-horizon, sparse reward robot planning problems in stochastic settings with unknown dynamics. After training, this system yields a robot-specific but environment independent planning system. Given new problem instances in unseen stochastic environments, it first creates zero-shot options (without any experience on the new environment) with dense pseudo-rewards and then uses them to solve the input problem in a hierarchical planning and refinement process. Theoretical results identify sufficient conditions for completeness of the presented approach. Extensive empirical analysis shows that even in settings that go beyond these sufficient conditions, this approach convincingly outperforms baselines by 2x in terms of solution time with orders of magnitude improvement in solution quality.Downloads
Published
2024-03-24
How to Cite
Shah, N., & Srivastava, S. (2024). Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention. Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), 10358-10367. https://doi.org/10.1609/aaai.v38i9.28903
Issue
Section
Intelligent Robots (ROB)