Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention

Authors

  • Naman Shah Arizona State University
  • Siddharth Srivastava Arizona State University

DOI:

https://doi.org/10.1609/aaai.v38i9.28903

Keywords:

ROB: Motion and Path Planning, ROB: Learning & Optimization for ROB, PRS: Learning for Planning and Scheduling, PRS: Planning/Scheduling and Learning

Abstract

This paper addresses the problem of inventing and using hierarchical representations for stochastic robot-planning problems. Rather than using hand-coded state or action representations as input, it presents new methods for learning how to create a high-level action representation for long-horizon, sparse reward robot planning problems in stochastic settings with unknown dynamics. After training, this system yields a robot-specific but environment independent planning system. Given new problem instances in unseen stochastic environments, it first creates zero-shot options (without any experience on the new environment) with dense pseudo-rewards and then uses them to solve the input problem in a hierarchical planning and refinement process. Theoretical results identify sufficient conditions for completeness of the presented approach. Extensive empirical analysis shows that even in settings that go beyond these sufficient conditions, this approach convincingly outperforms baselines by 2x in terms of solution time with orders of magnitude improvement in solution quality.

Published

2024-03-24

How to Cite

Shah, N., & Srivastava, S. (2024). Hierarchical Planning and Learning for Robots in Stochastic Settings Using Zero-Shot Option Invention. Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), 10358-10367. https://doi.org/10.1609/aaai.v38i9.28903

Issue

Section

Intelligent Robots (ROB)