PhyPlan: Learning to Plan Tasks with Generalizable and Rapid Physical Reasoning for Embodied Manipulation

Authors

  • Ankit Kanwar Indian Institute of Technology, Delhi
  • Hartej Soin Indian Institute of Technology, Delhi
  • Abhinav Barnawal Indian Institute of Technology, Delhi
  • Mudit Chopra Indian Institute of Technology, Delhi
  • Harshil Vagadia Indian Institute of Technology, Delhi
  • Tamajit Banerjee Indian Institute of Technology, Delhi
  • Shreshth Tuli Indian Institute of Technology, Delhi
  • Rohan Paul Indian Institute of Technology, Delhi
  • Souvik Chakraborty Indian Institute of Technology, Delhi

DOI:

https://doi.org/10.1609/aaai.v40i22.38900

Abstract

Given the task of landing a ball in a goal region beyond direct reach, humans can often throw, slide, or rebound objects against the wall to attain the goal. Enabling robots to replicate such reasoning is non-trivial as it requires multi-step planning and involves a mixture of discrete and continuous action spaces, a sparse and sensitive reward structure, computationally expensive simulations, and an incomplete understanding of the environment's physics. We present PhyPlan, a physics-informed and adaptable planning framework for efficient multi-step physical reasoning. At its core, PhyPlan comprises of Generative Flow Networks (GFlowNets) and Monte Carlo Tree Search (MCTS) to explore and evaluate sequences of object interactions. GFlowNets sample discrete action sequences in proportion to their associated reward, enabling broad and reward-driven exploration of the discrete planning space. MCTS complements this by adaptively balancing the use of a fast but approximate pre-trained physics-informed dynamics predictor and costly but accurate environment rollouts, ensuring both speed and precision in planning. The known and actual physics discrepancy is captured using Gaussian Process Regression. Experiments on benchmark simulated tasks requiring composition of collisions, slides, and rebounds demonstrate that PhyPlan achieves a 45\% higher success rate and up to 3× efficiency gains over state-of-the-art model-based reinforcement learning approaches.

Published

2026-03-14

How to Cite

Kanwar, A., Soin, H., Barnawal, A., Chopra, M., Vagadia, H., Banerjee, T., … Chakraborty, S. (2026). PhyPlan: Learning to Plan Tasks with Generalizable and Rapid Physical Reasoning for Embodied Manipulation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(22), 18360–18369. https://doi.org/10.1609/aaai.v40i22.38900

Issue

Section

AAAI Technical Track on Intelligent Robotics