SLAMuZero: Plan and Learn to Map for Joint SLAM and Navigation

Authors

  • Bowen Fang Columbia University
  • Xu Chen Columbia University
  • Zhengkun Pan Columbia University
  • Xuan Di Columbia University

DOI:

https://doi.org/10.1609/icaps.v34i1.31476

Abstract

MuZero has demonstrated remarkable performance in board and video games where Monte Carlo tree search (MCTS) method is utilized to learn and adapt to different game environments. This paper leverages the strength of MuZero to enhance agents’ planning capability for joint active simultaneous localization and mapping (SLAM) and navigation tasks, which require an agent to navigate an unknown environment while simultaneously constructing a map and localizing itself. We propose SLAMuZero, a novel approach for joint SLAM and navigation, which employs a search process that uses an explicit encoder-decoder architecture for mapping, followed by a prediction function to evaluate policy and value based on the generated map. SLAMuZero outperforms the state-of-the-art baseline and significantly reduces training time, underscoring the efficiency of our approach. Additionally, we develop a new open source library for implementing SLAMuZero, which is a flexible and modular toolkit for researchers and practitioners (https://github.com/bwfbowen/SLAMuZero).

Downloads

Published

2024-05-30

How to Cite

Fang, B., Chen, X., Pan, Z., & Di, X. (2024). SLAMuZero: Plan and Learn to Map for Joint SLAM and Navigation. Proceedings of the International Conference on Automated Planning and Scheduling, 34(1), 196-200. https://doi.org/10.1609/icaps.v34i1.31476