FloNa: Floor Plan Guided Embodied Visual Navigation

Authors

  • Jiaxin Li Beijing Institute of Technology
  • Weiqi Huang Beijing Institute of Technology
  • Zan Wang Beijing Institute of Technology
  • Wei Liang Beijing Institute of Technology Yangtze Delta Region Academy of Beijing Institute of Technology, Jiaxing, China
  • Huijun Di Beijing Institute of Technology
  • Feng Liu Beijing Racobit Electronic Information Technology Co., Ltd.

DOI:

https://doi.org/10.1609/aaai.v39i14.33601

Abstract

Humans naturally rely on floor plans to navigate in unfamiliar environments, as they are readily available, reliable, and provide rich geometrical guidance. However, existing visual navigation settings overlook this valuable prior knowledge, leading to limited efficiency and accuracy. To eliminate this gap, we introduce a novel navigation task: Floor Plan Visual Navigation (FloNa), the first attempt to incorporate floor plans into embodied visual navigation. While the floor plan offers significant advantages, two key challenges emerge: (1) handling the spatial inconsistency between the floor plan and the actual scene layout for collision-free navigation, and (2) aligning observed images with the floor plan sketch despite their distinct modalities. To address these challenges, we propose FloDiff, a novel diffusion policy framework incorporating a localization module to facilitate alignment between the current observation and the floor plan. We further collect 20k navigation episodes across 117 scenes in the iGibson simulator to support the training and evaluation. Extensive experiments demonstrate the effectiveness and efficiency of our framework in unfamiliar scenes using floor plan knowledge.

Published

2025-04-11

How to Cite

Li, J., Huang, W., Wang, Z., Liang, W., Di, H., & Liu, F. (2025). FloNa: Floor Plan Guided Embodied Visual Navigation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(14), 14610–14618. https://doi.org/10.1609/aaai.v39i14.33601

Issue

Section

AAAI Technical Track on Intelligent Robots