Scalable Spatial Memory for Scene Rendering and Navigation
Keywords: CV: Scene Analysis & Understanding, CV: 3D Computer Vision, CV: Vision for Robotics & Autonomous Driving
Abstract
Neural scene representation and rendering methods have shown promise in learning the implicit form of scene structure without supervision. However, the implicit representation learned by most existing methods is non-expandable and cannot be inferred online for novel scenes, which makes the learned representation difficult to apply across different reinforcement learning (RL) tasks. In this work, we introduce the Scene Memory Network (SMN) to achieve online spatial memory construction and expansion for view rendering in novel scenes. SMN models camera projection and back-projection as spatially aware memory control processes, where the memory values store the information of a partial 3D area and the memory keys indicate the position of that area. The memory controller can learn geometric properties from observations without the camera's intrinsic parameters or depth supervision. We further apply the memory constructed by SMN to exploration and navigation tasks. The experimental results demonstrate the generalization ability of the proposed SMN in large-scale scene synthesis and its potential to improve the performance of spatial RL tasks.
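To make the key-value idea in the abstract concrete, the following is a minimal illustrative sketch, not the authors' SMN. It assumes a toy memory whose keys are 3D positions and whose values are feature vectors, with reads weighted by a softmax over negative squared distance. The class name `SpatialMemory`, the methods `write` and `read`, the `temperature` parameter, and the addressing rule are all hypothetical choices made for illustration; the abstract does not specify how SMN addresses its memory.

```python
import math


class SpatialMemory:
    """Toy key-value memory (hypothetical, not SMN itself).

    Keys are 3D positions; values are feature vectors describing
    the partial 3D area around that position.
    """

    def __init__(self):
        self.keys = []    # list of (x, y, z) tuples
        self.values = []  # list of feature vectors (lists of floats)

    def write(self, position, feature):
        # Expansion: each new observation appends a new memory slot,
        # so the memory grows as more of the scene is observed.
        self.keys.append(tuple(position))
        self.values.append(list(feature))

    def read(self, query, temperature=1.0):
        # Assumed addressing rule: softmax over negative squared
        # Euclidean distance between the query position and each key.
        if not self.keys:
            raise ValueError("memory is empty")
        scores = [
            -sum((q - k) ** 2 for q, k in zip(query, key)) / temperature
            for key in self.keys
        ]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        dim = len(self.values[0])
        # The read-out is the weighted average of the stored values.
        readout = [
            sum(w * v[i] for w, v in zip(weights, self.values))
            for i in range(dim)
        ]
        return readout, weights
```

In this sketch, querying a position close to a stored key returns a read-out dominated by that slot's value, and writing further observations extends the memory without changing existing slots. A learned controller, as described in the abstract, would replace the fixed distance-based rule assumed here.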
How to Cite
Chen, W.-C., Chen, C.-S., Chiu, W.-C., & Hu, M.-C. (2023). Scalable Spatial Memory for Scene Rendering and Navigation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(1), 369-377. https://doi.org/10.1609/aaai.v37i1.25110
AAAI Technical Track on Computer Vision I