LINGO-Space: Language-Conditioned Incremental Grounding for Space

Authors

  • Dohyun Kim Korea Advanced Institute of Science and Technology
  • Nayoung Oh Korea Advanced Institute of Science and Technology
  • Deokmin Hwang Korea Advanced Institute of Science and Technology
  • Daehyung Park Korea Advanced Institute of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v38i9.28898

Keywords:

ROB: Learning & Optimization for ROB, NLP: Language Grounding & Multi-modal NLP

Abstract

We aim to solve the problem of spatially localizing composite instructions referring to space: space grounding. Compared to current instance grounding, space grounding is challenging due to the ill-posedness of identifying locations referred to by discrete expressions and the compositional ambiguity of referring expressions. Therefore, we propose a novel probabilistic space-grounding methodology (LINGO-Space) that accurately identifies a probabilistic distribution of space being referred to and incrementally updates it, given subsequent referring expressions leveraging configurable polar distributions. Our evaluations show that the estimation using polar distributions enables a robot to ground locations successfully through 20 table-top manipulation benchmark tests. We also show that updating the distribution helps the grounding method accurately narrow the referring space. We finally demonstrate the robustness of the space grounding with simulated manipulation and real quadruped robot navigation tasks. Code and videos are available at https://lingo-space.github.io.

Published

2024-03-24

How to Cite

Kim, D., Oh, N., Hwang, D., & Park, D. (2024). LINGO-Space: Language-Conditioned Incremental Grounding for Space. Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), 10314–10322. https://doi.org/10.1609/aaai.v38i9.28898

Issue

Section

Intelligent Robots (ROB)