Photogrammetry and VR for Comparing 2D and Immersive Linguistic Data Collection (Student Abstract)

Authors

  • Jacob Rubinstein University of Maryland, Baltimore County
  • Cynthia Matuszek University of Maryland, Baltimore County
  • Don Engel University of Maryland, Baltimore County

DOI:

https://doi.org/10.1609/aaai.v37i13.27016

Keywords:

Computer Vision, Human Robot Interaction, Photogrammetry, Grounded Language Learning

Abstract

The overarching goal of this work is to enable the collection of language describing a wide variety of objects viewed in virtual reality. We aim to create full 3D models from a small number of ‘keyframe’ images of objects found in the publicly available Grounded Language Dataset (GoLD) using photogrammetry. We will then collect linguistic descriptions by placing our models in virtual reality and having volunteers describe them. To evaluate the impact of virtual reality immersion on linguistic descriptions of the objects, we intend to apply contrastive learning to perform grounded language learning, then compare the descriptions collected from images (in GoLD) versus our models.

Downloads

Published

2024-07-15

How to Cite

Rubinstein, J., Matuszek, C., & Engel, D. (2024). Photogrammetry and VR for Comparing 2D and Immersive Linguistic Data Collection (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16312-16313. https://doi.org/10.1609/aaai.v37i13.27016