Photogrammetry and VR for Comparing 2D and Immersive Linguistic Data Collection (Student Abstract)
DOI:
https://doi.org/10.1609/aaai.v37i13.27016Keywords:
Computer Vision, Human Robot Interaction, Photogrammetry, Grounded Language LearningAbstract
The overarching goal of this work is to enable the collection of language describing a wide variety of objects viewed in virtual reality. We aim to create full 3D models from a small number of ‘keyframe’ images of objects found in the publicly available Grounded Language Dataset (GoLD) using photogrammetry. We will then collect linguistic descriptions by placing our models in virtual reality and having volunteers describe them. To evaluate the impact of virtual reality immersion on linguistic descriptions of the objects, we intend to apply contrastive learning to perform grounded language learning, then compare the descriptions collected from images (in GoLD) versus our models.Downloads
Published
2024-07-15
How to Cite
Rubinstein, J., Matuszek, C., & Engel, D. (2024). Photogrammetry and VR for Comparing 2D and Immersive Linguistic Data Collection (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16312-16313. https://doi.org/10.1609/aaai.v37i13.27016
Issue
Section
AAAI Student Abstract and Poster Program