Grounding Acoustic Echoes in Single View Geometry Estimation

Authors

  • Muhammad Wajahat Hussain University of Zaragoza
  • Javier Civera University of Zaragoza
  • Luis Montano Universidad de Zaragoza

DOI:

https://doi.org/10.1609/aaai.v28i1.9140

Keywords:

layout, audio-visual scene understanding

Abstract

Extracting the 3D geometry plays an important part in scene understanding. Recently, robust visual descriptors are proposed for extracting the indoor scene layout from a passive agent’s perspective, specifically from a single image. Their robustness is mainly due to modelling the physical interaction of the underlying room geometry with the objects and the humans present in the room. In this work we add the physical constraints coming from acoustic echoes, generated by an audio source, to this visual model. Our audio-visual 3D geometry descriptor improves over the state of the art in passive perception models as we show in our experiments.

Downloads

Published

2014-06-21

How to Cite

Hussain, M. W., Civera, J., & Montano, L. (2014). Grounding Acoustic Echoes in Single View Geometry Estimation. Proceedings of the AAAI Conference on Artificial Intelligence, 28(1). https://doi.org/10.1609/aaai.v28i1.9140