Diagram Understanding in Geometry Questions

Authors

  • Min Joon Seo University of Washington
  • Hannaneh Hajishirzi University of Washington
  • Ali Farhadi University of Washington
  • Oren Etzioni Allen Institute for AI

DOI:

https://doi.org/10.1609/aaai.v28i1.9146

Keywords:

artificial intelligence, AI, diagram understanding, geometry problem, math problem, hough, submodular, diagram representation

Abstract

Automatically solving geometry questions is a long-standing AI problem. A geometry question typically includes a textual description accompanied by a diagram. The first step in solving geometry questions is diagram understanding, which consists of identifying visual elements in the diagram, their locations, their geometric properties, and aligning them to corresponding textual descriptions. In this paper, we present a method for diagram understanding that identifies visual elements in a diagram while maximizing agreement between textual and visual data. We show that the method's objective function is submodular; thus we are able to introduce an efficient method for diagram understanding that is close to optimal. To empirically evaluate our method, we compile a new dataset of geometry questions (textual descriptions and diagrams) and compare with baselines that utilize standard vision techniques. Our experimental evaluation shows an F1 boost of more than 17% in identifying visual elements and 25% in aligning visual elements with their textual descriptions.

Downloads

Published

2014-06-21

How to Cite

Seo, M. J., Hajishirzi, H., Farhadi, A., & Etzioni, O. (2014). Diagram Understanding in Geometry Questions. Proceedings of the AAAI Conference on Artificial Intelligence, 28(1). https://doi.org/10.1609/aaai.v28i1.9146