Diagram Understanding in Geometry Questions

Min Joon Seo; Hannaneh Hajishirzi; Ali Farhadi; Oren Etzioni

doi:10.1609/aaai.v28i1.9146

Authors

Min Joon Seo University of Washington
Hannaneh Hajishirzi University of Washington
Ali Farhadi University of Washington
Oren Etzioni Allen Institute for AI

DOI:

https://doi.org/10.1609/aaai.v28i1.9146

Keywords:

artificial intelligence, AI, diagram understanding, geometry problem, math problem, hough, submodular, diagram representation

Abstract

Automatically solving geometry questions is a long-standing AI problem. A geometry question typically includes a textual description accompanied by a diagram. The first step in solving geometry questions is diagram understanding, which consists of identifying visual elements in the diagram, their locations, their geometric properties, and aligning them to corresponding textual descriptions. In this paper, we present a method for diagram understanding that identifies visual elements in a diagram while maximizing agreement between textual and visual data. We show that the method's objective function is submodular; thus we are able to introduce an efficient method for diagram understanding that is close to optimal. To empirically evaluate our method, we compile a new dataset of geometry questions (textual descriptions and diagrams) and compare with baselines that utilize standard vision techniques. Our experimental evaluation shows an F1 boost of more than 17% in identifying visual elements and 25% in aligning visual elements with their textual descriptions.

Diagram Understanding in Geometry Questions

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription