Learning from the Tangram to Solve Mini Visual Tasks

Yizhou Zhao; Liang Qiu; Pan Lu; Feng Shi; Tian Han; Song-Chun Zhu

doi:10.1609/aaai.v36i3.20260

Authors

Yizhou Zhao (UCLA Center for Vision, Cognition, Learning, and Autonomy)
Liang Qiu (UCLA Center for Vision, Cognition, Learning, and Autonomy)
Pan Lu (UCLA Center for Vision, Cognition, Learning, and Autonomy)
Feng Shi (UCLA Center for Vision, Cognition, Learning, and Autonomy)
Tian Han (Stevens Institute of Technology)
Song-Chun Zhu (UCLA Center for Vision, Cognition, Learning, and Autonomy)

DOI:

https://doi.org/10.1609/aaai.v36i3.20260

Keywords:

Computer Vision (CV), Machine Learning (ML)

Abstract

Current pre-training methods in computer vision focus on natural images from daily-life contexts. However, abstract diagrams such as icons and symbols are common and important in the real world. We are inspired by Tangram, a game that requires replicating an abstract pattern from seven dissected shapes. By recording human experience in solving tangram puzzles, we present the Tangram dataset and show that a neural model pre-trained on the Tangram helps solve some mini visual tasks based on low-resolution vision. Extensive experiments demonstrate that our proposed method generates intelligent solutions for aesthetic tasks such as folding clothes and evaluating room layouts. The pre-trained feature extractor can facilitate the convergence of few-shot learning tasks on human handwriting and improve accuracy in identifying icons by their contours. The Tangram dataset is available at https://github.com/yizhouzhao/Tangram.
