Learning from the Tangram to Solve Mini Visual Tasks

Abstract

Current pre-training methods in computer vision focus on natural images in the daily-life context. However, abstract diagrams such as icons and symbols are common and important in the real world. We are inspired by Tangram, a game that requires replicating an abstract pattern from seven dissected shapes. By recording human experience in solving tangram puzzles, we present the Tangram dataset and show that a pre-trained neural model on the Tangram helps solve some mini visual tasks based on low-resolution vision. Extensive experiments demonstrate that our proposed method generates intelligent solutions for aesthetic tasks such as folding clothes and evaluating room layouts. The pre-trained feature extractor can facilitate the convergence of few-shot learning tasks on human handwriting and improve the accuracy in identifying icons by their contours. The Tangram dataset is available at https://github.com/yizhouzhao/Tangram.

Cite

Text

Zhao et al. "Learning from the Tangram to Solve Mini Visual Tasks." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I3.20260

Markdown

[Zhao et al. "Learning from the Tangram to Solve Mini Visual Tasks." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/zhao2022aaai-learning/) doi:10.1609/AAAI.V36I3.20260

BibTeX

@inproceedings{zhao2022aaai-learning,
  title     = {{Learning from the Tangram to Solve Mini Visual Tasks}},
  author    = {Zhao, Yizhou and Qiu, Liang and Lu, Pan and Shi, Feng and Han, Tian and Zhu, Song-Chun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {3490-3498},
  doi       = {10.1609/AAAI.V36I3.20260},
  url       = {https://mlanthology.org/aaai/2022/zhao2022aaai-learning/}
}