Learning from the Tangram to Solve Mini Visual Tasks
Abstract
Current pre-training methods in computer vision focus on natural images in the daily-life context. However, abstract diagrams such as icons and symbols are common and important in the real world. We are inspired by Tangram, a game that requires replicating an abstract pattern from seven dissected shapes. By recording human experience in solving tangram puzzles, we present the Tangram dataset and show that a pre-trained neural model on the Tangram helps solve some mini visual tasks based on low-resolution vision. Extensive experiments demonstrate that our proposed method generates intelligent solutions for aesthetic tasks such as folding clothes and evaluating room layouts. The pre-trained feature extractor can facilitate the convergence of few-shot learning tasks on human handwriting and improve the accuracy in identifying icons by their contours. The Tangram dataset is available at https://github.com/yizhouzhao/Tangram.
Cite
Text
Zhao et al. "Learning from the Tangram to Solve Mini Visual Tasks." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I3.20260Markdown
[Zhao et al. "Learning from the Tangram to Solve Mini Visual Tasks." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/zhao2022aaai-learning/) doi:10.1609/AAAI.V36I3.20260BibTeX
@inproceedings{zhao2022aaai-learning,
title = {{Learning from the Tangram to Solve Mini Visual Tasks}},
author = {Zhao, Yizhou and Qiu, Liang and Lu, Pan and Shi, Feng and Han, Tian and Zhu, Song-Chun},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2022},
pages = {3490-3498},
doi = {10.1609/AAAI.V36I3.20260},
url = {https://mlanthology.org/aaai/2022/zhao2022aaai-learning/}
}