Multi-Task Curriculum Transfer Deep Learning of Clothing Attributes
Abstract
Recognising detailed clothing characteristics (finegrained attributes) in unconstrained images of people inthe-wild is a challenging task for computer vision, especially when there is only limited training data from the wild whilst most data available for model learning are captured in well-controlled environments using fashion models (well lit, no background clutter, frontal view, high-resolution). In this work, we develop a deep learning framework capable of model transfer learning from well-controlled shop clothing images collected from web retailers to in-the-wild images from the street. Specifically, we formulate a novel Multi-Task Curriculum Transfer (MTCT) deep learning method to explore multiple sources of different types of web annotations with multi-labelled fine-grained attributes. Our multi-task loss function is designed to extract more discriminative representations in training by jointly learning all attributes, and our curriculum strategy exploits the staged easy-to-hard transfer learning motivated by cognitive studies. We demonstrate the advantages of the MTCT model over the state-of-the-art methods on the X-Domain benchmark, a large scale clothing attribute dataset. Moreover, we show that the MTCT model has a notable advantage over contemporary models when the training data size is small.
Cite
Text
Dong et al. "Multi-Task Curriculum Transfer Deep Learning of Clothing Attributes." IEEE/CVF Winter Conference on Applications of Computer Vision, 2017. doi:10.1109/WACV.2017.64Markdown
[Dong et al. "Multi-Task Curriculum Transfer Deep Learning of Clothing Attributes." IEEE/CVF Winter Conference on Applications of Computer Vision, 2017.](https://mlanthology.org/wacv/2017/dong2017wacv-multi/) doi:10.1109/WACV.2017.64BibTeX
@inproceedings{dong2017wacv-multi,
title = {{Multi-Task Curriculum Transfer Deep Learning of Clothing Attributes}},
author = {Dong, Qi and Gong, Shaogang and Zhu, Xiatian},
booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
year = {2017},
pages = {520-529},
doi = {10.1109/WACV.2017.64},
url = {https://mlanthology.org/wacv/2017/dong2017wacv-multi/}
}