Fine-Grained Fashion Representation Learning by Online Deep Clustering
Abstract
Fashion designs are rich in visual details associated with various visual attributes at both global and local levels. As a result, effective modeling and analyzing fashion requires fine-grained representations for individual attributes. In this work, we present a deep learning based online clustering method to jointly learn fine-grained fashion representations for all attributes at both instance and cluster level, where the attribute-specific cluster centers are online estimated. Based on the similarity between fine-grained representations and cluster centers, attribute-specific embedding spaces are further segmented into class-specific embedding spaces for fine-grained fashion retrieval. To better regulate the learning process, we design a three-stage learning scheme, to progressively incorporate different supervisions at both instance and cluster level, from both original and augmented data, and with ground-truth and pseudo labels. Experiments on FashionAI and DARN datasets in retrieval task demonstrated the efficacy of our method compared with competing baselines.
Cite
Text
Jiao et al. "Fine-Grained Fashion Representation Learning by Online Deep Clustering." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19812-0_2Markdown
[Jiao et al. "Fine-Grained Fashion Representation Learning by Online Deep Clustering." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/jiao2022eccv-finegrained/) doi:10.1007/978-3-031-19812-0_2BibTeX
@inproceedings{jiao2022eccv-finegrained,
title = {{Fine-Grained Fashion Representation Learning by Online Deep Clustering}},
author = {Jiao, Yang and Xie, Ning and Gao, Yan and Wang, Chien-chih and Sun, Yi},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19812-0_2},
url = {https://mlanthology.org/eccv/2022/jiao2022eccv-finegrained/}
}