Fashion-Specific Ambiguous Expression Interpretation with Partial Visual-Semantic Embedding

Abstract

A novel technology named fashion intelligence system has been proposed to quantify ambiguous expressions unique to fashion, such as "casual," "adult-casual," and "office-casual," and to support users’ understanding of fashion. However, the existing visual-semantic embedding (VSE) model, which is the basis of its system, does not support situations in which images are composed of multiple parts such as hair, tops, pants, skirts, and shoes. We propose partial VSE, which enables sensitive learning for each part of the fashion outfits. This enables five types of practical functionalities, particularly image-retrieval tasks in which changes are made only to the specified parts and image-reordering tasks that focus on the specified parts by the single model. Based on both the multiple unique qualitative and quantitative evaluation experiments, we show the effectiveness of the proposed model.

Cite

Text

Shimizu et al. "Fashion-Specific Ambiguous Expression Interpretation with Partial Visual-Semantic Embedding." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023. doi:10.1109/CVPRW59228.2023.00353

Markdown

[Shimizu et al. "Fashion-Specific Ambiguous Expression Interpretation with Partial Visual-Semantic Embedding." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023.](https://mlanthology.org/cvprw/2023/shimizu2023cvprw-fashionspecific/) doi:10.1109/CVPRW59228.2023.00353

BibTeX

@inproceedings{shimizu2023cvprw-fashionspecific,
  title     = {{Fashion-Specific Ambiguous Expression Interpretation with Partial Visual-Semantic Embedding}},
  author    = {Shimizu, Ryotaro and Nakamura, Takuma and Goto, Masayuki},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2023},
  pages     = {3497-3502},
  doi       = {10.1109/CVPRW59228.2023.00353},
  url       = {https://mlanthology.org/cvprw/2023/shimizu2023cvprw-fashionspecific/}
}