MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence
Abstract
Semantic correspondence have built up a new way for object recognition. However current single-object matching schema can be hard for discovering commonalities for a category and far from the real-world recognition tasks. To fill this gap, we design the multi-instance semantic correspondence task which aims at constructing the correspondence between multiple objects in an image pair. To support this task, we build a multi-instance semantic correspondence (MISC) dataset from COCO Detection 2017 task called MISC210K. We construct our dataset as three steps: (1) category selection and data cleaning; (2) keypoint design based on 3D models and object description rules; (3) human-machine collaborative annotation. Following these steps, we select 34 classes of objects with 4,812 challenging images annotated via a well designed semi-automatic workflow, and finally acquire 218,179 image pairs with instance masks and instance-level keypoint pairs annotated. We design a dual-path collaborative learning pipeline to train instance-level co-segmentation task and fine-grained level correspondence task together. Benchmark evaluation and further ablation results with detailed analysis are provided with three future directions proposed. Our project is available on https://github.com/YXSUNMADMAX/MISC210K.
Cite
Text
Sun et al. "MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.00688Markdown
[Sun et al. "MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/sun2023cvpr-misc210k/) doi:10.1109/CVPR52729.2023.00688BibTeX
@inproceedings{sun2023cvpr-misc210k,
title = {{MISC210K: A Large-Scale Dataset for Multi-Instance Semantic Correspondence}},
author = {Sun, Yixuan and Huang, Yiwen and Guo, Haijing and Zhao, Yuzhou and Wu, Runmin and Yu, Yizhou and Ge, Weifeng and Zhang, Wenqiang},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2023},
pages = {7121-7130},
doi = {10.1109/CVPR52729.2023.00688},
url = {https://mlanthology.org/cvpr/2023/sun2023cvpr-misc210k/}
}