Foundation Model-Based Data Selection for Dense Prediction Tasks

Abstract

Data selection, the problem of selecting a small dataset to be labeled from a large unlabeled pool is an important practical problem. In particular, dense prediction tasks such as object detection and segmentation require high-quality labels at pixel level, which are particularly costly to obtain. We propose object-focused data selection (OFDS) which leverages object-level representations from foundation models to ensure that the selected image subsets semantically cover the target classes, including rare ones. We show that OFDS achieves state-of-the-art performance both for object detection and image segmentation with substantial improvements over all baselines in scenarios with imbalanced class distributions. Moreover, we demonstrate that pre-training with autolabels from foundation models on the full datasets before fine-tuning on human-labeled subsets selected by OFDS further enhances the final performance. Finally, OFDS consistently improves active learning methods when replacing the random selection of the initial labeled dataset, the so-called "cold start problem'' of active learning, with OFDS.

Cite

Text

Popp et al. "Foundation Model-Based Data Selection for Dense Prediction Tasks." ICLR 2025 Workshops: FM-Wild, 2025.

Markdown

[Popp et al. "Foundation Model-Based Data Selection for Dense Prediction Tasks." ICLR 2025 Workshops: FM-Wild, 2025.](https://mlanthology.org/iclrw/2025/popp2025iclrw-foundation/)

BibTeX

@inproceedings{popp2025iclrw-foundation,
  title     = {{Foundation Model-Based Data Selection for Dense Prediction Tasks}},
  author    = {Popp, Niclas and Zhang, Dan and Metzen, Jan Hendrik and Hein, Matthias and Schott, Lukas},
  booktitle = {ICLR 2025 Workshops: FM-Wild},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/popp2025iclrw-foundation/}
}