Challenges of Multi-Modal Coreset Selection for Depth Prediction

Abstract

Coreset selection methods are effective in accelerating training and reducing memory requirements but remain largely unexplored in applied multimodal settings. We adapt a state-of-the-art (SoTA) coreset selection technique for multimodal data, focusing on the depth prediction task. Our experiments with embedding aggregation and dimensionality reduction approaches reveal the challenges of extending unimodal algorithms to multimodal scenarios, highlighting the need for specialized methods to better capture inter-modal relationships.

Cite

Text

Moskvoretskii and Alvandian. "Challenges of Multi-Modal Coreset Selection for Depth Prediction." ICLR 2025 Workshops: ICBINB, 2025.

Markdown

[Moskvoretskii and Alvandian. "Challenges of Multi-Modal Coreset Selection for Depth Prediction." ICLR 2025 Workshops: ICBINB, 2025.](https://mlanthology.org/iclrw/2025/moskvoretskii2025iclrw-challenges/)

BibTeX

@inproceedings{moskvoretskii2025iclrw-challenges,
  title     = {{Challenges of Multi-Modal Coreset Selection for Depth Prediction}},
  author    = {Moskvoretskii, Viktor and Alvandian, Narek},
  booktitle = {ICLR 2025 Workshops: ICBINB},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/moskvoretskii2025iclrw-challenges/}
}