Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild

Abstract

Despite the success in large language models, constructing a data-efficient generalist for dense visual prediction presents a distinct challenge due to the variation in label structures across different tasks. In this study, we explore a universal model that can flexibly adapt to unseen dense label structures with a few examples, enabling it to serve as a data-efficient vision generalist in diverse real-world scenarios. To this end, we base our method on a powerful meta-learning framework and explore several axes to improve its performance and versatility for real-world problems, such as flexible adaptation mechanisms and scalability. We evaluate our model across a spectrum of unseen real-world scenarios where low-shot learning is desirable, including video, 3D, medical, biological, and user-interactive tasks. Equipped with a generic architecture and an effective adaptation mechanism, our model flexibly adapts to all of these tasks with at most 50 labeled images, showcasing a significant advancement over existing data-efficient generalist approaches. Codes are available at https: //github.com/GitGyun/chameleon.

Cite

Text

Kim et al. "Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73337-6_24

Markdown

[Kim et al. "Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/kim2024eccv-chameleon/) doi:10.1007/978-3-031-73337-6_24

BibTeX

@inproceedings{kim2024eccv-chameleon,
  title     = {{Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild}},
  author    = {Kim, Donggyun and Cho, Seongwoong and Kim, Semin and Luo, Chong and Hong, Seunghoon},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73337-6_24},
  url       = {https://mlanthology.org/eccv/2024/kim2024eccv-chameleon/}
}