Cho, Jaemin
22 publications
ICCV
2025
CAPTURE: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting
ECCV
2024
Contrastive Region Guidance: Improving Grounding in Vision-Language Models Without Training
CVPR
2024
Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts
NeurIPS
2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data