Kuo, Weicheng

18 publications

WACV 2025 Learning Visual Grounding from Generative Vision and Language Model Shijie Wang, Dahun Kim, Ali Taalimi, Chen Sun, Weicheng Kuo
ECCV 2024 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation Zihao Xiao, Longlong Jing, Shangxuan Wu, Alex Zihao Zhu, Jingwei Ji, Chiyu Max Jiang, Wei-Chih Hung, Thomas Funkhouser, Weicheng Kuo, Anelia Angelova, Yin Zhou, Shiwei Sheng
ECCV 2024 Region-Centric Image-Language Pretraining for Open-Vocabulary Detection Dahun Kim, Anelia Angelova, Weicheng Kuo
ICCV 2023 Contrastive Feature Masking Open-Vocabulary Vision Transformer Dahun Kim, Anelia Angelova, Weicheng Kuo
NeurIPS 2023 DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model Xiuye Gu, Yin Cui, Jonathan Huang, Abdullah Rashwan, Xuan Yang, Xingyi Zhou, Golnaz Ghiasi, Weicheng Kuo, Huizhong Chen, Liang-Chieh Chen, David A. Ross
ICLRW 2023 Dynamic Pretraining of Vision-Language Models Aj Piergiovanni, Weicheng Kuo, Wei Li, Anelia Angelova
TMLR 2023 MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks Weicheng Kuo, Aj Piergiovanni, Dahun Kim, Xiyang Luo, Benjamin Caine, Wei Li, Abhijit Ogale, Luowei Zhou, Andrew M. Dai, Zhifeng Chen, Claire Cui, Anelia Angelova
ICLR 2023 Open-Vocabulary Object Detection upon Frozen Vision and Language Models Weicheng Kuo, Yin Cui, Xiuye Gu, Aj Piergiovanni, Anelia Angelova
ICLR 2023 PaLI: A Jointly-Scaled Multilingual Language-Image Model Xi Chen, Xiao Wang, Soravit Changpinyo, Aj Piergiovanni, Piotr Padlewski, Daniel Salz, Sebastian Goodman, Adam Grycner, Basil Mustafa, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Nan Ding, Keran Rong, Hassan Akbari, Gaurav Mishra, Linting Xue, Ashish V Thapliyal, James Bradbury, Weicheng Kuo, Mojtaba Seyedhosseini, Chao Jia, Burcu Karagol Ayan, Carlos Riquelme Ruiz, Andreas Peter Steiner, Anelia Angelova, Xiaohua Zhai, Neil Houlsby, Radu Soricut
TMLR 2023 RECLIP: Resource-Efficient CLIP by Training with Small Images Runze Li, Dahun Kim, Bir Bhanu, Weicheng Kuo
CVPR 2023 Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers Dahun Kim, Anelia Angelova, Weicheng Kuo
CVPR 2023 Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning Aj Piergiovanni, Weicheng Kuo, Anelia Angelova
ECCV 2022 FindIt: Generalized Localization with Natural Language Queries Weicheng Kuo, Fred Bertsch, Wei Li, Aj Piergiovanni, Mohammad Saffar, Anelia Angelova
ICLR 2022 Open-Vocabulary Object Detection via Vision and Language Knowledge Distillation Xiuye Gu, Tsung-Yi Lin, Weicheng Kuo, Yin Cui
ECCV 2022 Video Question Answering with Iterative Video-Text Co-Tokenization Aj Piergiovanni, Kairo Morton, Weicheng Kuo, Michael S. Ryoo, Anelia Angelova
ICCV 2021 Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin, Angela Dai
ECCV 2020 Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin, Angela Dai
ICCV 2015 DeepBox: Learning Objectness with Convolutional Networks Weicheng Kuo, Bharath Hariharan, Jitendra Malik