Lee, Yong Jae

69 publications

ICLR 2025 Aligned Datasets Improve Detection of Latent Diffusion-Generated Images Anirudh Sundara Rajan, Utkarsh Ojha, Jedidiah Schloesser, Yong Jae Lee

WACV 2025 An Investigation on LLMs' Visual Understanding Ability Using SVG for Image-Text Bridging Mu Cai, Zeyi Huang, Yuheng Li, Utkarsh Ojha, Haohan Wang, Yong Jae Lee

CVPR 2025 Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs Zeyi Huang, Yuyang Ji, Xiaofang Wang, Nikhil Mehta, Tong Xiao, Donghyun Lee, Sigmund Vanvalkenburgh, Shengxin Zha, Bolin Lai, Licheng Yu, Ning Zhang, Yong Jae Lee, Miao Liu

ICCV 2025 CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems Aniket Rege, Zinnia Nie, Mahesh Ramesh, Unmesh Raskar, Zhuoran Yu, Aditya Kusupati, Yong Jae Lee, Ramya Korlakai Vinayak

ICCV 2025 Customizing Domain Adapters for Domain Generalization Yuyang Ji, Zeyi Huang, Haohan Wang, Yong Jae Lee

TMLR 2025 Diversify, Don't Fine-Tune: Scaling up Visual Recognition Training with Synthetic Images Zhuoran Yu, Chenchen Zhu, Sean Culatana, Raghuraman Krishnamoorthi, Fanyi Xiao, Yong Jae Lee

ICLR 2025 LLaRA: Supercharging Robot Learning Data for Vision-Language Policy Xiang Li, Cristina Mata, Jongwoo Park, Kumara Kahatapitiya, Yoo Sung Jang, Jinghuan Shang, Kanchana Ranasinghe, Ryan D Burgert, Mu Cai, Yong Jae Lee, Michael S Ryoo

ICCV 2025 LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models Yuzhang Shang, Mu Cai, Bingxin Xu, Yong Jae Lee, Yan Yan

ICLR 2025 Matryoshka Multimodal Models Mu Cai, Jianwei Yang, Jianfeng Gao, Yong Jae Lee

ICML 2025 Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection Anirudh Sundara Rajan, Yong Jae Lee

ICCV 2025 X-Fusion: Introducing New Modality to Frozen Large Language Models Sicheng Mo, Thao Nguyen, Xun Huang, Siddharth Srinivasan Iyer, Yijun Li, Yuchen Liu, Abhishek Tandon, Eli Shechtman, Krishna Kumar Singh, Yong Jae Lee, Bolei Zhou, Yuheng Li

CVPR 2025 Yo'Chameleon: Personalized Vision and Language Generation Thao Nguyen, Krishna Kumar Singh, Jing Shi, Trung Bui, Yong Jae Lee, Yuheng Li

WACV 2024 Computer Vision on the Edge: Individual Cattle Identification in Real-Time with ReadMyCow System Moniek Smink, Haotian Liu, Dörte Döpfer, Yong Jae Lee

CVPR 2024 Edit One for All: Interactive Batch Image Editing Thao Nguyen, Utkarsh Ojha, Yuheng Li, Haotian Liu, Yong Jae Lee

CVPR 2024 Improved Baselines with Visual Instruction Tuning Haotian Liu, Chunyuan Li, Yuheng Li, Yong Jae Lee

NeurIPS 2024 Interfacing Foundation Models' Embeddings Xueyan Zou, Linjie Li, Jianfeng Wang, Jianwei Yang, Mingyu Ding, Junyi Wei, Zhengyuan Yang, Feng Li, Hao Zhang, Shilong Liu, Arul Aravinthan, Yong Jae Lee, Lijuan Wang

CPAL 2024 Investigating the Catastrophic Forgetting in Multimodal Large Language Model Fine-Tuning Yuexiang Zhai, Shengbang Tong, Xiao Li, Mu Cai, Qing Qu, Yong Jae Lee, Yi Ma

NeurIPSW 2024 Matryoshka Multimodal Models Mu Cai, Jianwei Yang, Jianfeng Gao, Yong Jae Lee

ECCV 2024 Removing Distributional Discrepancies in Captions Improves Image-Text Alignment Mu Cai, Haotian Liu, Yuheng Li, Yijun Li, Eli Shechtman, Zhe Lin, Yong Jae Lee, Krishna Kumar Singh

NeurIPSW 2024 TemporalBench: Benchmarking Fine-Grained Temporal Understanding for Multimodal Video Models Mu Cai, Reuben Tan, Jianrui Zhang, Bocheng Zou, Kai Zhang, Yao Feng, Fangrui Zhu, Jing Gu, Yiwu Zhong, Yuzhang Shang, Yao Dou, Jaden Park, Jianfeng Gao, Yong Jae Lee, Jianwei Yang

CVPR 2024 ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts Mu Cai, Haotian Liu, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Dennis Park, Yong Jae Lee

NeurIPS 2024 Yo'LLaVA: Your Personalized Language and Vision Assistant Thao Nguyen, Haotian Liu, Yuheng Li, Mu Cai, Utkarsh Ojha, Yong Jae Lee

ICCV 2023 A Sentence Speaks a Thousand Images: Domain Generalization Through Distilling CLIP with Language Guidance Zeyi Huang, Andy Zhou, Zijian Ling, Mu Cai, Haohan Wang, Yong Jae Lee

CVPR 2023 GLIGEN: Open-Set Grounded Text-to-Image Generation Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li, Yong Jae Lee

CVPR 2023 Generalized Decoding for Pixel, Image, and Language Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao

NeurIPSW 2023 Improved Baselines with Visual Instruction Tuning Haotian Liu, Chunyuan Li, Yuheng Li, Yong Jae Lee

ICLR 2023 InPL: Pseudo-Labeling the Inliers First for Imbalanced Semi-Supervised Learning Zhuoran Yu, Yin Li, Yong Jae Lee

NeurIPSW 2023 Investigating the Catastrophic Forgetting in Multimodal Large Language Models Yuexiang Zhai, Shengbang Tong, Xiao Li, Mu Cai, Qing Qu, Yong Jae Lee, Yi Ma

CVPR 2023 Learning Customized Visual Models with Retrieval-Augmented Knowledge Haotian Liu, Kilho Son, Jianwei Yang, Ce Liu, Jianfeng Gao, Yong Jae Lee, Chunyuan Li

NeurIPS 2023 Segment Everything Everywhere All at Once Xueyan Zou, Jianwei Yang, Hao Zhang, Feng Li, Linjie Li, Jianfeng Wang, Lijuan Wang, Jianfeng Gao, Yong Jae Lee

CVPR 2023 Towards Universal Fake Image Detectors That Generalize Across Generative Models Utkarsh Ojha, Yuheng Li, Yong Jae Lee

NeurIPS 2023 Visual Instruction Inversion: Image Editing via Image Prompting Thao Nguyen, Yuheng Li, Utkarsh Ojha, Yong Jae Lee

NeurIPS 2023 Visual Instruction Tuning Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee

NeurIPS 2023 What Knowledge Gets Distilled in Knowledge Distillation? Utkarsh Ojha, Yuheng Li, Anirudh Sundara Rajan, Yingyu Liang, Yong Jae Lee

ECCV 2022 Contrastive Learning for Diverse Disentangled Foreground Generation Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh

NeurIPS 2022 ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models Chunyuan Li, Haotian Liu, Liunian Li, Pengchuan Zhang, Jyoti Aneja, Jianwei Yang, Ping Jin, Houdong Hu, Zicheng Liu, Yong Jae Lee, Jianfeng Gao

WACV 2022 Equine Pain Behavior Classification via Self-Supervised Disentangled Pose Representation Maheen Rashid, Sofia Broomé, Katrina Ask, Elin Hernlund, Pia Haubro Andersen, Hedvig Kjellström, Yong Jae Lee

CVPR 2022 GIRAFFE HD: A High-Resolution 3D-Aware Generative Model Yang Xue, Yuheng Li, Krishna Kumar Singh, Yong Jae Lee

ECCV 2022 Masked Discrimination for Self-Supervised Learning on Point Clouds Haotian Liu, Mu Cai, Yong Jae Lee

CVPR 2022 The Two Dimensions of Worst-Case Training and Their Integrated Effect for Out-of-Domain Generalization Zeyi Huang, Haohan Wang, Dong Huang, Yong Jae Lee, Eric P. Xing

UAI 2022 Toward Learning Human-Aligned Cross-Domain Robust Models by Countering Misaligned Features Haohan Wang, Zeyi Huang, Hanlin Zhang, Yong Jae Lee, Eric P. Xing

ICCV 2021 Collaging Class-Specific GANs for Semantic Image Synthesis Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh

CVPR 2021 Few-Shot Image Generation via Cross-Domain Correspondence Utkarsh Ojha, Yijun Li, Jingwan Lu, Alexei A. Efros, Yong Jae Lee, Eli Shechtman, Richard Zhang

ICLR 2021 Generating Furry Cars: Disentangling Object Shape and Appearance Across Multiple Domains Utkarsh Ojha, Krishna Kumar Singh, Yong Jae Lee

CVPR 2021 Progressive Temporal Feature Alignment Network for Video Inpainting Xueyan Zou, Linjie Yang, Ding Liu, Yong Jae Lee

ICCVW 2021 Seeing the Unseen: Predicting the First-Person Camera Wearer's Location and Pose in Third-Person Scenes Yangming Wen, Krishna Kumar Singh, Markham Anderson, Wei-Pang Jan, Yong Jae Lee

WACV 2021 SinGAN-GIF: Learning a Generative Video Model from a Single GIF Rajat Arora, Yong Jae Lee

WACV 2020 Action Graphs: Weakly-Supervised Action Localization with Graph Convolution Networks Maheen Rashid, Hedvig Kjellstrom, Yong Jae Lee

NeurIPS 2020 Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data Utkarsh Ojha, Krishna Kumar Singh, Cho-Jui Hsieh, Yong Jae Lee

ECCV 2020 Password-Conditioned Anonymization and Deanonymization with Face Identity Transformers Xiuye Gu, Weixin Luo, Michael S. Ryoo, Yong Jae Lee

ICCV 2017 Hide-and-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization Krishna Kumar Singh, Yong Jae Lee

CVPR 2017 Identifying First-Person Camera Wearers in Third-Person Videos Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, Yong Jae Lee, David J. Crandall, Michael S. Ryoo

CVPR 2017 Interspecies Knowledge Transfer for Facial Keypoint Detection Maheen Rashid, Xiuye Gu, Yong Jae Lee

CVPR 2017 Weakly-Supervised Visual Grounding of Phrases with Linguistic Structures Fanyi Xiao, Leonid Sigal, Yong Jae Lee

WACV 2017 Who Moved My Cheese? Automatic Annotation of Rodent Behaviors with Convolutional Neural Networks Zhongzheng Ren, Adriana Noronha Annie, Vogel Ciernia, Yong Jae Lee

ECCV 2016 End-to-End Localization and Ranking for Relative Attributes Krishna Kumar Singh, Yong Jae Lee

CVPR 2016 Track and Segment: An Iterative Unsupervised Approach for Video Object Proposals Fanyi Xiao, Yong Jae Lee

CVPR 2016 Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection Krishna Kumar Singh, Fanyi Xiao, Yong Jae Lee

ICCV 2015 Discovering the Spatial Extent of Relative Attributes Fanyi Xiao, Yong Jae Lee

CVPR 2015 FlowWeb: Joint Image Set Alignment by Weaving Consistent, Pixel-Wise Correspondences Tinghui Zhou, Yong Jae Lee, Stella X. Yu, Alyosha A. Efros

CVPRW 2014 An Introduction to the 3rd Workshop on Egocentric (First-Person) Vision Steve Mann, Kris M. Kitani, Yong Jae Lee, Michael S. Ryoo, Alireza Fathi

NeurIPS 2014 Weakly-Supervised Discovery of Visual Pattern Configurations Hyun Oh Song, Yong Jae Lee, Stefanie Jegelka, Trevor Darrell

ICCV 2013 Style-Aware Mid-Level Representation for Discovering Visual Connections in Space and Time Yong Jae Lee, Alexei A. Efros, Martial Hebert

CVPR 2012 Discovering Important People and Objects for Egocentric Video Summarization Yong Jae Lee, Joydeep Ghosh, Kristen Grauman

ICCV 2011 Key-Segments for Video Object Segmentation Yong Jae Lee, Jaechul Kim, Kristen Grauman

CVPR 2011 Learning the Easy Things First: Self-Paced Visual Category Discovery Yong Jae Lee, Kristen Grauman

CVPR 2010 Collect-Cut: Segmentation with Top-Down Cues Discovered in Multi-Object Images Yong Jae Lee, Kristen Grauman

CVPR 2010 Object-Graphs for Context-Aware Category Discovery Yong Jae Lee, Kristen Grauman

CVPR 2009 Shape Discovery from Unlabeled Image Collections Yong Jae Lee, Kristen Grauman