Koh, Jing Yu

19 publications

ICLR 2025 Dissecting Adversarial Robustness of Multimodal LM Agents Chen Henry Wu, Rishi Rajesh Shah, Jing Yu Koh, Russ Salakhutdinov, Daniel Fried, Aditi Raghunathan
TMLR 2025 Tree Search for Language Model Agents Jing Yu Koh, Stephen Marcus McAleer, Daniel Fried, Ruslan Salakhutdinov
NeurIPSW 2024 Dissecting Adversarial Robustness of Multimodal LM Agents Chen Henry Wu, Rishi Rajesh Shah, Jing Yu Koh, Russ Salakhutdinov, Daniel Fried, Aditi Raghunathan
ECCV 2024 OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web Raghav Kapoor, Yash Parag Butala, Melisa A Russak, Jing Yu Koh, Kiran Kamble, Waseem AlShikh, Ruslan Salakhutdinov
ICLRW 2024 VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks Jing Yu Koh, Robert Lo, Lawrence Jang, Vikram Duvvur, Ming Chong Lim, Po-Yu Huang, Graham Neubig, Shuyan Zhou, Ruslan Salakhutdinov, Daniel Fried
CVPR 2023 A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning Aishwarya Kamath, Peter Anderson, Su Wang, Jing Yu Koh, Alexander Ku, Austin Waters, Yinfei Yang, Jason Baldridge, Zarana Parekh
NeurIPS 2023 Generating Images with Multimodal Language Models Jing Yu Koh, Daniel Fried, Ruslan Salakhutdinov
ICML 2023 Grounding Language Models to Images for Multimodal Inputs and Outputs Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried
NeurIPSW 2023 Multimodal Graph Learning for Generative Tasks Minji Yoon, Jing Yu Koh, Bryan Hooi, Russ Salakhutdinov
AAAI 2023 Simple and Effective Synthesis of Indoor 3D Scenes Jing Yu Koh, Harsh Agrawal, Dhruv Batra, Richard Tucker, Austin Waters, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson
ICCV 2023 VQ3D: Learning a 3D-Aware Generative Model on ImageNet Kyle Sargent, Jing Yu Koh, Han Zhang, Huiwen Chang, Charles Herrmann, Pratul Srinivasan, Jiajun Wu, Deqing Sun
TMLR 2022 Scaling Autoregressive Models for Content-Rich Text-to-Image Generation Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, Zirui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui Wu
ICLR 2022 Vector-Quantized Image Modeling with Improved VQGAN Jiahui Yu, Xin Li, Jing Yu Koh, Han Zhang, Ruoming Pang, James Qin, Alexander Ku, Yuanzhong Xu, Jason Baldridge, Yonghui Wu
CVPR 2021 Cross-Modal Contrastive Learning for Text-to-Image Generation Han Zhang, Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang
ICCV 2021 Pathdreamer: A World Model for Indoor Navigation Jing Yu Koh, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson
ICLR 2021 Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction Wonkwang Lee, Whie Jung, Han Zhang, Ting Chen, Jing Yu Koh, Thomas Huang, Hyungsuk Yoon, Honglak Lee, Seunghoon Hong
WACV 2021 Text-to-Image Generation Grounded by Fine-Grained User Attention Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang
ECCV 2020 SideInfNet: A Deep Neural Network for Semi-Automatic Semantic Segmentation with Side Information Jing Yu Koh, Duc Thanh Nguyen, Quang-Trung Truong, Sai-Kit Yeung, Alexander Binder
IJCAI 2019 Improving Customer Satisfaction in Bike Sharing Systems Through Dynamic Repositioning Supriyo Ghosh, Jing Yu Koh, Patrick Jaillet