Shi, Jing

27 publications

ICCV 2025 DiffTell: A High-Quality Dataset for Describing Image Manipulation Changes Zonglin Di, Jing Shi, Yifei Fan, Hao Tan, Alexander Black, John Collomosse, Yang Liu
CVPR 2025 FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity Hang Hua, Qing Liu, Lingzhi Zhang, Jing Shi, Soo Ye Kim, Zhifei Zhang, Yilin Wang, Jianming Zhang, Zhe Lin, Jiebo Luo
ICCV 2025 Improving Large Vision and Language Models by Learning from a Panel of Peers Jefferson Hernandez, Jing Shi, Simon Jenni, Vicente Ordonez, Kushal Kafle
AAAI 2025 Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters WenZheng Zhang, Yang Hu, Jing Shi, Xiaoying Bai
CVPR 2025 The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique like Photographers Daiqing Qi, Handong Zhao, Jing Shi, Simon Jenni, Yifei Fan, Franck Dernoncourt, Scott Cohen, Sheng Li
ICML 2025 Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage Saehyung Lee, Seunghyun Yoon, Trung Bui, Jing Shi, Sungroh Yoon
NeurIPS 2025 Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference Jiayi Yuan, Hao Li, Xinheng Ding, Wenya Xie, Yu-Jhe Li, Wentian Zhao, Kun Wan, Jing Shi, Xia Hu, Zirui Liu
CVPR 2025 Visual Persona: Foundation Model for Full-Body Human Customization Jisu Nam, Soowon Son, Zhan Xu, Jing Shi, Difan Liu, Feng Liu, Seungryong Kim, Yang Zhou
CVPR 2025 Yo'Chameleon: Personalized Vision and Language Generation Thao Nguyen, Krishna Kumar Singh, Jing Shi, Trung Bui, Yong Jae Lee, Yuheng Li
NeurIPSW 2024 AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation Kai Wang, Shijian Deng, Jing Shi, Dimitrios Hatzinakos, Yapeng Tian
WACV 2024 Content-Aware Image Color Editing with Auxiliary Color Restoration Tasks Yixuan Ren, Jing Shi, Zhifei Zhang, Yifei Fan, Zhe Lin, Bo He, Abhinav Shrivastava
ECCV 2024 Customize-a-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models Yixuan Ren, Yang Zhou, Jimei Yang, Jing Shi, Difan Liu, Feng Liu, Mingi Kwon, Abhinav Shrivastava
ECCV 2024 FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction Hang Hua, Jing Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, John Collomosse, Scott Cohen, Jiebo Luo
CVPR 2024 InstantBooth: Personalized Text-to-Image Generation Without Test-Time Finetuning Jing Shi, Wei Xiong, Zhe Lin, Hyun Joon Jung
AAAI 2024 VIXEN: Visual Text Comparison Network for Image Difference Captioning Alexander Black, Jing Shi, Yifei Fan, Tu Bui, John P. Collomosse
CVPR 2022 SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Color Editing Jing Shi, Ning Xu, Haitian Zheng, Alex Smith, Jiebo Luo, Chenliang Xu
ICCV 2021 A Simple Baseline for Weakly-Supervised Scene Graph Generation Jing Shi, Yiwu Zhong, Ning Xu, Yin Li, Chenliang Xu
WACV 2021 How to Make a BLT Sandwich? Learning VQA Towards Understanding Web Instructional Videos Shaojie Wang, Wentian Zhao, Ziyi Kou, Jing Shi, Chenliang Xu
ICCV 2021 Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism Wentao Jiang, Ning Xu, Jiayun Wang, Chen Gao, Jing Shi, Zhe Lin, Si Liu
CVPR 2021 Learning by Planning: Language-Guided Global Image Editing Jing Shi, Ning Xu, Yihang Xu, Trung Bui, Franck Dernoncourt, Chenliang Xu
ICCV 2021 Learning to Generate Scene Graph from Natural Language Supervision Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li
NeurIPS 2020 Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals Jing Shi, Xuankai Chang, Pengcheng Guo, Shinji Watanabe, Yusuke Fujita, Jiaming Xu, Bo Xu, Lei Xie
CVPRW 2019 Audio-Visual Event Localization in the Wild Yapeng Tian, Jing Shi, Bochen Li, Zhiyao Duan, Chenliang Xu
IJCAI 2019 GAN-EM: GAN Based EM Learning Framework Wentian Zhao, Shaojie Wang, Zhihuai Xie, Jing Shi, Chenliang Xu
ECCV 2018 Audio-Visual Event Localization in Unconstrained Videos Yapeng Tian, Jing Shi, Bochen Li, Zhiyao Duan, Chenliang Xu
IJCAI 2018 Listen, Think and Listen Again: Capturing Top-Down Auditory Attention for Speaker-Independent Speech Separation Jing Shi, Jiaming Xu, Guangcan Liu, Bo Xu
AAAI 2018 Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment Jiaming Xu, Jing Shi, Guangcan Liu, Xiuyi Chen, Bo Xu