ML Anthology
Authors
Search
About
Shi, Jing
27 publications
ICCV
2025
DiffTell: A High-Quality Dataset for Describing Image Manipulation Changes
Zonglin Di
,
Jing Shi
,
Yifei Fan
,
Hao Tan
,
Alexander Black
,
John Collomosse
,
Yang Liu
CVPR
2025
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Hang Hua
,
Qing Liu
,
Lingzhi Zhang
,
Jing Shi
,
Soo Ye Kim
,
Zhifei Zhang
,
Yilin Wang
,
Jianming Zhang
,
Zhe Lin
,
Jiebo Luo
ICCV
2025
Improving Large Vision and Language Models by Learning from a Panel of Peers
Jefferson Hernandez
,
Jing Shi
,
Simon Jenni
,
Vicente Ordonez
,
Kushal Kafle
AAAI
2025
Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters
WenZheng Zhang
,
Yang Hu
,
Jing Shi
,
Xiaoying Bai
CVPR
2025
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique like Photographers
Daiqing Qi
,
Handong Zhao
,
Jing Shi
,
Simon Jenni
,
Yifei Fan
,
Franck Dernoncourt
,
Scott Cohen
,
Sheng Li
ICML
2025
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage
Saehyung Lee
,
Seunghyun Yoon
,
Trung Bui
,
Jing Shi
,
Sungroh Yoon
NeurIPS
2025
Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference
Jiayi Yuan
,
Hao Li
,
Xinheng Ding
,
Wenya Xie
,
Yu-Jhe Li
,
Wentian Zhao
,
Kun Wan
,
Jing Shi
,
Xia Hu
,
Zirui Liu
CVPR
2025
Visual Persona: Foundation Model for Full-Body Human Customization
Jisu Nam
,
Soowon Son
,
Zhan Xu
,
Jing Shi
,
Difan Liu
,
Feng Liu
,
Seungryong Kim
,
Yang Zhou
CVPR
2025
Yo'Chameleon: Personalized Vision and Language Generation
Thao Nguyen
,
Krishna Kumar Singh
,
Jing Shi
,
Trung Bui
,
Yong Jae Lee
,
Yuheng Li
NeurIPSW
2024
AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation
Kai Wang
,
Shijian Deng
,
Jing Shi
,
Dimitrios Hatzinakos
,
Yapeng Tian
WACV
2024
Content-Aware Image Color Editing with Auxiliary Color Restoration Tasks
Yixuan Ren
,
Jing Shi
,
Zhifei Zhang
,
Yifei Fan
,
Zhe Lin
,
Bo He
,
Abhinav Shrivastava
ECCV
2024
Customize-a-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
Yixuan Ren
,
Yang Zhou
,
Jimei Yang
,
Jing Shi
,
Difan Liu
,
Feng Liu
,
Mingi Kwon
,
Abhinav Shrivastava
ECCV
2024
FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction
Hang Hua
,
Jing Shi
,
Kushal Kafle
,
Simon Jenni
,
Daoan Zhang
,
John Collomosse
,
Scott Cohen
,
Jiebo Luo
CVPR
2024
InstantBooth: Personalized Text-to-Image Generation Without Test-Time Finetuning
Jing Shi
,
Wei Xiong
,
Zhe Lin
,
Hyun Joon Jung
AAAI
2024
VIXEN: Visual Text Comparison Network for Image Difference Captioning
Alexander Black
,
Jing Shi
,
Yifei Fan
,
Tu Bui
,
John P. Collomosse
CVPR
2022
SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Color Editing
Jing Shi
,
Ning Xu
,
Haitian Zheng
,
Alex Smith
,
Jiebo Luo
,
Chenliang Xu
ICCV
2021
A Simple Baseline for Weakly-Supervised Scene Graph Generation
Jing Shi
,
Yiwu Zhong
,
Ning Xu
,
Yin Li
,
Chenliang Xu
WACV
2021
How to Make a BLT Sandwich? Learning VQA Towards Understanding Web Instructional Videos
Shaojie Wang
,
Wentian Zhao
,
Ziyi Kou
,
Jing Shi
,
Chenliang Xu
ICCV
2021
Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism
Wentao Jiang
,
Ning Xu
,
Jiayun Wang
,
Chen Gao
,
Jing Shi
,
Zhe Lin
,
Si Liu
CVPR
2021
Learning by Planning: Language-Guided Global Image Editing
Jing Shi
,
Ning Xu
,
Yihang Xu
,
Trung Bui
,
Franck Dernoncourt
,
Chenliang Xu
ICCV
2021
Learning to Generate Scene Graph from Natural Language Supervision
Yiwu Zhong
,
Jing Shi
,
Jianwei Yang
,
Chenliang Xu
,
Yin Li
NeurIPS
2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals
Jing Shi
,
Xuankai Chang
,
Pengcheng Guo
,
Shinji Watanabe
,
Yusuke Fujita
,
Jiaming Xu
,
Bo Xu
,
Lei Xie
CVPRW
2019
Audio-Visual Event Localization in the Wild
Yapeng Tian
,
Jing Shi
,
Bochen Li
,
Zhiyao Duan
,
Chenliang Xu
IJCAI
2019
GAN-EM: GAN Based EM Learning Framework
Wentian Zhao
,
Shaojie Wang
,
Zhihuai Xie
,
Jing Shi
,
Chenliang Xu
ECCV
2018
Audio-Visual Event Localization in Unconstrained Videos
Yapeng Tian
,
Jing Shi
,
Bochen Li
,
Zhiyao Duan
,
Chenliang Xu
IJCAI
2018
Listen, Think and Listen Again: Capturing Top-Down Auditory Attention for Speaker-Independent Speech Separation
Jing Shi
,
Jiaming Xu
,
Guangcan Liu
,
Bo Xu
AAAI
2018
Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment
Jiaming Xu
,
Jing Shi
,
Guangcan Liu
,
Xiuyi Chen
,
Bo Xu