Hua, Hang

11 publications

AAAI 2025. Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding. Yunlong Tang, Daiki Shimada, Jing Bi, Mingqian Feng, Hang Hua, Chenliang Xu
CVPR 2025. FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity. Hang Hua, Qing Liu, Lingzhi Zhang, Jing Shi, Soo Ye Kim, Zhifei Zhang, Yilin Wang, Jianming Zhang, Zhe Lin, Jiebo Luo
NeurIPS 2025. Latent Chain-of-Thought for Visual Reasoning. Guohao Sun, Hang Hua, Jian Wang, Jiebo Luo, Sohail Dianat, Majid Rabbani, Raghuveer Rao, Zhiqiang Tao
NeurIPS 2025. MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models. Hang Hua, Ziyun Zeng, Yizhi Song, Yunlong Tang, Liu He, Daniel Aliaga, Wei Xiong, Jiebo Luo
NeurIPS 2025. MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness. Yunlong Tang, Pinxin Liu, Zhangyun Tan, Mingqian Feng, Rui Mao, Chao Huang, Jing Bi, Yunzhong Xiao, Susan Liang, Hang Hua, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Chenliang Xu
AAAI 2025. V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning. Hang Hua, Yunlong Tang, Chenliang Xu, Jiebo Luo
CVPR 2025. VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? Yunlong Tang, Junjia Guo, Hang Hua, Susan Liang, Mingqian Feng, Xinyang Li, Rui Mao, Chao Huang, Jing Bi, Zeliang Zhang, Pooyan Fazli, Chenliang Xu
ECCV 2024. FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction. Hang Hua, Jing Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, John Collomosse, Scott Cohen, Jiebo Luo
NeurIPS 2024. PromptFix: You Prompt and We Fix the Photo. Yongsheng Yu, Ziyun Zeng, Hang Hua, Jianlong Fu, Jiebo Luo
ICCV 2023. PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3. Yushi Hu, Hang Hua, Zhengyuan Yang, Weijia Shi, Noah A. Smith, Jiebo Luo
NeurIPS 2019. Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation. Ke Wang, Hang Hua, Xiaojun Wan