Han, Jianhua

37 publications

WACV 2025 DisCo: Discovering Common Affordance from Large Models for Actionable Part Perception Youpeng Wen, Yi Zhu, Zhihao Zhan, Pengzhen Ren, Jianhua Han, Hang Xu, Shen Zhao, Xiaodan Liang
CVPR 2025 EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions Kai Chen, Yunhao Gou, Runhui Huang, Zhili Liu, Daxin Tan, Jing Xu, Chunwei Wang, Yi Zhu, Yihan Zeng, Kuo Yang, Dingdong Wang, Kun Xiang, Haoyuan Li, Haoli Bai, Jianhua Han, Xiaohui Li, Weike Jin, Nian Xie, Yu Zhang, James T. Kwok, Hengshuang Zhao, Xiaodan Liang, Dit-Yan Yeung, Xiao Chen, Zhenguo Li, Wei Zhang, Qun Liu, Lanqing Hong, Lu Hou, Hang Xu
ICLR 2025 G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model Jiahui Gao, Renjie Pi, Jipeng Zhang, Jiacheng Ye, Wanjun Zhong, Yufei Wang, Lanqing Hong, Jianhua Han, Hang Xu, Zhenguo Li, Lingpeng Kong
CVPR 2025 HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models Runhui Huang, Xinpeng Ding, Chunwei Wang, Jianhua Han, Yulong Liu, Hengshuang Zhao, Hang Xu, Lu Hou, Wei Zhang, Xiaodan Liang
ICCV 2025 ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance Chunwei Wang, Guansong Lu, Junwei Yang, Runhui Huang, Jianhua Han, Lu Hou, Wei Zhang, Hang Xu
NeurIPS 2025 SeePhys: Does Seeing Help Thinking? – Benchmarking Vision-Based Physics Reasoning Kun Xiang, Heng Li, Terry Jingchen Zhang, Yinya Huang, Zirong Liu, Peixin Qu, Jixi He, Jiaqi Chen, Yu-Jie Yuan, Jianhua Han, Hang Xu, Hanhui Li, Mrinmaya Sachan, Xiaodan Liang
NeurIPS 2025 Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization Ming Nie, Chunwei Wang, Jianhua Han, Hang Xu, Li Zhang
AAAI 2024 Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images Qingping Zheng, Yuanfan Guo, Jiankang Deng, Jianhua Han, Ying Li, Songcen Xu, Hang Xu
CVPR 2024 DetCLIPv3: Towards Versatile Generative Open-Vocabulary Object Detection Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu
ICLR 2024 Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis Kai Chen, Chunwei Wang, Kuo Yang, Jianhua Han, Lanqing Hong, Fei Mi, Hang Xu, Zhengying Liu, Wenyong Huang, Zhenguo Li, Dit-Yan Yeung, Lifeng Shang
CVPR 2024 Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models Xinpeng Ding, Jianhua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li
ECCV 2024 HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-Fine Pose-Reversible Guidance Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang
ECCV 2024 Implicit Concept Removal of Diffusion Models Zhili Liu, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok
ICLR 2024 Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction Renjie Pi, Lewei Yao, Jianhua Han, Xiaodan Liang, Wei Zhang, Hang Xu
ECCV 2024 LayerDiff: Exploring Text-Guided Multi-Layered Composable Image Synthesis via Layer-Collaborative Diffusion Model Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu
ECCV 2024 PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion Guansong Lu, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei Zhang, Hang Xu
ECCV 2024 Reason2Drive: Towards Interpretable and Chain-Based Reasoning for Autonomous Driving Ming Nie, Renyuan Peng, Chunwei Wang, Xinyue Cai, Jianhua Han, Hang Xu, Li Zhang
NeurIPS 2024 SlowFocus: Enhancing Fine-Grained Temporal Understanding in Video LLM Ming Nie, Dan Ding, Chunwei Wang, Yuanfan Guo, Jianhua Han, Hang Xu, Li Zhang
NeurIPS 2024 UNIT: Unifying Image and Text Recognition in One Vision Encoder Yi Zhu, Yanpeng Zhou, Chunwei Wang, Yang Cao, Jianhua Han, Lu Hou, Hang Xu
NeurIPS 2024 VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation Youpeng Wen, Junfan Lin, Yi Zhu, Jianhua Han, Hang Xu, Shen Zhao, Xiaodan Liang
CVPR 2023 CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-Yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu
CVPR 2023 CapDet: Unifying Dense Captioning and Open-World Detection Pretraining Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang
CVPR 2023 DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-Training via Word-Region Alignment Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu
ICCV 2023 DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu
ICCV 2023 GrowCLIP: Data-Aware Automatic Model Growing for Large-Scale Contrastive Language-Image Pre-Training Xinchi Deng, Han Shi, Runhui Huang, Changlin Li, Hang Xu, Jianhua Han, James Kwok, Shen Zhao, Wei Zhang, Xiaodan Liang
AAAI 2023 NLIP: Noise-Robust Language-Image Pre-Training Runhui Huang, Yanxin Long, Jianhua Han, Hang Xu, Xiwen Liang, Chunjing Xu, Xiaodan Liang
ICLR 2023 Task-Customized Masked Autoencoder via Mixture of Cluster-Conditional Experts Zhili Liu, Kai Chen, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, James Kwok
CVPR 2023 Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving Xiwen Liang, Minzhe Niu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang
ECCV 2022 CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, Dit-Yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu
NeurIPS 2022 DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-Training for Open-World Detection Lewei Yao, Jianhua Han, Youpeng Wen, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Chunjing Xu, Hang Xu
NeurIPS 2022 Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving Xiwen Liang, Yangxin Wu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang
ECCV 2022 Generative Negative Text Replay for Continual Vision-Language Pretraining Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He
AAAI 2022 Laneformer: Object-Aware Row-Column Transformers for Lane Detection Jianhua Han, Xiajun Deng, Xinyue Cai, Zhen Yang, Hang Xu, Chunjing Xu, Xiaodan Liang
CVPR 2022 ONCE-3DLanes: Building Monocular 3D Lane Detection Fan Yan, Ming Nie, Xinyue Cai, Jianhua Han, Hang Xu, Zhen Yang, Chaoqiang Ye, Yanwei Fu, Michael Bi Mi, Li Zhang
ECCV 2022 Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding Quande Liu, Youpeng Wen, Jianhua Han, Chunjing Xu, Hang Xu, Xiaodan Liang
AAAI 2022 Task-Customized Self-Supervised Pre-Training with Scalable Dynamic Routing Zhili Liu, Jianhua Han, Lanqing Hong, Hang Xu, Kai Chen, Chunjing Xu, Zhenguo Li
IJCAI 2017 Aggregating Crowd Wisdoms with Label-Aware Autoencoders Li'ang Yin, Jianhua Han, Weinan Zhang, Yong Yu