Zhu, Bin

19 publications

ICCV 2025 DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses Yatian Pang, Bin Zhu, Bin Lin, Mingzhe Zheng, Francis E. H. Tay, Ser-Nam Lim, Harry Yang, Li Yuan
ICCV 2025 From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning Pengkun Jiao, Bin Zhu, Jingjing Chen, Chong-Wah Ngo, Yu-Gang Jiang
CVPR 2025 HD-EPIC: A Highly-Detailed Egocentric Video Dataset Toby Perrett, Ahmad Darkhalil, Saptarshi Sinha, Omar Emara, Sam Pollard, Kranti Kumar Parida, Kaiting Liu, Prajwal Gatti, Siddhant Bansal, Kevin Flanagan, Jacob Chalk, Zhifan Zhu, Rhodri Guerrier, Fahd Abdelazim, Bin Zhu, Davide Moltisanti, Michael Wray, Hazel Doughty, Dima Damen
AAAI 2025 Hand1000: Generating Realistic Hands from Text with Only 1, 000 Images Haozhuo Zhang, Bin Zhu, Yu Cao, Yanbin Hao
WACV 2025 Multimodal Interpretable Depression Analysis Using Visual Physiological Audio and Textual Data Puneet Kumar, Shreshtha Misra, Zhuhong Shao, Bin Zhu, Balasubramanian Raman, Xiaobai Li
CVPR 2025 PolarNeXt: Rethink Instance Segmentation with Polar Representation Jiacheng Sun, Xinghong Zhou, Yiqiang Wu, Bin Zhu, Jiaxuan Lu, Yu Qin, Xiaomao Li
ICML 2025 Preference Optimization for Combinatorial Optimization Problems Mingjun Pan, Guanquan Lin, You-Wei Luo, Bin Zhu, Zhien Dai, Lijun Sun, Chun Yuan
AAAI 2025 RAGG: Retrieval-Augmented Grasp Generation Model Zhenhua Tang, Bin Zhu, Yanbin Hao, Chong-Wah Ngo, Richang Hong
WACV 2025 Retrieval Augmented Recipe Generation Guoshan Liu, Hailong Yin, Bin Zhu, Jingjing Chen, Chong-Wah Ngo, Yu-Gang Jiang
ECCV 2024 Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective Fangzhou Song, Bin Zhu, Yanbin Hao, Shuo Wang
ICLR 2024 LanguageBind: Extending Video-Language Pretraining to N-Modality by Language-Based Semantic Alignment Bin Zhu, Bin Lin, Munan Ning, Yang Yan, Jiaxi Cui, Wang HongFa, Yatian Pang, Wenhao Jiang, Junwu Zhang, Zongwei Li, Cai Wan Zhang, Zhifeng Li, Wei Liu, Li Yuan
ECCVW 2024 Video Editing for Video Retrieval Bin Zhu, Kevin Flanagan, Adriano Fragomeni, Michael Wray, Dima Damen
IJCAI 2023 Controlling Neural Style Transfer with Deep Reinforcement Learning Chengming Feng, Jing Hu, Xin Wang, Shu Hu, Bin Zhu, Xi Wu, Hongtu Zhu, Siwei Lyu
CVPRW 2023 Harnessing the Power of Text-Image Contrastive Models for Automatic Detection of Online Misinformation Hao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu
ICCV 2023 Towards Attack-Tolerant Federated Learning via Critical Parameter Analysis Sungwon Han, Sungwon Park, Fangzhao Wu, Sundong Kim, Bin Zhu, Xing Xie, Meeyoung Cha
NeurIPS 2022 EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations Ahmad Darkhalil, Dandan Shan, Bin Zhu, Jian Ma, Amlan Kar, Richard Higgins, Sanja Fidler, David Fouhey, Dima Damen
NeurIPS 2022 TaiSu: A 166m Large-Scale High-Quality Dataset for Chinese Vision-Language Pre-Training Yulong Liu, Guibo Zhu, Bin Zhu, Qi Song, Guojing Ge, Haoran Chen, GuanHui Qiao, Ru Peng, Lingxiang Wu, Jinqiao Wang
WACV 2021 CPM R-CNN: Calibrating Point-Guided Misalignment in Object Detection Bin Zhu, Qing Song, Lu Yang, Zhihui Wang, Chun Liu, Mengjie Hu
WACV 2020 Graph Neural Networks for Image Understanding Based on Multiple Cues: Group Emotion Recognition and Event Recognition as Use Cases Xin Guo, Luisa Polania, Bin Zhu, Charles Boncelet, Kenneth Barner