Zhang, Yiyuan

12 publications

ICCV 2025 Breaking the Encoder Barrier for Seamless Video-Language Understanding Handong Li, Yiyuan Zhang, Longteng Guo, Xiangyu Yue, Jing Liu
ICCV 2025 FairGen: Enhancing Fairness in Text-to-Image Diffusion Models via Self-Discovering Latent Directions Yilei Jiang, Wei-Hong Li, Yiyuan Zhang, Minghong Cai, Xiangyu Yue
ICCV 2025 Learning Beyond Still Frames: Scaling Vision-Language Models with Video Yiyuan Zhang, Handong Li, Jing Liu, Xiangyu Yue
ICCV 2025 MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing Langyu Wang, Bingke Zhu, Yingying Chen, Yiyuan Zhang, Ming Tang, Jinqiao Wang
NeurIPS 2025 Native-Resolution Image Synthesis ZiDong Wang, Lei Bai, Xiangyu Yue, Wanli Ouyang, Yiyuan Zhang
ICCV 2025 Scaling Omni-Modal Pretraining with Multimodal Context: Advancing Universal Representation Learning Across Modalities Yiyuan Zhang, Handong Li, Jing Liu, Xiangyu Yue
CVPR 2024 Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities Yiyuan Zhang, Xiaohan Ding, Kaixiong Gong, Yixiao Ge, Ying Shan, Xiangyu Yue
CVPR 2024 OneLLM: One Framework to Align All Modalities with Language Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue
ECCV 2024 Online Vectorized HD Map Construction Using Geometry Zhixin Zhang, Yiyuan Zhang, Xiaohan Ding, Fusheng Jin, Xiangyu Yue
CVPR 2024 Text-to-3D Generation with Bidirectional Diffusion Using Both 2D and 3D Priors Lihe Ding, Shaocong Dong, Zhanpeng Huang, Zibin Wang, Yiyuan Zhang, Kaixiong Gong, Dan Xu, Tianfan Xue
CVPR 2024 UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition Xiaohan Ding, Yiyuan Zhang, Yixiao Ge, Sijie Zhao, Lin Song, Xiangyu Yue, Ying Shan
ECCV 2022 Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-Identification Yiyuan Zhang, Sanyuan Zhao, Yuhao Kang, Jianbing Shen