Wang, Xinlong

39 publications

NeurIPS 2025 Audio-Sync Video Generation with Multi-Stream Temporal Control Shuchen Weng, Haojie Zheng, Zheng Chang, Si Li, Boxin Shi, Xinlong Wang
ICLR 2025 Autoregressive Video Generation Without Vector Quantization Haoge Deng, Ting Pan, Haiwen Diao, Zhengxiong Luo, Yufeng Cui, Huchuan Lu, Shiguang Shan, Yonggang Qi, Xinlong Wang
ICLR 2025 Diffusion Feedback Helps CLIP See Better Wenxuan Wang, Quan Sun, Fan Zhang, Yepeng Tang, Jing Liu, Xinlong Wang
ICCV 2025 EVEv2: Improved Baselines for Encoder-Free Vision-Language Models Haiwen Diao, Xiaotong Li, Yufeng Cui, Yueze Wang, Haoge Deng, Ting Pan, Wenxuan Wang, Huchuan Lu, Xinlong Wang
NeurIPS 2025 End-to-End Vision Tokenizer Tuning Wenxuan Wang, Fan Zhang, Yufeng Cui, Haiwen Diao, Zhuoyan Luo, Huchuan Lu, Jing Liu, Xinlong Wang
ICLR 2025 JudgeLM: Fine-Tuned Large Language Models Are Scalable Judges Lianghui Zhu, Xinggang Wang, Xinlong Wang
NeurIPS 2025 Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-Grained Rewards Honghao Chen, Xingzhou Lou, Xiaokun Feng, Kaiqi Huang, Xinlong Wang
CVPR 2025 You See It, You Got It: Learning 3D Creation on Pose-Free Videos at Scale Baorui Ma, Huachen Gao, Haoge Deng, Zhengxiong Luo, Tiejun Huang, Lulu Tang, Xinlong Wang
NeurIPS 2024 A Simple Image Segmentation Framework via In-Context Examples Yang Liu, Chenchen Jing, Hengtao Li, Muzhi Zhu, Hao Chen, Xinlong Wang, Chunhua Shen
CVPR 2024 CapsFusion: Rethinking Image-Text Data at Scale Qiying Yu, Quan Sun, Xiaosong Zhang, Yufeng Cui, Fan Zhang, Yue Cao, Xinlong Wang, Jingjing Liu
NeurIPS 2024 DenseFusion-1m: Merging Vision Experts for Comprehensive Multimodal Perception Xiaotong Li, Fan Zhang, Haiwen Diao, Yueze Wang, Xinlong Wang, Ling-Yu Duan
ICLR 2024 Emu: Generative Pretraining in Multimodality Quan Sun, Qiying Yu, Yufeng Cui, Fan Zhang, Xiaosong Zhang, Yueze Wang, Hongcheng Gao, Jingjing Liu, Tiejun Huang, Xinlong Wang
CVPR 2024 Generative Multimodal Models Are In-Context Learners Quan Sun, Yufeng Cui, Xiaosong Zhang, Fan Zhang, Qiying Yu, Yueze Wang, Yongming Rao, Jingjing Liu, Tiejun Huang, Xinlong Wang
ICLR 2024 Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching Yang Liu, Muzhi Zhu, Hengtao Li, Hao Chen, Xinlong Wang, Chunhua Shen
ECCV 2024 Region-Native Visual Tokenization Mengyu Wang, Yuyao Huang, Henghui Ding, Xinlong Wang, Tiejun Huang, Yao Zhao, Yunchao Wei, Shuicheng Yan
ECCV 2024 Tokenize Anything via Prompting Ting Pan, Lulu Tang, Xinlong Wang, Shiguang Shan
ICLR 2024 Uni3D: Exploring Unified 3D Representation at Scale Junsheng Zhou, Jinsheng Wang, Baorui Ma, Yu-Shen Liu, Tiejun Huang, Xinlong Wang
NeurIPS 2024 Unleashing the Potential of the Diffusion Model in Few-Shot Semantic Segmentation Muzhi Zhu, Yang Liu, Zekai Luo, Chenchen Jing, Hao Chen, Guangkai Xu, Xinlong Wang, Chunhua Shen
NeurIPS 2024 Unveiling Encoder-Free Vision-Language Models Haiwen Diao, Yufeng Cui, Xiaotong Li, Yueze Wang, Huchuan Lu, Xinlong Wang
CVPR 2024 Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation Wenxuan Wang, Tongtian Yue, Yisi Zhang, Longteng Guo, Xingjian He, Xinlong Wang, Jing Liu
ICML 2024 Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Lianghui Zhu, Bencheng Liao, Qian Zhang, Xinlong Wang, Wenyu Liu, Xinggang Wang
ICCV 2023 Affective Image Filter: Reflecting Emotions from Text to Images Shuchen Weng, Peixuan Zhang, Zheng Chang, Xinlong Wang, Si Li, Boxin Shi
ICLR 2023 Conditional Positional Encodings for Vision Transformers Xiangxiang Chu, Zhi Tian, Bo Zhang, Xinlong Wang, Chunhua Shen
CVPR 2023 EVA: Exploring the Limits of Masked Visual Representation Learning at Scale Yuxin Fang, Wen Wang, Binhui Xie, Quan Sun, Ledell Wu, Xinggang Wang, Tiejun Huang, Xinlong Wang, Yue Cao
NeurIPS 2023 Fine-Grained Visual Prompting Lingfeng Yang, Yueze Wang, Xiang Li, Xinlong Wang, Jian Yang
CVPR 2023 Images Speak in Images: A Generalist Painter for In-Context Visual Learning Xinlong Wang, Wen Wang, Yue Cao, Chunhua Shen, Tiejun Huang
AAAI 2023 Point-Teaching: Weakly Semi-Supervised Object Detection with Point Annotations Yongtao Ge, Qiang Zhou, Xinlong Wang, Chunhua Shen, Zhibin Wang, Hao Li
ICCV 2023 SegGPT: Towards Segmenting Everything in Context Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang
CVPR 2022 FreeSOLO: Learning to Segment Objects Without Annotations Xinlong Wang, Zhiding Yu, Shalini De Mello, Jan Kautz, Anima Anandkumar, Chunhua Shen, Jose M. Alvarez
ECCV 2022 Poseur: Direct Human Pose Regression with Transformers Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang, Anton van den Hengel
CVPR 2021 BoxInst: High-Performance Instance Segmentation with Box Annotations Zhi Tian, Chunhua Shen, Xinlong Wang, Hao Chen
CVPR 2021 Dense Contrastive Learning for Self-Supervised Visual Pre-Training Xinlong Wang, Rufeng Zhang, Chunhua Shen, Tao Kong, Lei Li
AAAI 2021 Diverse Knowledge Distillation for End-to-End Person Search Xinyu Zhang, Xinlong Wang, Jia-Wang Bian, Chunhua Shen, Mingyu You
CVPR 2021 End-to-End Video Instance Segmentation with Transformers Yuqing Wang, Zhaoliang Xu, Xinlong Wang, Chunhua Shen, Baoshan Cheng, Hao Shen, Huaxia Xia
CVPR 2021 FCPose: Fully Convolutional Multi-Person Pose Estimation with Dynamic Instance-Aware Convolutions Weian Mao, Zhi Tian, Xinlong Wang, Chunhua Shen
ECCV 2020 Instance-Aware Embedding for Point Cloud Instance Segmentation Tong He, Yifan Liu, Chunhua Shen, Xinlong Wang, Changming Sun
ECCV 2020 SOLO: Segmenting Objects by Locations Xinlong Wang, Tao Kong, Chunhua Shen, Yuning Jiang, Lei Li
NeurIPS 2020 SOLOv2: Dynamic and Fast Instance Segmentation Xinlong Wang, Rufeng Zhang, Tao Kong, Lei Li, Chunhua Shen
AAAI 2020 Task-Aware Monocular Depth Estimation for 3D Object Detection Xinlong Wang, Wei Yin, Tao Kong, Yuning Jiang, Lei Li, Chunhua Shen