Yu, Qihang

31 publications

TMLR 2026 ReVision: Refining Video Diffusion with Explicit 3D Motion Modeling Qihao Liu, Ju He, Qihang Yu, Liang-Chieh Chen, Alan Yuille
ICCV 2025 Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation Sucheng Ren, Qihang Yu, Ju He, Xiaohui Shen, Alan Yuille, Liang-Chieh Chen
NeurIPS 2025 COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation Xueqing Deng, Linjie Yang, Qihang Yu, Ali Athar, Chenglin Yang, Xiaojie Jin, Xiaohui Shen, Liang-Chieh Chen
ICCV 2025 Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens Dongwon Kim, Ju He, Qihang Yu, Chenglin Yang, Xiaohui Shen, Suha Kwak, Liang-Chieh Chen
TMLR 2025 Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting Inkyu Shin, Qihang Yu, Xiaohui Shen, In So Kweon, Kuk-Jin Yoon, Liang-Chieh Chen
ICML 2025 FlowAR: Scale-Wise Autoregressive Image Generation Meets Flow Matching Sucheng Ren, Qihang Yu, Ju He, Xiaohui Shen, Alan Yuille, Liang-Chieh Chen
ICCV 2025 FlowTok: Flowing Seamlessly Across Text and Image Tokens Ju He, Qihang Yu, Qihao Liu, Liang-Chieh Chen
ICCV 2025 Leveraging Panoptic Scene Graph for Evaluating Fine-Grained Text-to-Image Generation Xueqing Deng, Linjie Yang, Qihang Yu, Chenglin Yang, Liang-Chieh Chen
ICCV 2025 Randomized Autoregressive Visual Generation Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, Liang-Chieh Chen
TMLR 2024 A Simple Video Segmenter by Tracking Objects Along Axial Trajectories Ju He, Qihang Yu, Inkyu Shin, Xueqing Deng, Alan Yuille, Xiaohui Shen, Liang-Chieh Chen
NeurIPS 2024 Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization Qihao Liu, Zhanpeng Zeng, Ju He, Qihang Yu, Xiaohui Shen, Liang-Chieh Chen
NeurIPS 2024 An Image Is Worth 32 Tokens for Reconstruction and Generation Qihang Yu, Mark Weber, Xueqing Deng, Xiaohui Shen, Daniel Cremers, Liang-Chieh Chen
CVPR 2024 COCONut: Modernizing COCO Segmentation Xueqing Deng, Qihang Yu, Peng Wang, Xiaohui Shen, Liang-Chieh Chen
TMLR 2024 MaskBit: Embedding-Free Image Generation via Bit Tokens Mark Weber, Lijun Yu, Qihang Yu, Xueqing Deng, Xiaohui Shen, Daniel Cremers, Liang-Chieh Chen
ECCV 2024 Towards Open-Ended Visual Recognition with Large Language Models Qihang Yu, Xiaohui Shen, Liang-Chieh Chen
CVPR 2024 ViTamin: Designing Scalable Vision Models in the Vision-Language Era Jieneng Chen, Qihang Yu, Xiaohui Shen, Alan Yuille, Liang-Chieh Chen
WACV 2024 Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation Inkyu Shin, Dahun Kim, Qihang Yu, Jun Xie, Hong-Seok Kim, Bradley Green, In So Kweon, Kuk-Jin Yoon, Liang-Chieh Chen
ICCV 2023 CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans Jieneng Chen, Yingda Xia, Jiawen Yao, Ke Yan, Jianpeng Zhang, Le Lu, Fakai Wang, Bo Zhou, Mingyan Qiu, Qihang Yu, Mingze Yuan, Wei Fang, Yuxing Tang, Minfeng Xu, Jian Zhou, Yuqian Zhao, Qifeng Wang, Xianghua Ye, Xiaoli Yin, Yu Shi, Xin Chen, Jingren Zhou, Alan Yuille, Zaiyi Liu, Ling Zhang
CVPR 2023 Compositor: Bottom-up Clustering and Compositing for Robust Part and Object Segmentation Ju He, Jieneng Chen, Ming-Xian Lin, Qihang Yu, Alan L. Yuille
NeurIPS 2023 Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, Liang-Chieh Chen
ICLR 2023 MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models Chenglin Yang, Siyuan Qiao, Qihang Yu, Xiaoding Yuan, Yukun Zhu, Alan Yuille, Hartwig Adam, Liang-Chieh Chen
NeurIPS 2023 ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation Shuyang Sun, Weijun Wang, Andrew Howard, Qihang Yu, Philip Torr, Liang-Chieh Chen
CVPR 2022 CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen
ECCV 2022 K-Means Mask Transformer Qihang Yu, Huiyu Wang, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen
ECCV 2022 PartImageNet: A Large, High-Quality Dataset of Parts Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jie-Neng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan Yuille
CVPR 2022 TubeFormer-DeepLab: Video Mask Transformer Dahun Kim, Jun Xie, Huiyu Wang, Siyuan Qiao, Qihang Yu, Hong-Seok Kim, Hartwig Adam, In So Kweon, Liang-Chieh Chen
AAAI 2021 CAKES: Channel-Wise Automatic KErnel Shrinking for Efficient 3D Networks Qihang Yu, Yingwei Li, Jieru Mei, Yuyin Zhou, Alan L. Yuille
NeurIPS 2021 Glance-and-Gaze Vision Transformer Qihang Yu, Yingda Xia, Yutong Bai, Yongyi Lu, Alan L. Yuille, Wei Shen
CVPR 2021 Mask Guided Matting via Progressive Refinement Network Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille
ICLR 2021 Shape-Texture Debiased Neural Network Training Yingwei Li, Qihang Yu, Mingxing Tan, Jieru Mei, Peng Tang, Wei Shen, Alan Yuille, Cihang Xie
AAAI 2020 When Radiology Report Generation Meets Knowledge Graph Yixiao Zhang, Xiaosong Wang, Ziyue Xu, Qihang Yu, Alan L. Yuille, Daguang Xu