Qian, Kaizhi

13 publications

TMLR 2026. ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning. Bairu Hou, Yang Zhang, Jiabao Ji, Yujian Liu, Kaizhi Qian, Jacob Andreas, Shiyu Chang
ICCV 2025. RapVerse: Coherent Vocals and Whole-Body Motion Generation from Text. Jiaben Chen, Xin Yan, Yihang Chen, Siyuan Cen, Zixin Wang, Qinwei Ma, Haoyu Zhen, Kaizhi Qian, Lie Lu, Chuang Gan
AAAI 2025. UniMuMo: Unified Text, Music, and Motion Generation. Han Yang, Kun Su, Yutong Zhang, Jiaben Chen, Kaizhi Qian, Gaowen Liu, Chuang Gan
ICML 2024. Decomposing Uncertainty for Large Language Models Through Input Clarification Ensembling. Bairu Hou, Yujian Liu, Kaizhi Qian, Jacob Andreas, Shiyu Chang, Yang Zhang
ICML 2024. Speech Self-Supervised Learning Using Diffusion Model Synthetic Data. Heting Gao, Kaizhi Qian, Junrui Ni, Chuang Gan, Mark A. Hasegawa-Johnson, Shiyu Chang, Yang Zhang
ICML 2023. Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning. Zhongzhi Yu, Yang Zhang, Kaizhi Qian, Cheng Wan, Yonggan Fu, Yongan Zhang, Yingyan Celine Lin
CVPR 2023. Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos. Kun Su, Kaizhi Qian, Eli Shlizerman, Antonio Torralba, Chuang Gan
ICML 2022. ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers. Kaizhi Qian, Yang Zhang, Heting Gao, Junrui Ni, Cheng-I Lai, David Cox, Mark Hasegawa-Johnson, Shiyu Chang
NeurIPS 2022. Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing. Yonggan Fu, Yang Zhang, Kaizhi Qian, Zhifan Ye, Zhongzhi Yu, Cheng-I Jeff Lai, Celine Lin
ICML 2021. Global Prosody Style Transfer Without Text Transcriptions. Kaizhi Qian, Yang Zhang, Shiyu Chang, Jinjun Xiong, Chuang Gan, David Cox, Mark Hasegawa-Johnson
NeurIPS 2021. PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David Cox, Jim Glass
ICML 2020. Unsupervised Speech Decomposition via Triple Information Bottleneck. Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson, David Cox
ICML 2019. AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss. Kaizhi Qian, Yang Zhang, Shiyu Chang, Xuesong Yang, Mark Hasegawa-Johnson