Liu, Xubo

9 publications

ICML 2025 ALMTokenizer: A Low-Bitrate and Semantic-Rich Audio Codec Tokenizer for Audio Language Modeling Dongchao Yang, Songxiang Liu, Haohan Guo, Jiankun Zhao, Yuanyuan Wang, Helin Wang, Zeqian Ju, Xubo Liu, Xueyuan Chen, Xu Tan, Xixin Wu, Helen M. Meng
NeurIPS 2025 MoME: Mixture of Matryoshka Experts for Audio-Visual Speech Recognition Umberto Cappellazzo, Minsu Kim, Pingchuan Ma, Honglie Chen, Xubo Liu, Stavros Petridis, Maja Pantic
ICLR 2025 Scaling Transformers for Low-Bitrate High-Quality Speech Coding Julian D Parker, Anton Smirnov, Jordi Pons, Cj Carr, Zack Zukowski, Zach Evans, Xubo Liu
AAAI 2024 Learning Temporal Resolution in Spectrogram for Audio Classification Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley
NeurIPSW 2024 Sound-VECaps: Improving Audio Generation with Visual Enhanced Captions Yi Yuan, Dongya Jia, Xiaobin Zhuang, Yuanzhe Chen, Zhengxi Liu, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xubo Liu, Xiyuan Kang, Mark D Plumbley, Wenwu Wang
ICLRW 2024 WavCraft: Audio Editing and Generation with Large Language Models Jinhua Liang, Huan Zhang, Haohe Liu, Yin Cao, Qiuqiang Kong, Xubo Liu, Wenwu Wang, Mark D Plumbley, Huy Phan, Emmanouil Benetos
ICML 2023 AudioLDM: Text-to-Audio Generation with Latent Diffusion Models Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D Plumbley
AAAI 2023 Personalized Dialogue Generation with Persona-Adaptive Attention Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang
CVPR 2023 SynthVSR: Scaling up Visual Speech Recognition with Synthetic Supervision Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jachym Kolar, Stavros Petridis, Maja Pantic, Christian Fuegen