Liu, Haohe

7 publications

NeurIPSW 2024 AudioSetCaps: Enriched Audio Captioning Dataset Generation Using Large Audio Language Models Jisheng Bai, Haohe Liu, Mou Wang, Dongyuan Shi, Wenwu Wang, Mark D Plumbley, Woon-Seng Gan, Jianfeng Chen
NeurIPSW 2024 Latent Diffusion Model for Audio: Generation, Quality Enhancement, and Neural Audio Codec Haohe Liu, Wenwu Wang, Mark D Plumbley
AAAI 2024 Learning Temporal Resolution in Spectrogram for Audio Classification Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley
NeurIPSW 2024 Towards Temporally Synchronized Visually Indicated Sounds Through Scale-Adapted Positional Embeddings Xinhao Mei, Gael Le Lan, Haohe Liu, Zhaoheng Ni, Varun K. Nagaraja, Anurag Kumar, Yangyang Shi, Vikas Chandra
ICLRW 2024 WavCraft: Audio Editing and Generation with Large Language Models Jinhua Liang, Huan Zhang, Haohe Liu, Yin Cao, Qiuqiang Kong, Xubo Liu, Wenwu Wang, Mark D Plumbley, Huy Phan, Emmanouil Benetos
ICML 2023 AudioLDM: Text-to-Audio Generation with Latent Diffusion Models Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D Plumbley
NeurIPS 2022 BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis Yichong Leng, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo P. Mandic, Lei He, Xiangyang Li, Tao Qin, Sheng Zhao, Tie-Yan Liu