Yu, Shengye

1 publication

CVPR 2024. SonicVisionLM: Playing Sound with Vision Language Models. Zhifeng Xie, Shengye Yu, Qile He, Mengtian Li