Wang, Hanqin

2 publications

TMLR 2025 EDM-TTS: Efficient Dual-Stage Masked Modeling for Alignment-Free Text-to-Speech Synthesis Nabarun Goswami, Hanqin Wang, Tatsuya Harada
ICLR 2025 T2V2: A Unified Non-Autoregressive Model for Speech Recognition and Synthesis via Multitask Learning Nabarun Goswami, Hanqin Wang, Tatsuya Harada