Park, Se Jin

4 publications

ICML 2025 Long-Form Speech Generation with Spoken Language Models Se Jin Park, Julian Salazar, Aren Jansen, Keisuke Kinoshita, Yong Man Ro, Rj Skerry-Ryan
CVPR 2024 AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro
AAAI 2022 SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro
ICCV 2021 Multi-Modality Associative Bridging Through Memory: Speech Sound Recollected from Face Video Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro