Baek, Wooyeol

1 publications

NeurIPSW 2024 Efficient Generative Multimodal Integration (EGMI): Enabling Audio Generation from Text-Image Pairs Through Alignment with Large Language Models Taemin Kim, Wooyeol Baek, Heeseok Oh