ML Anthology
Authors
Search
About
Kim, Taemin
1 publications
NeurIPSW
2024
Efficient Generative Multimodal Integration (EGMI): Enabling Audio Generation from Text-Image Pairs Through Alignment with Large Language Models
Taemin Kim
,
Wooyeol Baek
,
Heeseok Oh