Read, Watch and Scream! Sound Generation from Text and Video

Cite

Text

Jeong et al. "Read, Watch and Scream! Sound Generation from Text and Video." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I17.33934

Markdown

[Jeong et al. "Read, Watch and Scream! Sound Generation from Text and Video." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/jeong2025aaai-read/) doi:10.1609/AAAI.V39I17.33934

BibTeX

@inproceedings{jeong2025aaai-read,
  title     = {{Read, Watch and Scream! Sound Generation from Text and Video}},
  author    = {Jeong, Yujin and Kim, Yunji and Chun, Sanghyuk and Lee, Jiyoung},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {17590--17598},
  doi       = {10.1609/AAAI.V39I17.33934},
  url       = {https://mlanthology.org/aaai/2025/jeong2025aaai-read/}
}