CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling

Cite

Text

Yang et al. "CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-93806-1_16

Markdown

[Yang et al. "CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/yang2024eccvw-cmmd/) doi:10.1007/978-3-031-93806-1_16

BibTeX

@inproceedings{yang2024eccvw-cmmd,
  title     = {{CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling}},
  author    = {Yang, Ruihan and Gamper, Hannes and Braun, Sebastian},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {214--226},
  doi       = {10.1007/978-3-031-93806-1_16},
  url       = {https://mlanthology.org/eccvw/2024/yang2024eccvw-cmmd/}
}