Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification

Cite

Text

Awan et al. "Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-93806-1_15

Markdown

[Awan et al. "Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/awan2024eccvw-attendfusion/) doi:10.1007/978-3-031-93806-1_15

BibTeX

@inproceedings{awan2024eccvw-attendfusion,
  title     = {{Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification}},
  author    = {Awan, Mahrukh and Nadeem, Asmar and Awan, Muhammad Junaid and Mustafa, Armin and Husain, Syed Sameed},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {195--213},
  doi       = {10.1007/978-3-031-93806-1_15},
  url       = {https://mlanthology.org/eccvw/2024/awan2024eccvw-attendfusion/}
}