Multimodal Class-Aware Semantic Enhancement Network for Audio-Visual Video Parsing

Cite

Text

Zhao et al. "Multimodal Class-Aware Semantic Enhancement Network for Audio-Visual Video Parsing." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I10.33134

Markdown

[Zhao et al. "Multimodal Class-Aware Semantic Enhancement Network for Audio-Visual Video Parsing." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhao2025aaai-multimodal/) doi:10.1609/AAAI.V39I10.33134

BibTeX

@inproceedings{zhao2025aaai-multimodal,
  title     = {{Multimodal Class-Aware Semantic Enhancement Network for Audio-Visual Video Parsing}},
  author    = {Zhao, Pengcheng and Zhou, Jinxing and Zhao, Yang and Guo, Dan and Chen, Yanxiang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {10448-10456},
  doi       = {10.1609/AAAI.V39I10.33134},
  url       = {https://mlanthology.org/aaai/2025/zhao2025aaai-multimodal/}
}