Towards Robust Multimodal AU Detection: STN-Enhanced Visual Encoding and Audio-Visual Spatial-Temporal Alignment

Cite

Text

Yu et al. "Towards Robust Multimodal AU Detection: STN-Enhanced Visual Encoding and Audio-Visual Spatial-Temporal Alignment." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Yu et al. "Towards Robust Multimodal AU Detection: STN-Enhanced Visual Encoding and Audio-Visual Spatial-Temporal Alignment." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/yu2025cvprw-robust/)

BibTeX

@inproceedings{yu2025cvprw-robust,
  title     = {{Towards Robust Multimodal AU Detection: STN-Enhanced Visual Encoding and Audio-Visual Spatial-Temporal Alignment}},
  author    = {Yu, Jun and Zhang, Yunxiang and Sun, Fengzhao and Wang, Leilei and Lu, Renjie and Zhu, Lingsi and Lu, Xilong and Zheng, Yang and Wang, Yongqi},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {5725-5732},
  url       = {https://mlanthology.org/cvprw/2025/yu2025cvprw-robust/}
}