JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts

Cite

Text

Son et al. "JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I7.32745

Markdown

[Son et al. "JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/son2025aaai-jovale/) doi:10.1609/AAAI.V39I7.32745

BibTeX

@inproceedings{son2025aaai-jovale,
  title     = {{JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts}},
  author    = {Son, Taein and Seo, Soo Won and Kim, Jisong and Lee, Seok Hwan and Choi, Jun Won},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {6940-6949},
  doi       = {10.1609/AAAI.V39I7.32745},
  url       = {https://mlanthology.org/aaai/2025/son2025aaai-jovale/}
}