Zero-Shot Video Moment Retrieval via Off-the-Shelf Multimodal Large Language Models

Cite

Text

Xu et al. "Zero-Shot Video Moment Retrieval via Off-the-Shelf Multimodal Large Language Models." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I9.32971

Markdown

[Xu et al. "Zero-Shot Video Moment Retrieval via Off-the-Shelf Multimodal Large Language Models." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/xu2025aaai-zero/) doi:10.1609/AAAI.V39I9.32971

BibTeX

@inproceedings{xu2025aaai-zero,
  title     = {{Zero-Shot Video Moment Retrieval via Off-the-Shelf Multimodal Large Language Models}},
  author    = {Xu, Yifang and Sun, Yunzhuo and Zhai, Benxiang and Li, Ming and Liang, Wenxin and Li, Yang and Du, Sidan},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {8978-8986},
  doi       = {10.1609/AAAI.V39I9.32971},
  url       = {https://mlanthology.org/aaai/2025/xu2025aaai-zero/}
}