Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model

Cite

Text

Xu et al. "Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Xu et al. "Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/xu2025cvprw-beyond/)

BibTeX

@inproceedings{xu2025cvprw-beyond,
  title     = {{Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model}},
  author    = {Xu, Lu and Zhu, Sijie and Li, Chunyuan and Kuo, Chia-Wen and Chen, Fan and Wang, Xinyao and Chen, Guang and Du, Dawei and Yuan, Ye and Wen, Longyin},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {503-512},
  url       = {https://mlanthology.org/cvprw/2025/xu2025cvprw-beyond/}
}