Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

Cite

Text

Tian et al. "Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I6.28327

Markdown

[Tian et al. "Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/tian2024aaai-efficient/) doi:10.1609/AAAI.V38I6.28327

BibTeX

@inproceedings{tian2024aaai-efficient,
  title     = {{Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning}},
  author    = {Tian, Kaibin and Cheng, Yanhua and Liu, Yi and Hou, Xinglin and Chen, Quan and Li, Han},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {5207-5214},
  doi       = {10.1609/AAAI.V38I6.28327},
  url       = {https://mlanthology.org/aaai/2024/tian2024aaai-efficient/}
}