Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning

Cite

Text

Liu et al. "Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I4.28178

Markdown

[Liu et al. "Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/liu2024aaai-improving/) doi:10.1609/AAAI.V38I4.28178

BibTeX

@inproceedings{liu2024aaai-improving,
  title     = {{Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning}},
  author    = {Liu, Zhiyue and Liu, Jinyuan and Ma, Fanrong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {3864-3872},
  doi       = {10.1609/AAAI.V38I4.28178},
  url       = {https://mlanthology.org/aaai/2024/liu2024aaai-improving/}
}