Why Only Text: Empowering Vision-and-Language Navigation with Multi-Modal Prompts

Cite

Text

Hong et al. "Why Only Text: Empowering Vision-and-Language Navigation with Multi-Modal Prompts." International Joint Conference on Artificial Intelligence, 2024.

Markdown

[Hong et al. "Why Only Text: Empowering Vision-and-Language Navigation with Multi-Modal Prompts." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/hong2024ijcai-only/)

BibTeX

@inproceedings{hong2024ijcai-only,
  title     = {{Why Only Text: Empowering Vision-and-Language Navigation with Multi-Modal Prompts}},
  author    = {Hong, Haodong and Wang, Sen and Huang, Zi and Wu, Qi and Liu, Jiajun},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {839-847},
  url       = {https://mlanthology.org/ijcai/2024/hong2024ijcai-only/}
}