A Multimodal AI Dialogue System for Unified Document, Visual, and Audio Interaction

Cite

Text

Feng et al. "A Multimodal AI Dialogue System for Unified Document, Visual, and Audio Interaction." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/1259

Markdown

[Feng et al. "A Multimodal AI Dialogue System for Unified Document, Visual, and Audio Interaction." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/feng2025ijcai-multimodal/) doi:10.24963/IJCAI.2025/1259

BibTeX

@inproceedings{feng2025ijcai-multimodal,
  title     = {{A Multimodal AI Dialogue System for Unified Document, Visual, and Audio Interaction}},
  author    = {Feng, Yujun and Huang, Jingyi and Zhang, Yang},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {11044-11047},
  doi       = {10.24963/IJCAI.2025/1259},
  url       = {https://mlanthology.org/ijcai/2025/feng2025ijcai-multimodal/}
}