Towards Holistic, Pragmatic and Multimodal Conversational Systems
Abstract
Language acquisition and utilization transcend the mere exchange of lexical units. Visual cues, prosody, gestures, body movements, and context play an undeniably crucial role. Humans naturally communicate multimodally, employing multiple channels and synthesizing information from diverse modalities. My research delves into the characterization and construction of multimodal models that seamlessly integrate data from multiple independent modalities. I will cover recent work that highlights the challenges, achievements, and opportunities towards developing capable multimodal discursive models.
Cite
Text
Madhyastha. "Towards Holistic, Pragmatic and Multimodal Conversational Systems." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I20.30293Markdown
[Madhyastha. "Towards Holistic, Pragmatic and Multimodal Conversational Systems." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/madhyastha2024aaai-holistic/) doi:10.1609/AAAI.V38I20.30293BibTeX
@inproceedings{madhyastha2024aaai-holistic,
title = {{Towards Holistic, Pragmatic and Multimodal Conversational Systems}},
author = {Madhyastha, Pranava},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2024},
pages = {22677},
doi = {10.1609/AAAI.V38I20.30293},
url = {https://mlanthology.org/aaai/2024/madhyastha2024aaai-holistic/}
}