A Unified Framework for Motion Reasoning and Generation in Human Interaction
Abstract
Recent advancements in large language models (LLMs) have significantly improved their ability to generate natural and contextually appropriate text, enabling more human-like interactions. However, understanding and generating interactive human-like motion, especially involving coordinated interactive motion, remains a challenging problem due to its inherent complexity. To address this, we present MoLaM, the Interactive Motion-LAnguage Model, a unified architecture that jointly processes language and motion modalities for understanding, generating, and controlling interactive motions in multi-turn conversational settings. Unlike prior approaches limited to uni-directional tasks (e.g., text-to-motion or motion-to-text), MoLaM supports a wide range of bi-directional and multi-turn tasks. We also introduce Inter-MT2, a large-scale instruction tuning dataset containing 82.7K multi-turn interactive motion instructions and 153K motion samples, covering tasks such as editing, question answering, and story generation. Leveraging LLMs and motion diffusion models, MoLaM demonstrates strong performance across five interactive motion tasks: motion-to-text, text-to-motion, reaction generation, motion editing, and motion reasoning, outperforming or matching task-specific baselines with a single, unified model.
Cite
Text
Park et al. "A Unified Framework for Motion Reasoning and Generation in Human Interaction." International Conference on Computer Vision, 2025.Markdown
[Park et al. "A Unified Framework for Motion Reasoning and Generation in Human Interaction." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/park2025iccv-unified/)BibTeX
@inproceedings{park2025iccv-unified,
title = {{A Unified Framework for Motion Reasoning and Generation in Human Interaction}},
author = {Park, Jeongeun and Choi, Sungjoon and Yun, Sangdoo},
booktitle = {International Conference on Computer Vision},
year = {2025},
pages = {10698-10707},
url = {https://mlanthology.org/iccv/2025/park2025iccv-unified/}
}