AutoMV: An Autonomous Agent Framework for Real Estate Marketing Video Generation

Abstract

In this paper, we introduce AutoMV, an autonomous agent framework designed for generating real estate marketing videos. The framework integrates a diverse set of existing models into a tool library, allowing the agent to intelligently select and execute the appropriate tools. Given property images and text, the agent decomposes the task into manageable subtasks, generating storyline directives and corresponding camera movement trajectories to guide the video production process. By automatically applying video synthesis techniques and incorporating multimedia elements such as subtitles and background music, the agent transforms static real estate images into dynamic, visually appealing videos, thereby optimizing their impact for digital marketing purposes.

Cite

Text

Wu et al. "AutoMV: An Autonomous Agent Framework for Real Estate Marketing Video Generation." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I28.35377

Markdown

[Wu et al. "AutoMV: An Autonomous Agent Framework for Real Estate Marketing Video Generation." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/wu2025aaai-automv/) doi:10.1609/AAAI.V39I28.35377

BibTeX

@inproceedings{wu2025aaai-automv,
  title     = {{AutoMV: An Autonomous Agent Framework for Real Estate Marketing Video Generation}},
  author    = {Wu, Kuizong and Yuan, Shaozu and Shen, Chang and Xu, Long and Chen, Meng},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {29715-29717},
  doi       = {10.1609/AAAI.V39I28.35377},
  url       = {https://mlanthology.org/aaai/2025/wu2025aaai-automv/}
}