PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop
Abstract
Large-scale pre-trained video generation models excel in content creation but are not reliable as physically accurate world simulators out of the box. This work studies the process of post-training these models for accurate world modeling through the lens of the simple, yet fundamental, physics task of modeling object freefall. We show state-of-the-art video generation models struggle with this basic task, despite their visually impressive outputs. To remedy this problem, we find that fine-tuning on a relatively small amount of simulated videos is effective in inducing the dropping behavior in the model, and we can further improve results through a novel reward modeling procedure we introduce. Our study also reveals key limitations of post-training in generalization and distribution modeling. Additionally, we release a benchmark for this task that may serve as a useful diagnostic tool for tracking physical accuracy in large-scale video generative model development. Code is available at this repository: https://github.com/vision-x-nyu/pisa-experiments.
Cite
Text
Li et al. "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Li et al. "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/li2025icml-pisa/)BibTeX
@inproceedings{li2025icml-pisa,
title = {{PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop}},
author = {Li, Chenyu and Michel, Oscar and Pan, Xichen and Liu, Sainan and Roberts, Mike and Xie, Saining},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {35685-35709},
volume = {267},
url = {https://mlanthology.org/icml/2025/li2025icml-pisa/}
}