Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos
Abstract
Understanding the physical world, including object dynamics, material properties, and causal interactions, remains a core challenge in artificial intelligence. Although recent multi-modal large language models (MLLMs) have demonstrated impressive general reasoning capabilities, they still fall short of achieving human-level understanding of physical principles. Existing datasets for physical reasoning either rely on real-world videos, which incur high annotation costs, or on synthetic simulations, which suffer from limited realism and diversity. In this paper, we propose a novel paradigm that leverages glitches in gameplay videos, referring to visual anomalies that violate predefined physical laws, as a rich and scalable supervision source for physical world understanding. We introduce PhysGame, an instruction-tuning dataset containing 140,057 glitch-centric question–answer pairs across five physical domains and sixteen fine-grained categories. To ensure data accuracy, we design a meta-information–guided prompting strategy that utilizes gameplay metadata such as titles and descriptions to guide high-quality QA generation. Complementing PhysGame, we construct GameBench, an expert-annotated benchmark with 880 glitch-identified gameplay videos designed to evaluate physical reasoning capabilities. Extensive experiments show that PhysGame significantly enhances both Game2Real transferability, improving the real-world physical reasoning performance of Qwen2.5-VL by 2.5% on PhysBench, and Game2General transferability, yielding a 1.9% gain on the MVBench benchmark. Moreover, PhysGame-tuned models achieve a 3.7% absolute improvement on GameBench, demonstrating enhanced robustness in detecting physical implausibilities. These results indicate that learning from gameplay anomalies offers a scalable and effective pathway toward advancing physical world understanding in multimodal intelligence.
Cite
Text
Cao et al. "Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos." Transactions on Machine Learning Research, 2026.Markdown
[Cao et al. "Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/cao2026tmlr-order/)BibTeX
@article{cao2026tmlr-order,
title = {{Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos}},
author = {Cao, Meng and Tang, Haoran and Zhao, Haoze and Han, Mingfei and Liu, Ruyang and Sun, Qiang and Chang, Xiaojun and Reid, Ian and Liang, Xiaodan},
journal = {Transactions on Machine Learning Research},
year = {2026},
url = {https://mlanthology.org/tmlr/2026/cao2026tmlr-order/}
}