FlickerFusion: Intra-Trajectory Domain Generalizing Multi-Agent Reinforcement Learning
Abstract
Multi-agent reinforcement learning has demonstrated significant potential in addressing complex cooperative tasks across various real-world applications. However, existing MARL approaches often rely on the restrictive assumption that the number of entities (e.g., agents, obstacles) remains constant between training and inference. This overlooks scenarios where entities are dynamically removed or $\textit{added}$ $\textit{during}$ the inference trajectory—a common occurrence in real-world environments like search and rescue missions and dynamic combat situations. In this paper, we tackle the challenge of intra-trajectory dynamic entity composition under zero-shot out-of-domain (OOD) generalization, where such dynamic changes cannot be anticipated beforehand. Our empirical studies reveal that existing MARL methods suffer $\textit{significant}$ performance degradation and increased uncertainty in these scenarios. In response, we propose FlickerFusion, a novel OOD generalization method that acts as a $\textit{universally}$ applicable augmentation technique for MARL backbone methods. FlickerFusion stochastically drops out parts of the observation space, emulating being in-domain when inferenced OOD. The results show that FlickerFusion not only achieves superior inference rewards but also $\textit{uniquely}$ reduces uncertainty vis-à-vis the backbone, compared to existing methods. Benchmarks, implementations, and model weights are organized and open-sourced at $\texttt{\href{flickerfusion305.github.io}{\textbf{flickerfusion305.github.io}}}$, accompanied by ample demo video renderings.
Cite
Text
Koh et al. "FlickerFusion: Intra-Trajectory Domain Generalizing Multi-Agent Reinforcement Learning." International Conference on Learning Representations, 2025.Markdown
[Koh et al. "FlickerFusion: Intra-Trajectory Domain Generalizing Multi-Agent Reinforcement Learning." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/koh2025iclr-flickerfusion/)BibTeX
@inproceedings{koh2025iclr-flickerfusion,
title = {{FlickerFusion: Intra-Trajectory Domain Generalizing Multi-Agent Reinforcement Learning}},
author = {Koh, Woosung and Oh, Wonbeen and Kim, Siyeol and Shin, Suhin and Kim, Hyeongjin and Jang, Jaein and Lee, Junghyun and Yun, Se-Young},
booktitle = {International Conference on Learning Representations},
year = {2025},
url = {https://mlanthology.org/iclr/2025/koh2025iclr-flickerfusion/}
}