JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation

Abstract

Generative models often treat continuous data and discrete events as separate processes, creating a gap in modeling complex systems where they interact synchronously. To bridge this gap, we introduce $\textbf{JointDiff}$, a novel diffusion framework designed to unify these two processes by simultaneously generating continuous spatio-temporal data and synchronous discrete events. We demonstrate its efficacy in the sports domain by simultaneously modeling multi-agent trajectories and key possession events. This joint modeling is validated with non-controllable generation and two novel controllable generation scenarios: $\textit{weak-possessor-guidance}$, which offers flexible semantic control over game dynamics through a simple list of intended ball possessors, and $\textit{text-guidance}$, which enables fine-grained, language-driven generation. To enable the conditioning with these guidance signals, we introduce $\textbf{CrossGuid}$, an effective conditioning operation for multi-agent domains. We also share a new unified sports benchmark enhanced with textual descriptions for soccer and football datasets. JointDiff achieves state-of-the-art performance, demonstrating that joint modeling is crucial for building realistic and controllable generative models for interactive systems. [Project](https://guillem-cf.github.io/JointDiff/)

Cite

Text

Capellera et al. "JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation." International Conference on Learning Representations, 2026.

Markdown

[Capellera et al. "JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/capellera2026iclr-jointdiff/)

BibTeX

@inproceedings{capellera2026iclr-jointdiff,
  title     = {{JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation}},
  author    = {Capellera, Guillem and Ferraz, Luis and Romano, Antonio Rubio and Alahi, Alexandre and Agudo, Antonio},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/capellera2026iclr-jointdiff/}
}