Temporal Coherent Object Flow for Multi-Object Tracking

Abstract

Multi-object tracking is a challenging vision task that requires simultaneous reasoning about object detection and object association. Conventional solutions use frame as the basic unit and typically rely on a motion predictor that exploits the appearance features to associate detected candidates, leading to insufficient adaptability to long-term associations. In this study, we propose a section-based multi-object tracking approach that integrates a temporal coherent Object Flow Tracker (OFTrack), capable of achieving simultaneous multi-frame tracking by treating multiple consecutive frames as the basic processing unit, denoted as a “section”. Our OFTrack boosts the optical flow to the object flow by employing object perception and section-based motion estimation strategies. Object perception adopts object-aware sampling and scale-aware correlation to enable precise target discrimination. Motion estimation models the correlation of different objects in multi-frames via specialized temporal-spatial attention to achieve robust association in very long videos. Additionally, to address the oscillation of unpredictable trajectories in multi-frame estimation, we have designed temporal coherent enhancement including the trajectory masking pre-training and the smoothing constraint on trajectory curves. Comprehensive experiments on several widely used benchmarks demonstrate the superior performance of our approach.

Cite

Text

Song et al. "Temporal Coherent Object Flow for Multi-Object Tracking." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I7.32749

Markdown

[Song et al. "Temporal Coherent Object Flow for Multi-Object Tracking." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/song2025aaai-temporal/) doi:10.1609/AAAI.V39I7.32749

BibTeX

@inproceedings{song2025aaai-temporal,
  title     = {{Temporal Coherent Object Flow for Multi-Object Tracking}},
  author    = {Song, Zikai and Luo, Run and Ma, Lintao and Tang, Ying and Chen, Yi-Ping Phoebe and Yu, Junqing and Yang, Wei},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {6978-6986},
  doi       = {10.1609/AAAI.V39I7.32749},
  url       = {https://mlanthology.org/aaai/2025/song2025aaai-temporal/}
}