Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking

ShiJie Sun, Naveed Akhtar, XiangYu Song, HuanSheng Song, Ajmal Mian, Mubarak Shah

ECCV 2020

doi:10.1007/978-3-030-58586-0_37

Abstract

Deep learning-based Multiple Object Tracking (MOT) currently relies on off-the-shelf detectors for tracking-by-detection. This results in deep models that are detector-biased and evaluations that are detector-influenced. To resolve this issue, we introduce the Deep Motion Modeling Network (DMM-Net), which can estimate multiple objects' motion parameters to perform joint detection and association in an end-to-end manner. DMM-Net models object features over multiple frames and simultaneously infers object classes, visibility, and motion parameters. These outputs are readily used to update the tracklets for efficient MOT. DMM-Net achieves a PR-MOTA score of 12.80 at 120+ fps on the popular UA-DETRAC challenge, which is better performance and orders of magnitude faster. We also contribute Omni-MOT, a large-scale synthetic public dataset for vehicle tracking that provides precise ground-truth annotations to eliminate the detector influence in MOT evaluation. This dataset of 14M+ frames is extendable with our public script. We demonstrate the suitability of Omni-MOT for deep learning with DMM-Net, and also make the source code of our network public.

Cite

Text

Sun et al. "Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58586-0_37

Markdown

[Sun et al. "Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/sun2020eccv-simultaneous/) doi:10.1007/978-3-030-58586-0_37

BibTeX

@inproceedings{sun2020eccv-simultaneous,
  title     = {{Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking}},
  author    = {Sun, ShiJie and Akhtar, Naveed and Song, XiangYu and Song, HuanSheng and Mian, Ajmal and Shah, Mubarak},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2020},
  doi       = {10.1007/978-3-030-58586-0_37},
  url       = {https://mlanthology.org/eccv/2020/sun2020eccv-simultaneous/}
}