CAMOT: Camera Angle-Aware Multi-Object Tracking

Abstract

This paper proposes CAMOT, a simple camera angle estimator for multi-object tracking to tackle two problems: 1) occlusion and 2) inaccurate distance estimation in the depth direction. Under the assumption that multiple objects are located on a flat plane in each video frame, CAMOT estimates the camera angle using object detection. In addition, it gives the depth of each object, enabling pseudo-3D MOT. We evaluated its performance by adding it to various 2D MOT methods on the MOT17 and MOT20 datasets and confirmed its effectiveness. Applying CAMOT to ByteTrack, we obtained 63.8% HOTA, 80.6% MOTA, and 78.5% IDF1 in MOT17, which are state-of-the-art results. Its computational cost is significantly lower than the existing deep-learning-based depth estimators for tracking.

Cite

Text

Limanta et al. "CAMOT: Camera Angle-Aware Multi-Object Tracking." Winter Conference on Applications of Computer Vision, 2024.

Markdown

[Limanta et al. "CAMOT: Camera Angle-Aware Multi-Object Tracking." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/limanta2024wacv-camot/)

BibTeX

@inproceedings{limanta2024wacv-camot,
  title     = {{CAMOT: Camera Angle-Aware Multi-Object Tracking}},
  author    = {Limanta, Felix and Uto, Kuniaki and Shinoda, Koichi},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2024},
  pages     = {6479-6488},
  url       = {https://mlanthology.org/wacv/2024/limanta2024wacv-camot/}
}