Using 3D Scene Structure to Improve Tracking

Abstract

In this work we consider the problem of tracking objects from a moving airborne platform in wide area surveillance through long occlusions and/or when their motion is unpredictable. The main idea is to take advantage of the known 3D scene structure to estimate a dynamic occlusion map, and to use the occlusion map to determine traffic entry and exit into these zones, which we call sources and sinks. Then the track linking problem is formulated as an alignment of sequences of tracks entering a sink and leaving a source. The sequence alignment problem is solved optimally and efficiently using dynamic programming. We have evaluated our algorithm on a vehicle tracking task in wide area motion imagery and have shown that track fragmentation is significantly decreased and outperforms the Hungarian algorithm.

Cite

Text

Prokaj and Medioni. "Using 3D Scene Structure to Improve Tracking." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2011. doi:10.1109/CVPR.2011.5995593

Markdown

[Prokaj and Medioni. "Using 3D Scene Structure to Improve Tracking." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2011.](https://mlanthology.org/cvpr/2011/prokaj2011cvpr-using/) doi:10.1109/CVPR.2011.5995593

BibTeX

@inproceedings{prokaj2011cvpr-using,
  title     = {{Using 3D Scene Structure to Improve Tracking}},
  author    = {Prokaj, Jan and Medioni, Gérard G.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2011},
  pages     = {1337-1344},
  doi       = {10.1109/CVPR.2011.5995593},
  url       = {https://mlanthology.org/cvpr/2011/prokaj2011cvpr-using/}
}