What Do I See? Modeling Human Visual Perception for Multi-Person Tracking

Yan, Xu; Kakadiaris, Ioannis A.; Shah, Shishir K.

doi:10.1007/978-3-319-10605-2_21

What Do I See? Modeling Human Visual Perception for Multi-Person Tracking

Xu Yan, Ioannis A. Kakadiaris, Shishir K. Shah

ECCV 2014 pp. 314-329

doi:10.1007/978-3-319-10605-2_21 /eccv/2014/yan2014eccv-i/

Abstract

This paper presents a novel approach for multi-person tracking utilizing a model motivated by the human vision system. The model predicts human motion based on modeling of perceived information. An attention map is designed to mimic human reasoning that integrates both spatial and temporal information. The spatial component addresses human attention allocation to different areas in a scene and is represented using a retinal mapping based on the log-polar transformation while the temporal component denotes the human attention allocation to subjects with different motion velocity and is modeled as a static-dynamic attention map. With the static-dynamic attention map and retinal mapping, attention driven motion of the tracked target is estimated with a center-surround search mechanism. This perception based motion model is integrated into a data association tracking framework with appearance and motion features. The proposed algorithm tracks a large number of subjects in complex scenes and the evaluation on public datasets show promising improvements over state-of-the-art methods.

PDF ECCV Semantic Scholar

Cite

Text

Yan et al. "What Do I See? Modeling Human Visual Perception for Multi-Person Tracking." European Conference on Computer Vision, 2014. doi:10.1007/978-3-319-10605-2_21

Markdown

[Yan et al. "What Do I See? Modeling Human Visual Perception for Multi-Person Tracking." European Conference on Computer Vision, 2014.](https://mlanthology.org/eccv/2014/yan2014eccv-i/) doi:10.1007/978-3-319-10605-2_21

BibTeX

@inproceedings{yan2014eccv-i,
  title     = {{What Do I See? Modeling Human Visual Perception for Multi-Person Tracking}},
  author    = {Yan, Xu and Kakadiaris, Ioannis A. and Shah, Shishir K.},
  booktitle = {European Conference on Computer Vision},
  year      = {2014},
  pages     = {314-329},
  doi       = {10.1007/978-3-319-10605-2_21},
  url       = {https://mlanthology.org/eccv/2014/yan2014eccv-i/}
}