A Memory Model Based on the Siamese Network for Long-Term Tracking
Abstract
We propose a novel memory model using deep convolutional features for long-term tracking to handle the challenging issues, including visual deformation or target disappearance. Our memory model is separated into short- and long-term stores inspired by Atkinson-Shiffrin Memory Model (ASMM). In the tracking step, the bounding box of the target is estimated by the Siamese features obtained from both memory stores to accommodate changes in the visual appearance of the target. In the re-detection step, we take features only in the long-term store to alleviate the drift problem. At this time, we adopt a coarse-to-fine strategy to detect the target in the entire image without the dependency of the previous position. In the end, we employ Regional Maximum Activation of Convolutions (R-MAC) as key criteria. Our tracker achieves an F-score of 0.52 on the LTB35 dataset, which is 0.04 higher than the performance of the state-of-the-art algorithm.
Cite
Text
Lee et al. "A Memory Model Based on the Siamese Network for Long-Term Tracking." European Conference on Computer Vision Workshops, 2018. doi:10.1007/978-3-030-11009-3_5Markdown
[Lee et al. "A Memory Model Based on the Siamese Network for Long-Term Tracking." European Conference on Computer Vision Workshops, 2018.](https://mlanthology.org/eccvw/2018/lee2018eccvw-memory/) doi:10.1007/978-3-030-11009-3_5BibTeX
@inproceedings{lee2018eccvw-memory,
title = {{A Memory Model Based on the Siamese Network for Long-Term Tracking}},
author = {Lee, Hankyeol and Choi, Seokeon and Kim, Changick},
booktitle = {European Conference on Computer Vision Workshops},
year = {2018},
pages = {100-115},
doi = {10.1007/978-3-030-11009-3_5},
url = {https://mlanthology.org/eccvw/2018/lee2018eccvw-memory/}
}