Wave-Wise Discriminative Tracking by Phase-Amplitude Separation, Augmentation and Mixture

Abstract

Distinguishing key features in complex visual tasks is challenging. A novel approach treats image patches (tokens) as waves. By using both phase and amplitude, it captures richer semantics and specific invariances compared to pixel-based methods, and allows for feature fusion across regions for a holistic image representation. Based on this, we propose the Wave-wise Discriminative Transformer Tracker (WDT). During tracking, WDT represents features via phase-amplitude separation, enhancement, and mixture. First, we designed a Mutual Exclusive Phase-Amplitude Extractor (MEPAE) to separate phase and amplitude features with distinct semantics, representing spatial target info and background brightness respectively. Then, Wave-wise Feature Augmentation is carried out with two submodules: Phase-Amplitude Feature Augmentation and Mixture. The augmentation module disrupts the separated features in the same batch, and the mixture module recombines them to generate positive and negative waves. The original features are aggregated into the original wave. Positive waves have the same phase but different amplitudes, and negative waves have different phase components. Finally, self-supervised and tracking-supervised losses guide the global and local representation learning for original, positive, and negative waves, enhancing wave-level discrimination. Experiments on five benchmarks prove the effectiveness of our method.

Cite

Text

Tan et al. "Wave-Wise Discriminative Tracking by Phase-Amplitude Separation, Augmentation and Mixture." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/215

Markdown

[Tan et al. "Wave-Wise Discriminative Tracking by Phase-Amplitude Separation, Augmentation and Mixture." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/tan2025ijcai-wave/) doi:10.24963/IJCAI.2025/215

BibTeX

@inproceedings{tan2025ijcai-wave,
  title     = {{Wave-Wise Discriminative Tracking by Phase-Amplitude Separation, Augmentation and Mixture}},
  author    = {Tan, Huibin and Cao, Mingyu and Hu, Kun and He, Xihuai and Wang, Zhe and Li, Hao and Lan, Long and Wang, Mengzhu},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {1927-1935},
  doi       = {10.24963/IJCAI.2025/215},
  url       = {https://mlanthology.org/ijcai/2025/tan2025ijcai-wave/}
}