Revisiting Bilinear Pooling: A Coding Perspective

Abstract

Bilinear pooling has achieved state-of-the-art performance on fusing features in various machine learning tasks, owning to its ability to capture complex associations between features. Despite the success, bilinear pooling suffers from redundancy and burstiness issues, mainly due to the rank-one property of the resulting representation. In this paper, we prove that bilinear pooling is indeed a similarity-based coding-pooling formulation. This establishment then enables us to devise a new feature fusion algorithm, the factorized bilinear coding (FBC) method, to overcome the drawbacks of the bilinear pooling. We show that FBC can generate compact and discriminative representations with substantially fewer parameters. Experiments on two challenging tasks, namely image classification and visual question answering, demonstrate that our method surpasses the bilinear pooling technique by a large margin.

Cite

Text

Gao et al. "Revisiting Bilinear Pooling: A Coding Perspective." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I04.5811

Markdown

[Gao et al. "Revisiting Bilinear Pooling: A Coding Perspective." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/gao2020aaai-revisiting/) doi:10.1609/AAAI.V34I04.5811

BibTeX

@inproceedings{gao2020aaai-revisiting,
  title     = {{Revisiting Bilinear Pooling: A Coding Perspective}},
  author    = {Gao, Zhi and Wu, Yuwei and Zhang, Xiaoxun and Dai, Jindou and Jia, Yunde and Harandi, Mehrtash},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {3954-3961},
  doi       = {10.1609/AAAI.V34I04.5811},
  url       = {https://mlanthology.org/aaai/2020/gao2020aaai-revisiting/}
}