Statistically-Motivated Second-Order Pooling
Abstract
However, the nature of such operations is usually computationally expensive, and resulting vector representation orders of magnitude larger than first-order baselines. Here, by contrast, we introduce a statistically-motivated framework that projects the second-order descriptor into a compact vector while improving the representational power. To this end, we design a parametric vectorization layer, which maps a covariance matrix, known to follow a Wishart distribution, to a vector whose elements can be shown to follow a Chi-square distribution. We then propose to make use of a square-root normalization, which makes the distribution of the resulting representation converge to a Gaussian, with which most classifiers of recent first-order networks complying. As evidenced by our experiments, this lets us outperform the state-of-the-art first-order and second-order models on several benchmark recognition datasets.
Cite
Text
Yu and Salzmann. "Statistically-Motivated Second-Order Pooling." Proceedings of the European Conference on Computer Vision (ECCV), 2018. doi:10.1007/978-3-030-01234-2_37Markdown
[Yu and Salzmann. "Statistically-Motivated Second-Order Pooling." Proceedings of the European Conference on Computer Vision (ECCV), 2018.](https://mlanthology.org/eccv/2018/yu2018eccv-statisticallymotivated/) doi:10.1007/978-3-030-01234-2_37BibTeX
@inproceedings{yu2018eccv-statisticallymotivated,
title = {{Statistically-Motivated Second-Order Pooling}},
author = {Yu, Kaicheng and Salzmann, Mathieu},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2018},
doi = {10.1007/978-3-030-01234-2_37},
url = {https://mlanthology.org/eccv/2018/yu2018eccv-statisticallymotivated/}
}