Multi-View Harmonized Bilinear Network for 3D Object Recognition
Abstract
View-based methods have achieved considerable success in $3$D object recognition tasks. Different from existing view-based methods pooling the view-wise features, we tackle this problem from the perspective of patches-to-patches similarity measurement. By exploiting the relationship between polynomial kernel and bilinear pooling, we obtain an effective $3$D object representation by aggregating local convolutional features through bilinear pooling. Meanwhile, we harmonize different components inherited in the pooled bilinear feature to obtain a more discriminative representation for a $3$D object. To achieve an end-to-end trainable framework, we incorporate the harmonized bilinear pooling operation as a layer of a network, constituting the proposed Multi-view Harmonized Bilinear Network (MHBN). Systematic experiments conducted on two public benchmark datasets demonstrate the efficacy of the proposed methods in $3$D object recognition.
Cite
Text
Yu et al. "Multi-View Harmonized Bilinear Network for 3D Object Recognition." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00027Markdown
[Yu et al. "Multi-View Harmonized Bilinear Network for 3D Object Recognition." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/yu2018cvpr-multiview/) doi:10.1109/CVPR.2018.00027BibTeX
@inproceedings{yu2018cvpr-multiview,
title = {{Multi-View Harmonized Bilinear Network for 3D Object Recognition}},
author = {Yu, Tan and Meng, Jingjing and Yuan, Junsong},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2018},
doi = {10.1109/CVPR.2018.00027},
url = {https://mlanthology.org/cvpr/2018/yu2018cvpr-multiview/}
}