Dynamic Recursive Neural Network
Abstract
This paper proposes the dynamic recursive neural network (DRNN), which simplifies the duplicated building blocks in deep neural network. Different from forwarding through different blocks sequentially in previous networks, we demonstrate that the DRNN can achieve better performance with fewer blocks by employing block recursively. We further add a gate structure to each block, which can adaptively decide the loop times of recursive blocks to reduce the computational cost. Since the recursive networks are hard to train, we propose the Loopy Variable Batch Normalization (LVBN) to stabilize the volatile gradient. Further, we improve the LVBN to correct statistical bias caused by the gate structure. Experiments show that the DRNN reduces the parameters and computational cost and while outperforms the original model in term of the accuracy consistently on CIFAR-10 and ImageNet-1k. Lastly we visualize and discuss the relation between image saliency and the number of loop time.
Cite
Text
Guo et al. "Dynamic Recursive Neural Network." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00529Markdown
[Guo et al. "Dynamic Recursive Neural Network." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/guo2019cvpr-dynamic/) doi:10.1109/CVPR.2019.00529BibTeX
@inproceedings{guo2019cvpr-dynamic,
title = {{Dynamic Recursive Neural Network}},
author = {Guo, Qiushan and Yu, Zhipeng and Wu, Yichao and Liang, Ding and Qin, Haoyu and Yan, Junjie},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2019},
doi = {10.1109/CVPR.2019.00529},
url = {https://mlanthology.org/cvpr/2019/guo2019cvpr-dynamic/}
}