Russian Doll Network: Learning Nested Networks for Sample-Adaptive Dynamic Inference
Abstract
This work bridges recent advances in once-for-all (OFA) networks [1] and sample-adaptive dynamic networks. We propose a novel neural architecture dubbed as Russian doll network (RDN). Key differentiators of RDN are two-folds: first, a RDN topologically consists of a few nested sub-networks. Any smaller sub-network is completely embedded in all larger ones in a parameter-sharing manner. The computation flow of a RDN starts from the inner-most (and smallest) sub-network and sequentially executes larger ones according to the nesting order. A larger sub-network can re-use all intermediate features calculated at their inner sub-networks. This crucially ensures that each sub-network can conduct inference independently. Secondly, the nesting order of RDNs naturally plots the sequential neural path of a sample in the network. For an easy sample, much computation can be saved without much sacrifice of accuracy if an early-termination point can be intelligently determined. To this end, we formulate satisfying a specific accuracy-complexity tradeoff as a constrained optimization problem, solved via the Lagrangian multiplier theory. Comprehensive experiments of transforming several base models into RDN on ImageNet clearly demonstrate the superior accuracy-complexity balance of RDN.
Cite
Text
Jiang and Mu. "Russian Doll Network: Learning Nested Networks for Sample-Adaptive Dynamic Inference." IEEE/CVF International Conference on Computer Vision Workshops, 2021. doi:10.1109/ICCVW54120.2021.00042Markdown
[Jiang and Mu. "Russian Doll Network: Learning Nested Networks for Sample-Adaptive Dynamic Inference." IEEE/CVF International Conference on Computer Vision Workshops, 2021.](https://mlanthology.org/iccvw/2021/jiang2021iccvw-russian/) doi:10.1109/ICCVW54120.2021.00042BibTeX
@inproceedings{jiang2021iccvw-russian,
title = {{Russian Doll Network: Learning Nested Networks for Sample-Adaptive Dynamic Inference}},
author = {Jiang, Borui and Mu, Yadong},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2021},
pages = {336-344},
doi = {10.1109/ICCVW54120.2021.00042},
url = {https://mlanthology.org/iccvw/2021/jiang2021iccvw-russian/}
}