Can You Win Everything with a Lottery Ticket?
Abstract
$\textit{Lottery ticket hypothesis}$ (LTH) has demonstrated to yield independently trainable and highly sparse neural networks (a.k.a. $\textit{winning tickets}$), whose test set accuracies can be surprisingly on par or even better than dense models. However, accuracy is far from the only evaluation metric, and perhaps not always the most important one. Hence it might be myopic to conclude that a sparse subnetwork can replace its dense counterpart, even if the accuracy is preserved. Spurred by that, we perform the first comprehensive assessment of lottery tickets from diverse aspects beyond test accuracy, including $\textit{(i)}$ generalization to distribution shifts, $\textit{(ii)}$ prediction uncertainty, $\textit{(iii)}$ interpretability, and $\textit{(iv)}$ geometry of loss landscapes. With extensive experiments across datasets CIFAR-10, CIFAR-100, and ImageNet, model architectures, as well as tens of sparsification methods, we thoroughly characterize the trade-off between model sparsity and the all-dimension model capabilities. We find that an appropriate sparsity (e.g., $20\%\sim99.08\%$) can yield the winning ticket to perform comparably or even better $\textbf{in all above four aspects}$, although some aspects (generalization to certain distribution shifts, and uncertainty) appear more sensitive to the sparsification than others. We term it as a $\texttt{LTH-PASS}$. Overall, our results endorse choosing a good sparse subnetwork of a larger dense model, over directly training a small dense model of similar parameter counts. We hope that our study can offer more in-depth insights on pruning, for researchers and engineers who seek to incorporate sparse neural networks for user-facing deployments. Codes are available in: https://github.com/VITA-Group/LTH-Pass.
Cite
Text
Chen et al. "Can You Win Everything with a Lottery Ticket?." Transactions on Machine Learning Research, 2022.Markdown
[Chen et al. "Can You Win Everything with a Lottery Ticket?." Transactions on Machine Learning Research, 2022.](https://mlanthology.org/tmlr/2022/chen2022tmlr-you/)BibTeX
@article{chen2022tmlr-you,
title = {{Can You Win Everything with a Lottery Ticket?}},
author = {Chen, Tianlong and Zhang, Zhenyu and Wu, Jun and Huang, Randy and Liu, Sijia and Chang, Shiyu and Wang, Zhangyang},
journal = {Transactions on Machine Learning Research},
year = {2022},
url = {https://mlanthology.org/tmlr/2022/chen2022tmlr-you/}
}