Towards Regression-Free Neural Networks for Diverse Compute Platforms

Abstract

With the shift towards on-device deep learning, ensuring a consistent behavior of an AI service across diverse compute platforms becomes tremendously important. Our work tackles the emergent problem of reducing predictive in-consistencies arising as negative flips: test samples that are correctly predicted by a less accurate on-device model, but incorrectly by a more accurate on-cloud one. We introduce REGression constrained Neural Architecture Search (REG-NAS) to design a family of highly accurate models that engender fewer negative flips. REG-NAS consists of two components: (1) A novel architecture constraint that enables a larger on-cloud model to contain all the weights of the smaller on-device one thus maximizing weight sharing. This idea stems from our observation that larger weight sharing among networks leads to similar sample-wise predictions and results in fewer negative flips; (2) A novel search reward that incorporates both Top-1 accuracy and negative flips in the architecture optimization metric. We demonstrate that REG-NAS can successfully find architecture with few negative flips, in three popular architecture search spaces. Compared to the existing state-of-the-art approach [29], REG-NAS leads to 33-48% relative reduction of negative flips.

Cite

Text

Duggal et al. "Towards Regression-Free Neural Networks for Diverse Compute Platforms." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19836-6

Markdown

[Duggal et al. "Towards Regression-Free Neural Networks for Diverse Compute Platforms." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/duggal2022eccv-regressionfree/) doi:10.1007/978-3-031-19836-6

BibTeX

@inproceedings{duggal2022eccv-regressionfree,
  title     = {{Towards Regression-Free Neural Networks for Diverse Compute Platforms}},
  author    = {Duggal, Rahul and Zhou, Hao and Yang, Shuo and Fang, Jun and Xiong, Yuanjun and Xia, Wei},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19836-6},
  url       = {https://mlanthology.org/eccv/2022/duggal2022eccv-regressionfree/}
}