Graph-Guided Architecture Search for Real-Time Semantic Segmentation

Abstract

Designing a lightweight semantic segmentation network often requires researchers to find a trade-off between performance and speed, which is always empirical due to the limited interpretability of neural networks. In order to release researchers from these tedious mechanical trials, we propose a Graph-guided Architecture Search (GAS) pipeline to automatically search real-time semantic segmentation networks. Unlike previous works that use a simplified search space and stack a repeatable cell to form a network, we introduce a novel search mechanism with a new search space where a lightweight model can be effectively explored through the cell-level diversity and latency oriented constraint. Specifically, to produce the cell-level diversity, the cell-sharing constraint is eliminated through the cell-independent manner. Then a graph convolution network (GCN) is seamlessly integrated as a communication mechanism between cells. Finally, a latency-oriented constraint is endowed into the search process to balance the speed and performance. Extensive experiments on Cityscapes and CamVid datasets demonstrate that GAS achieves the new state-of-the-art trade-off between accuracy and speed. In particular, on Cityscapes dataset, GAS achieves the new best performance of 73.5% mIoU with the speed of 108.4 FPS on Titan Xp.

Cite

Text

Lin et al. "Graph-Guided Architecture Search for Real-Time Semantic Segmentation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00426

Markdown

[Lin et al. "Graph-Guided Architecture Search for Real-Time Semantic Segmentation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/lin2020cvpr-graphguided/) doi:10.1109/CVPR42600.2020.00426

BibTeX

@inproceedings{lin2020cvpr-graphguided,
  title     = {{Graph-Guided Architecture Search for Real-Time Semantic Segmentation}},
  author    = {Lin, Peiwen and Sun, Peng and Cheng, Guangliang and Xie, Sirui and Li, Xi and Shi, Jianping},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00426},
  url       = {https://mlanthology.org/cvpr/2020/lin2020cvpr-graphguided/}
}