A Genetic Programming Approach to Designing Convolutional Neural Network Architectures

Abstract

We propose a method for designing convolutional neural network (CNN) architectures based on Cartesian genetic programming (CGP). In the proposed method, the architectures of CNNs are represented by directed acyclic graphs, in which each node represents highly-functional modules such as convolutional blocks and tensor operations, and each edge represents the connectivity of layers. The architecture is optimized to maximize the classification accuracy for a validation dataset by an evolutionary algorithm. We show that the proposed method can find competitive CNN architectures compared with state-of-the-art methods on the image classification task using CIFAR-10 and CIFAR-100 datasets.

Cite

Text

Suganuma et al. "A Genetic Programming Approach to Designing Convolutional Neural Network Architectures." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/755

Markdown

[Suganuma et al. "A Genetic Programming Approach to Designing Convolutional Neural Network Architectures." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/suganuma2018ijcai-genetic/) doi:10.24963/IJCAI.2018/755

BibTeX

@inproceedings{suganuma2018ijcai-genetic,
  title     = {{A Genetic Programming Approach to Designing Convolutional Neural Network Architectures}},
  author    = {Suganuma, Masanori and Shirakawa, Shinichi and Nagao, Tomoharu},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {5369-5373},
  doi       = {10.24963/IJCAI.2018/755},
  url       = {https://mlanthology.org/ijcai/2018/suganuma2018ijcai-genetic/}
}