Aesthetic Inference for Smart Mobile Devices

Abstract

The convenience of cell-phone cameras has made them one of the most common ways by which people document their lives, whether it is everyday pleasures or celebrations. With thousands of images, it might prove to be a daunting task to organize them by hand. When applying automated algorithms to help us, we would like to have both images that are dear to us but are also of good quality. In this paper we explore the performance of the MobileNet CNN architecture, and the different design (inputs size, and layer depth) choices, in their ability in solving various aesthetic inference task: binary classification, regression, image cropping. We show that the baseline MobileNet architecture achieves near state-of-the-art results for binary classification on the AVA dataset while being more than 10 times smaller and compute efficient. We further show that these models, when trained for fine-grained aesthetics inference, achieve better cropping performance than other aestheticsbased croppers.

Cite

Text

Kucer and Messinger. "Aesthetic Inference for Smart Mobile Devices." IEEE/CVF Winter Conference on Applications of Computer Vision, 2018. doi:10.1109/WACV.2018.00196

Markdown

[Kucer and Messinger. "Aesthetic Inference for Smart Mobile Devices." IEEE/CVF Winter Conference on Applications of Computer Vision, 2018.](https://mlanthology.org/wacv/2018/kucer2018wacv-aesthetic/) doi:10.1109/WACV.2018.00196

BibTeX

@inproceedings{kucer2018wacv-aesthetic,
  title     = {{Aesthetic Inference for Smart Mobile Devices}},
  author    = {Kucer, Michal and Messinger, David W.},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2018},
  pages     = {1764-1773},
  doi       = {10.1109/WACV.2018.00196},
  url       = {https://mlanthology.org/wacv/2018/kucer2018wacv-aesthetic/}
}