Learning an Aesthetic Photo Cropping Cascade
Abstract
Cropping is one of the most fundamental and common operations in image processing for improving the aesthetic quality of photographs. Instead of manually designing rules for cropping, in this paper, we propose a generative model that learns an aesthetic photo cropping cascade from a large database of well-composed images and a dataset containing images with crops generated by expert photographers. Specifically, this model includes cropping priori, intuitive likelihood, compositional likelihood and change likelihood. Our learning exploits a spatial pyramid saliency feature and a multi-level foreground segmentation. The inference is done by efficient sub window search (ESS) [10] which is benefited from the bound at conditional distribution in the cascade. Additionally, for extracting attentional subjects and capturing scene composition, we design an iterative saliency method to model the saliency moving paths, which is beyond the typical saliency model predicting a single attentional region. Experiments show that our approach outperforms the state-of-the-art cropping methods by a large margin.
Cite
Text
Wang et al. "Learning an Aesthetic Photo Cropping Cascade." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015. doi:10.1109/WACV.2015.66Markdown
[Wang et al. "Learning an Aesthetic Photo Cropping Cascade." IEEE/CVF Winter Conference on Applications of Computer Vision, 2015.](https://mlanthology.org/wacv/2015/wang2015wacv-learning/) doi:10.1109/WACV.2015.66BibTeX
@inproceedings{wang2015wacv-learning,
title = {{Learning an Aesthetic Photo Cropping Cascade}},
author = {Wang, Peng and Lin, Zhe and Mech, Radomír},
booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
year = {2015},
pages = {448-455},
doi = {10.1109/WACV.2015.66},
url = {https://mlanthology.org/wacv/2015/wang2015wacv-learning/}
}