Multi-Scale Recursive and Perception-Distortion Controllable Image Super-Resolution
Abstract
We describe our solution for the PIRM Super–Resolution Challenge 2018 where we achieved the \(\varvec{2^nd}\) best perceptual quality for average \(RMSE\leqslant 16\), \(5^th\) best for \(RMSE\leqslant 12.5\), and \(7^th\) best for \(RMSE\leqslant 11.5\). We modify a recently proposed Multi–Grid Back–Projection (MGBP) architecture to work as a generative system with an input parameter that can control the amount of artificial details in the output. We propose a discriminator for adversarial training with the following novel properties: it is multi–scale that resembles a progressive–GAN; it is recursive that balances the architecture of the generator; and it includes a new layer to capture significant statistics of natural images. Finally, we propose a training strategy that avoids conflicts between reconstruction and perceptual losses. Our configuration uses only 281 k parameters and upscales each image of the competition in 0.2 s in average.
Cite
Text
Michelini et al. "Multi-Scale Recursive and Perception-Distortion Controllable Image Super-Resolution." European Conference on Computer Vision Workshops, 2018. doi:10.1007/978-3-030-11021-5_1Markdown
[Michelini et al. "Multi-Scale Recursive and Perception-Distortion Controllable Image Super-Resolution." European Conference on Computer Vision Workshops, 2018.](https://mlanthology.org/eccvw/2018/michelini2018eccvw-multiscale/) doi:10.1007/978-3-030-11021-5_1BibTeX
@inproceedings{michelini2018eccvw-multiscale,
title = {{Multi-Scale Recursive and Perception-Distortion Controllable Image Super-Resolution}},
author = {Michelini, Pablo Navarrete and Zhu, Dan and Liu, Hanwen},
booktitle = {European Conference on Computer Vision Workshops},
year = {2018},
pages = {3-19},
doi = {10.1007/978-3-030-11021-5_1},
url = {https://mlanthology.org/eccvw/2018/michelini2018eccvw-multiscale/}
}