RDONet: Rate-Distortion Optimized Learned Image Compression with Variable Depth
Abstract
Rate-distortion optimization (RDO) is responsible for large gains in image and video compression. While RDO is a standard tool in traditional image and video coding, it is not yet widely used in novel end-to-end trained neural methods. The major reason is that the decoding function is trained once and does not have free parameters. In this paper, we present RDONet, a network containing state-of-the-art components, which is perceptually optimized and capable of rate-distortion optimization. With this network, we are able to outperform VVC Intra on MS-SSIM and two different perceptual LPIPS metrics. This paper is part of the CLIC challenge, where we participate under the team name RDONet FAU.
Cite
Text
Brand et al. "RDONet: Rate-Distortion Optimized Learned Image Compression with Variable Depth." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022. doi:10.1109/CVPRW56347.2022.00186Markdown
[Brand et al. "RDONet: Rate-Distortion Optimized Learned Image Compression with Variable Depth." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022.](https://mlanthology.org/cvprw/2022/brand2022cvprw-rdonet/) doi:10.1109/CVPRW56347.2022.00186BibTeX
@inproceedings{brand2022cvprw-rdonet,
title = {{RDONet: Rate-Distortion Optimized Learned Image Compression with Variable Depth}},
author = {Brand, Fabian and Fischer, Kristian and Kopte, Alexander and Windsheimer, Marc and Kaup, André},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2022},
pages = {1758-1762},
doi = {10.1109/CVPRW56347.2022.00186},
url = {https://mlanthology.org/cvprw/2022/brand2022cvprw-rdonet/}
}