Deep Stereo Image Compression via Bi-Directional Coding
Abstract
Existing learning-based stereo compression methods usually adopt a unidirectional approach to encoding one image independently and the other image conditioned upon the first. This paper proposes a novel bi-directional coding-based end-to-end stereo image compression network (BCSIC-Net). BCSIC-Net consists of a novel bi-directional contextual transform module which performs nonlinear transform conditioned upon the inter-view context in a latent space to reduce inter-view redundancy, and a bi-directional conditional entropy model that employs inter-view correspondence as a conditional prior to improve coding efficiency. Experimental results on the InStereo2K and KITTI datasets demonstrate that the proposed BCSIC-Net can effectively reduce the inter-view redundancy and outperforms state-of-the-art methods.
Cite
Text
Lei et al. "Deep Stereo Image Compression via Bi-Directional Coding." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.01905Markdown
[Lei et al. "Deep Stereo Image Compression via Bi-Directional Coding." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/lei2022cvpr-deep/) doi:10.1109/CVPR52688.2022.01905BibTeX
@inproceedings{lei2022cvpr-deep,
title = {{Deep Stereo Image Compression via Bi-Directional Coding}},
author = {Lei, Jianjun and Liu, Xiangrui and Peng, Bo and Jin, Dengchao and Li, Wanqing and Gu, Jingxiao},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2022},
pages = {19669-19678},
doi = {10.1109/CVPR52688.2022.01905},
url = {https://mlanthology.org/cvpr/2022/lei2022cvpr-deep/}
}