Deep Homography for Efficient Stereo Image Compression

Abstract

In this paper, we propose HESIC, an end-to-end trainable deep network for stereo image compression (SIC). To fully explore the mutual information across two stereo images, we use a deep regression model to estimate the homography matrix, i.e., H matrix. Then, the left image is spatially transformed by the H matrix, and only the residual information between the left and right images is encoded to save bit-rates. A two-branch auto-encoder architecture is adopted in HESIC, corresponding to the left and right images, respectively. For entropy coding, we propose two conditional stereo entropy models, i.e., Gaussian mixture model (GMM) based and context based entropy models, to fully explore the correlation between the two images to reduce the coding bit-rates. In decoding, a cross quality enhancement module is proposed to enhance the image quality based on inverse H matrix. Experimental results show that our HESIC outperforms state-of-the-art SIC methods on InStereo2K and KITTI datasets both quantitatively and qualitatively.

Cite

Text

Deng et al. "Deep Homography for Efficient Stereo Image Compression." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00154

Markdown

[Deng et al. "Deep Homography for Efficient Stereo Image Compression." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/deng2021cvpr-deep/) doi:10.1109/CVPR46437.2021.00154

BibTeX

@inproceedings{deng2021cvpr-deep,
  title     = {{Deep Homography for Efficient Stereo Image Compression}},
  author    = {Deng, Xin and Yang, Wenzhe and Yang, Ren and Xu, Mai and Liu, Enpeng and Feng, Qianhan and Timofte, Radu},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {1492-1501},
  doi       = {10.1109/CVPR46437.2021.00154},
  url       = {https://mlanthology.org/cvpr/2021/deng2021cvpr-deep/}
}