Spatio-Focal Bidirectional Disparity Estimation from a Dual-Pixel Image

Abstract

Dual-pixel photography is monocular RGB-D photography with an ultra-high resolution, enabling many applications in computational photography. However, there are still several challenges to fully utilizing dual-pixel photography. Unlike the conventional stereo pair, the dual pixel exhibits a bidirectional disparity that includes positive and negative values, depending on the focus plane depth in an image. Furthermore, capturing a wide range of dual-pixel disparity requires a shallow depth of field, resulting in a severely blurred image, degrading depth estimation performance. Recently, several data-driven approaches have been proposed to mitigate these two challenges. However, due to the lack of the ground-truth dataset of the dual-pixel disparity, existing data-driven methods estimate either inverse depth or blurriness map. In this work, we propose a self-supervised learning method that learns bidirectional disparity by utilizing the nature of anisotropic blur kernels in dual-pixel photography. We observe that the dual-pixel left/right images have reflective-symmetric anisotropic kernels, so their sum is equivalent to that of a conventional image. We take a self-supervised training approach with the novel kernel-split symmetry loss accounting for the phenomenon. Our method does not rely on a training dataset of dual-pixel disparity that does not exist yet. Our method can estimate a complete disparity map with respect to the focus-plane depth from a dual-pixel image, outperforming the baseline dual-pixel methods.

Cite

Text

Kim et al. "Spatio-Focal Bidirectional Disparity Estimation from a Dual-Pixel Image." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.00486

Markdown

[Kim et al. "Spatio-Focal Bidirectional Disparity Estimation from a Dual-Pixel Image." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/kim2023cvpr-spatiofocal/) doi:10.1109/CVPR52729.2023.00486

BibTeX

@inproceedings{kim2023cvpr-spatiofocal,
  title     = {{Spatio-Focal Bidirectional Disparity Estimation from a Dual-Pixel Image}},
  author    = {Kim, Donggun and Jang, Hyeonjoong and Kim, Inchul and Kim, Min H.},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {5023-5032},
  doi       = {10.1109/CVPR52729.2023.00486},
  url       = {https://mlanthology.org/cvpr/2023/kim2023cvpr-spatiofocal/}
}