Cross-View Aggregation Network for Stereo Image Super-Resolution

Abstract

Although stereo image super-resolution has been extensively studied, many existing works only rely on attention in a single epipolar direction to reconstruct stereo images. In the case of asymmetric parallax images, these methods often struggle to capture reliable stereo correspondence, resulting in reconstructed images suffering from blurring and artifacts. In this paper, we propose a novel method called Cross-View Aggregation Network for Stereo Image Super-Resolution (CANSSR) and explore the relationship between multi-directional epipolar lines to construct reliable stereo correspondence. Specifically, we propose a multidirectional cross-view aggregation module (MCAM) that effectively captures multi-directional stereo correspondence and obtains cross-view complementary information. Furthermore, we design a channel-spatial aggregation module (CSAM) that aggregates multi-order global-local information in intra-view to reconstruct clearer texture features. In addition, we equip a large kernel convolution in the Feedforward Network to acquire richer detailed texture information. The extensive experiments conclusively demonstrate that CANSSR outperforms the state-of-the-art method both qualitatively and quantitatively in terms of stereo image super-resolution on the Flickr 1024 and Middlebury datasets.

Cite

Text

Chen et al. "Cross-View Aggregation Network for Stereo Image Super-Resolution." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00647

Markdown

[Chen et al. "Cross-View Aggregation Network for Stereo Image Super-Resolution." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/chen2024cvprw-crossview/) doi:10.1109/CVPRW63382.2024.00647

BibTeX

@inproceedings{chen2024cvprw-crossview,
  title     = {{Cross-View Aggregation Network for Stereo Image Super-Resolution}},
  author    = {Chen, Zhitao and Lu, Tao and Zhao, Kanghui and Zhu, Bolin and Li, Zhen and Wang, Jiaming and Zhang, Yanduo},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {6469-6478},
  doi       = {10.1109/CVPRW63382.2024.00647},
  url       = {https://mlanthology.org/cvprw/2024/chen2024cvprw-crossview/}
}