Cross-View Aggregation Network for Stereo Image Super-Resolution
Abstract
Although stereo image super-resolution has been extensively studied, many existing works only rely on attention in a single epipolar direction to reconstruct stereo images. In the case of asymmetric parallax images, these methods often struggle to capture reliable stereo correspondence, resulting in reconstructed images suffering from blurring and artifacts. In this paper, we propose a novel method called Cross-View Aggregation Network for Stereo Image Super-Resolution (CANSSR) and explore the relationship between multi-directional epipolar lines to construct reliable stereo correspondence. Specifically, we propose a multidirectional cross-view aggregation module (MCAM) that effectively captures multi-directional stereo correspondence and obtains cross-view complementary information. Furthermore, we design a channel-spatial aggregation module (CSAM) that aggregates multi-order global-local information in intra-view to reconstruct clearer texture features. In addition, we equip a large kernel convolution in the Feedforward Network to acquire richer detailed texture information. The extensive experiments conclusively demonstrate that CANSSR outperforms the state-of-the-art method both qualitatively and quantitatively in terms of stereo image super-resolution on the Flickr 1024 and Middlebury datasets.
Cite
Text
Chen et al. "Cross-View Aggregation Network for Stereo Image Super-Resolution." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00647Markdown
[Chen et al. "Cross-View Aggregation Network for Stereo Image Super-Resolution." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/chen2024cvprw-crossview/) doi:10.1109/CVPRW63382.2024.00647BibTeX
@inproceedings{chen2024cvprw-crossview,
title = {{Cross-View Aggregation Network for Stereo Image Super-Resolution}},
author = {Chen, Zhitao and Lu, Tao and Zhao, Kanghui and Zhu, Bolin and Li, Zhen and Wang, Jiaming and Zhang, Yanduo},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2024},
pages = {6469-6478},
doi = {10.1109/CVPRW63382.2024.00647},
url = {https://mlanthology.org/cvprw/2024/chen2024cvprw-crossview/}
}