LFTransMamba: A Hybrid Mamba-Transformer Model for Light Field Image Super-Resolution

Abstract

Light Field Super-Resolution (LFSR) seeks to enhance the spatial resolution of light field images while preserving angular consistency. Existing convolution-based networks struggle to capture long-range spatial-angular dependencies, and although Transformer-based methods address this limitation, they incur prohibitive quadratic complexity when processing high-resolution 4D light field data. In contrast, Mamba-based architectures can efficiently capture long-range dependencies but have limited capacity for contextual modeling. To overcome these challenges, we introduce LFTransMamba, a hybrid architecture that integrates Transformer-based global context modeling with Mamba-based efficient long-range dependency capture. Specifically, we propose a Masked Light Field Image Modeling (MLFIM) training strategy, which enhances the modeling of spatial-angular relationships through masked reconstruction without introducing additional modules or loss functions. Furthermore, we present an enhanced Position-Sensitive Windowing mechanism (EPSW), employing Gaussian-weighted aggregation to emphasize central pixels, thereby improving reconstruction quality and mitigating structural artifacts. Our method achieves state-of-the-art performance in the NTIRE 2025 Light Field Image Super-Resolution Challenge, ranking 1st in both the Classic and Large-Model tracks, and 2nd in the Efficiency track. The code is available at https://github.com/OpenMeow/LFTransMamba.

Cite

Text

Jin et al. "LFTransMamba: A Hybrid Mamba-Transformer Model for Light Field Image Super-Resolution." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Jin et al. "LFTransMamba: A Hybrid Mamba-Transformer Model for Light Field Image Super-Resolution." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/jin2025cvprw-lftransmamba/)

BibTeX

@inproceedings{jin2025cvprw-lftransmamba,
  title     = {{LFTransMamba: A Hybrid Mamba-Transformer Model for Light Field Image Super-Resolution}},
  author    = {Jin, Kai and Wei, Zeqiang and Yang, Angulia and Wu, Di and Gao, Mingzhi and Zhou, Xiuzhuang},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {1195-1204},
  url       = {https://mlanthology.org/cvprw/2025/jin2025cvprw-lftransmamba/}
}