Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes

Abstract

Mesh saliency enhances the adaptability of 3D vision by identifying and emphasizing regions that naturally attract visual attention. To investigate the interaction between geometric structure and texture in shaping visual attention, we establish a comprehensive mesh saliency dataset, which is the first to systematically capture the differences in saliency distribution under both textured and non-textured visual conditions. Furthermore, we introduce mesh Mamba, a unified saliency prediction model based on a state space model (SSM), designed to adapt across various mesh types. Mesh Mamba effectively analyzes the geometric structure of the mesh while seamlessly incorporating texture features into the topological framework, ensuring coherence throughout appearance-enhanced modeling. More importantly, by subgraph embedding and a bidirectional SSM, the model enables global context modeling for both local geometry and texture, preserving the topological structure and improving the understanding of visual details and structural complexity. Through extensive theoretical and empirical validation, our model not only improves performance across various mesh types but also demonstrates high scalability and versatility, particularly through cross validations of various visual features.

Cite

Text

Zhang et al. "Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01512

Markdown

[Zhang et al. "Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/zhang2025cvpr-mesh/) doi:10.1109/CVPR52734.2025.01512

BibTeX

@inproceedings{zhang2025cvpr-mesh,
  title     = {{Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes}},
  author    = {Zhang, Kaiwei and Zhu, Dandan and Min, Xiongkuo and Zhai, Guangtao},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {16219-16228},
  doi       = {10.1109/CVPR52734.2025.01512},
  url       = {https://mlanthology.org/cvpr/2025/zhang2025cvpr-mesh/}
}