EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction

Abstract

Leveraging its robust linear global modeling capability, Mamba has notably excelled in computer vision. Despite its success, existing Mamba-based vision models have overlooked the nuances of event-driven tasks, especially in video reconstruction. Event-based video reconstruction (EBVR) demands spatial translation invariance and close attention to local event relationships in the spatio-temporal domain. Unfortunately, conventional Mamba algorithms apply static window partitions and standard reshape scanning methods, leading to significant losses in local connectivity. To overcome these limitations, we introduce EventMamba—a specialized model designed for EBVR task. EventMamba innovates by incorporating random window offset (RWO) in the spatial domain, moving away from the restrictive fixed partitioning. Additionally, it features a new consistent traversal serialization approach in the spatio-temporal domain, which maintains the proximity of adjacent events both spatially and temporally. These enhancements enable EventMamba to retain Mamba’s robust modeling capabilities while significantly preserving the spatio-temporal locality of event data. Comprehensive testing on multiple datasets shows that EventMamba markedly enhances video reconstruction, drastically improving computation speed while delivering superior visual quality compared to Transformer-based methods.

Cite

Text

Ge et al. "EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I3.32319

Markdown

[Ge et al. "EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/ge2025aaai-eventmamba/) doi:10.1609/AAAI.V39I3.32319

BibTeX

@inproceedings{ge2025aaai-eventmamba,
  title     = {{EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction}},
  author    = {Ge, Chengjie and Fu, Xueyang and He, Peng and Wang, Kunyu and Cao, Chengzhi and Zha, Zheng-Jun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {3104-3112},
  doi       = {10.1609/AAAI.V39I3.32319},
  url       = {https://mlanthology.org/aaai/2025/ge2025aaai-eventmamba/}
}