Memory Efficient Matting with Adaptive Token Routing
Abstract
Transformer-based models have recently achieved outstanding performance in image matting. However, their application to high-resolution images remains challenging due to the quadratic complexity of global self-attention. To address this issue, we propose MEMatte, a memory-efficient matting framework for processing high-resolution images. MEMatte incorporates a router before each global attention block, directing informative tokens to the global attention while routing other tokens to a Lightweight Token Refinement Module (LTRM). Specifically, the router employs a local-global strategy to predict the routing probability of each token, and the LTRM utilizes efficient modules to simulate global attention. Additionally, we introduce a Batch-constrained Adaptive Token Routing (BATR) mechanism, which allows each router to dynamically route tokens based on image content and the stages of attention block in the network. Furthermore, we construct an ultra high-resolution image matting dataset, UHR-395, comprising 35,500 training images and 1,000 test images, with an average resolution of 4872 × 6017. This dataset is created by compositing 395 different alpha mattes across 11 categories onto various backgrounds, all with high-quality manual annotation. Extensive experiments demonstrate that MEMatte outperforms existing methods on both high-resolution and real-world datasets, significantly reducing memory usage by approximately 88% and latency by 50% on the Composition-1K benchmark.
Cite
Text
Lin et al. "Memory Efficient Matting with Adaptive Token Routing." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I5.32563Markdown
[Lin et al. "Memory Efficient Matting with Adaptive Token Routing." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/lin2025aaai-memory/) doi:10.1609/AAAI.V39I5.32563BibTeX
@inproceedings{lin2025aaai-memory,
title = {{Memory Efficient Matting with Adaptive Token Routing}},
author = {Lin, Yiheng and Hu, Yihan and Zhang, Chenyi and Liu, Ting and Qu, Xiaochao and Liu, Luoqi and Zhao, Yao and Wei, Yunchao},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {5298-5306},
doi = {10.1609/AAAI.V39I5.32563},
url = {https://mlanthology.org/aaai/2025/lin2025aaai-memory/}
}