Task-Customized Mixture of Adapters for General Image Fusion
Abstract
General image fusion aims at integrating important information from multi-source images. However due to the significant cross-task gap the respective fusion mechanism varies considerably in practice resulting in limited performance across subtasks. To handle this problem we propose a novel task-customized mixture of adapters (TC-MoA) for general image fusion adaptively prompting various fusion tasks in a unified model. We borrow the insight from the mixture of experts (MoE) taking the experts as efficient tuning adapters to prompt a pre-trained foundation model. These adapters are shared across different tasks and constrained by mutual information regularization ensuring compatibility with different tasks while complementarity for multi-source images. The task-specific routing networks customize these adapters to extract task-specific information from different sources with dynamic dominant intensity performing adaptive visual feature prompt fusion. Notably our TC-MoA controls the dominant intensity bias for different fusion tasks successfully unifying multiple fusion tasks in a single model. Extensive experiments show that TC-MoA outperforms the competing approaches in learning commonalities while retaining compatibility for general image fusion (multi-modal multi-exposure and multi-focus) and also demonstrating striking controllability on more generalization experiments. The code is available at https://github.com/YangSun22/TC-MoA.
Cite
Text
Zhu et al. "Task-Customized Mixture of Adapters for General Image Fusion." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00678Markdown
[Zhu et al. "Task-Customized Mixture of Adapters for General Image Fusion." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/zhu2024cvpr-taskcustomized/) doi:10.1109/CVPR52733.2024.00678BibTeX
@inproceedings{zhu2024cvpr-taskcustomized,
title = {{Task-Customized Mixture of Adapters for General Image Fusion}},
author = {Zhu, Pengfei and Sun, Yang and Cao, Bing and Hu, Qinghua},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {7099-7108},
doi = {10.1109/CVPR52733.2024.00678},
url = {https://mlanthology.org/cvpr/2024/zhu2024cvpr-taskcustomized/}
}