DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations
Abstract
Pre-trained Vision Transformers now serve as powerful tools for computer vision. Yet, efficiently adapting them for multiple tasks remains a challenge that arises from the need to modify the rich hidden representations encoded by the learned weight matrices, without inducing interference between tasks. Current parameter-efficient methods like LoRA, which apply low-rank updates, force tasks to compete within constrained subspaces, ultimately degrading performance. We introduce DiTASK, a novel Diffeomorphic Multi-Task Fine-Tuning approach that maintains pre-trained representations by preserving weight matrix singular vectors, while enabling task-specific adaptations through neural diffeomorphic transformations of the singular values. By following this approach, DiTASK enables both shared and task-specific feature modulations with minimal added parameters. Our theoretical analysis shows that DiTASK achieves full-rank updates during optimization, preserving the geometric structure of pre-trained features, and establishing a new paradigm for efficient multi-task learning (MTL). Our experiments on PASCAL MTL and NYUD show that DiTASK achieves state-of-the-art performance across four dense prediction tasks, using 75% fewer parameters than existing methods.
Cite
Text
Mantri et al. "DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.02348Markdown
[Mantri et al. "DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/mantri2025cvpr-ditask/) doi:10.1109/CVPR52734.2025.02348BibTeX
@inproceedings{mantri2025cvpr-ditask,
title = {{DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations}},
author = {Mantri, Krishna Sri Ipsit and Schönlieb, Carola-Bibiane and Ribeiro, Bruno and Baskin, Chaim and Eliasof, Moshe},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2025},
pages = {25218-25229},
doi = {10.1109/CVPR52734.2025.02348},
url = {https://mlanthology.org/cvpr/2025/mantri2025cvpr-ditask/}
}