ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs
Abstract
Methods for finetuning generative models for concept-driven personalization generally achieve strong results for subject-driven or style-driven generation. Recently, low-rank adaptations () have been proposed as a parameter-efficient way of achieving concept-driven personalization. While recent work explores the combination of separate LoRAs to achieve joint generation of learned styles and subjects, existing techniques do not reliably address the problem, so that either subject fidelity or style fidelity are compromised. We propose , a method to cheaply and effectively merge independently trained style and subject LoRAs in order to achieve generation of any user-provided subject in any user-provided style. Experiments on a wide range of subject and style combinations show that can generate compelling results with meaningful improvements over baselines in subject and style fidelity while preserving the ability to recontextualize.
Cite
Text
Shah et al. "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73232-4_24Markdown
[Shah et al. "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/shah2024eccv-ziplora/) doi:10.1007/978-3-031-73232-4_24BibTeX
@inproceedings{shah2024eccv-ziplora,
title = {{ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs}},
author = {Shah, Viraj and Ruiz, Nataniel and Cole, Forrester and Lu, Erika and Lazebnik, Svetlana and Li, Yuanzhen and Jampani, Varun},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73232-4_24},
url = {https://mlanthology.org/eccv/2024/shah2024eccv-ziplora/}
}