Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation

Abstract

Test-time adaptation (TTA) addresses the unforeseen distribution shifts occurring during test time. In TTA, performance, memory consumption, and time consumption are crucial considerations. A recent diffusion-based TTA approach for restoring corrupted images involves image-level updates. However, using pixel space diffusion significantly increases resource requirements compared to conventional model updating TTA approaches, revealing limitations as a TTA method. To address this, we propose a novel TTA method that leverages an image editing model based on a latent diffusion model (LDM) and fine-tunes it using our newly introduced corruption modeling scheme. This scheme enhances the robustness of the diffusion model against distribution shifts by creating (clean, corrupted) image pairs and fine-tuning the model to edit corrupted images into clean ones. Moreover, we introduce a distilled variant to accelerate the model for corruption editing using only 4 network function evaluations (NFEs). We extensively validated our method across various architectures and datasets including image and video domains. Our model achieves the best performance with a 100 times faster runtime than that of a diffusion-based baseline. Furthermore, it is three times faster than the previous model updating TTA method that utilizes data augmentation, making an image-level updating approach more feasible. 1 1 Project page: magentahttps://github.com/oyt9306/Decorruptor

Cite

Text

Oh et al. "Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72943-0_11

Markdown

[Oh et al. "Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/oh2024eccv-efficient/) doi:10.1007/978-3-031-72943-0_11

BibTeX

@inproceedings{oh2024eccv-efficient,
  title     = {{Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation}},
  author    = {Oh, Yeongtak and Lee, Jonghyun and Choi, Jooyoung and Jung, Dahuin and Hwang, Uiwon and Yoon, Sungroh},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72943-0_11},
  url       = {https://mlanthology.org/eccv/2024/oh2024eccv-efficient/}
}