Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks

Abstract

Recent research tries to extend image restoration capabilities from human perception to machine perception, thereby enhancing the performance of high-level vision tasks in degraded environments. These methods, primarily based on supervised learning, typically involve the retraining of restoration networks or high-level vision networks. However, collecting paired data in real-world scenarios and retraining large-scale models are challenge. To this end, we propose an unsupervised learning method called Variational Translator (VaT), which does not require retraining existing restoration and high-level vision networks. Instead, it establishes a lightweight network that serves as an intermediate bridge between them. By variational inference, VaT approximates the joint distribution of restoration output and high-level vision input, dividing the optimization objective into preserving content and maximizing marginal likelihood associated with high-level vision tasks. By cleverly leveraging self-training paradigms, VaT achieves the above optimization objective without requiring labels. As a result, the translated images maintain a close resemblance to their original content while also demonstrating exceptional performance on high-level vision tasks. Extensive experiments in dehazing and low-light enhancement for detection and classification show the superiority of our method over other state-of-the-art unsupervised counterparts, even significantly surpassing supervised methods in some complex real-world scenarios..

Cite

Text

Wu and Jin. "Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73113-6_13

Markdown

[Wu and Jin. "Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/wu2024eccv-unsupervised/) doi:10.1007/978-3-031-73113-6_13

BibTeX

@inproceedings{wu2024eccv-unsupervised,
  title     = {{Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks}},
  author    = {Wu, Jiawei and Jin, Zhi},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73113-6_13},
  url       = {https://mlanthology.org/eccv/2024/wu2024eccv-unsupervised/}
}