GAN-Based Vision Transformer for High-Quality Thermal Image Enhancement
Abstract
Generative Adversarial Networks (GANs) have shown an outstanding ability to generate high-quality images with visual realism and similarity to real images. This paper presents a new architecture for thermal image enhancement. Precisely, the strengths of architecture-based vision transformers and generative adversarial networks are exploited. The thermal loss feature introduced in our approach is specifically used to produce high-quality images. Thermal image enhancement also relies on fine-tuning based on visible images, resulting in an overall improvement in image quality. A visual quality metric was used to evaluate the performance of the proposed architecture. Significant improvements were found over the original thermal images and other enhancement methods established on a subset of the KAIST dataset. The performance of the proposed enhancement architecture is also verified on the detection results by obtaining better performance with a considerable margin regarding different versions of the YOLO detector.
Cite
Text
Marnissi and Fathallah. "GAN-Based Vision Transformer for High-Quality Thermal Image Enhancement." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023. doi:10.1109/CVPRW59228.2023.00089Markdown
[Marnissi and Fathallah. "GAN-Based Vision Transformer for High-Quality Thermal Image Enhancement." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2023.](https://mlanthology.org/cvprw/2023/marnissi2023cvprw-ganbased/) doi:10.1109/CVPRW59228.2023.00089BibTeX
@inproceedings{marnissi2023cvprw-ganbased,
title = {{GAN-Based Vision Transformer for High-Quality Thermal Image Enhancement}},
author = {Marnissi, Mohamed Amine and Fathallah, Abir},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2023},
pages = {817-825},
doi = {10.1109/CVPRW59228.2023.00089},
url = {https://mlanthology.org/cvprw/2023/marnissi2023cvprw-ganbased/}
}