End-to-End Single-Frame Image Signal Processing for High Dynamic Range Scenes

Abstract

This paper considers photography of high dynamic range scenes containing mixtures of shadows and highlights on mobile phones. Multi-frame merging constructs a high-quality image at the cost of capturing multiple frames of the same scene. Contrarily, end-to-end optimized image signal processing (E2EISP) produces an enhanced image from a single-frame Bayer array. This paper combines the merits of the two approaches by using labels of high-quality multi-frame merged images to train E2EISP with a novel neural network architecture composed of a multi-head mixture of brightness enhancement for accurately processing shadows/highlights and a multi-head mixture of image processing featured camera settings of white balance and color correction for a proper color generation. We also proposed a combination of supervised, unsupervised, and generative adversarial losses for brightness, edge, and detail enhancement. Experimental results show that the proposed single-frame ISP produces enhanced images and outperforms state-of-the-art methods.

Cite

Text

Dinh and Choi. "End-to-End Single-Frame Image Signal Processing for High Dynamic Range Scenes." Winter Conference on Applications of Computer Vision, 2023.

Markdown

[Dinh and Choi. "End-to-End Single-Frame Image Signal Processing for High Dynamic Range Scenes." Winter Conference on Applications of Computer Vision, 2023.](https://mlanthology.org/wacv/2023/dinh2023wacv-endtoend/)

BibTeX

@inproceedings{dinh2023wacv-endtoend,
  title     = {{End-to-End Single-Frame Image Signal Processing for High Dynamic Range Scenes}},
  author    = {Dinh, Khanh Quoc and Choi, Kwang Pyo},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2023},
  pages     = {2449-2458},
  url       = {https://mlanthology.org/wacv/2023/dinh2023wacv-endtoend/}
}