Self-Supervised Video Desmoking for Laparoscopic Surgery

Abstract

Due to the difficulty of collecting real paired data, most existing desmoking methods train the models by synthesizing smoke, generalizing poorly to real surgical scenarios. Although a few works have explored single-image real-world desmoking in unpaired learning manners, they still encounter challenges in handling dense smoke. In this work, we address these issues together by introducing the self-supervised surgery video desmoking (SelfSVD). On the one hand, we observe that the frame captured before the activation of high-energy devices is generally clear (named pre-smoke frame, PS frame), thus it can serve as supervision for other smoky frames, making real-world self-supervised video desmoking practically feasible. On the other hand, in order to enhance the desmoking performance, we further feed the valuable information from PS frame into models, where a masking strategy and a regularization term are presented to avoid trivial solutions. In addition, we construct a real surgery video dataset for desmoking, which covers a variety of smoky scenes. Extensive experiments on the dataset show that our SelfSVD can remove smoke more effectively and efficiently while recovering more photo-realistic details than the state-of-the-art methods. The dataset, codes, and pre-trained models are available at https: //github.com/ZcsrenlongZ/SelfSVD.

Cite

Text

Wu et al. "Self-Supervised Video Desmoking for Laparoscopic Surgery." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73220-1_18

Markdown

[Wu et al. "Self-Supervised Video Desmoking for Laparoscopic Surgery." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/wu2024eccv-selfsupervised/) doi:10.1007/978-3-031-73220-1_18

BibTeX

@inproceedings{wu2024eccv-selfsupervised,
  title     = {{Self-Supervised Video Desmoking for Laparoscopic Surgery}},
  author    = {Wu, Renlong and Zhang, Zhilu and Zhang, Shuohao and Gou, Longfei and Chen, Haobin and Zhang, Lei and Chen, Hao and Zuo, Wangmeng},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73220-1_18},
  url       = {https://mlanthology.org/eccv/2024/wu2024eccv-selfsupervised/}
}