FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection
Abstract
Detecting objects seamlessly blended into their surroundings represents a complex task for both human cognitive capabilities and advanced artificial intelligence algorithms. Currently, the majority of methodologies for detecting camouflaged objects mainly focus on utilizing discriminative models with various unique designs. However, it has been observed that generative models, such as Stable Diffusion, possess stronger capabilities for understanding various objects in complex environments; Yet their potential for the cognition and detection of camouflaged objects has not been extensively explored. In this study, we present a novel denoising diffusion model, namely FocusDiffuser, to investigate how generative models can enhance the detection and interpretation of camouflaged objects. We believe that the secret to spotting camouflaged objects lies in catching the subtle nuances in details. Consequently, our FocusDiffuser innovatively integrates specialized enhancements, notably the Boundary-Driven LookUp (BDLU) module and Cyclic Positioning (CP) module, to elevate standard diffusion models, significantly boosting the detail-oriented analytical capabilities. Our experiments demonstrate that FocusDiffuser, from a generative perspective, effectively addresses the challenge of camouflaged object detection, surpassing leading models on benchmarks like CAMO, COD10K and NC4K. Code and pre-trained models are available at https://github.com/JWZhao-uestc/FocusDiffuser.
Cite
Text
Zhao et al. "FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73668-1_11Markdown
[Zhao et al. "FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zhao2024eccv-focusdiffuser/) doi:10.1007/978-3-031-73668-1_11BibTeX
@inproceedings{zhao2024eccv-focusdiffuser,
title = {{FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection}},
author = {Zhao, Jianwei and Li, Xin and Yang, Fan and Zhai, Qiang and Luo, Ao and Jiao, Zhicheng and Cheng, Hong},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73668-1_11},
url = {https://mlanthology.org/eccv/2024/zhao2024eccv-focusdiffuser/}
}