OmniControlNet: Dual-Stage Integration for Conditional Image Generation

Abstract

We provide a two-way integration for the widely adopted ControlNet by integrating external condition generation algorithms into a single dense prediction method and incorporating its individually trained image generation processes into a single model. Despite its tremendous success, the ControlNet of a two-stage pipeline bears limitations in being not self-contained (e.g. calls the external condition generation algorithms) with a large model redundancy (separately trained models for different types of conditioning inputs). Our proposed OmniControlNet consolidates 1) the condition generation (e.g., HED edges, depth maps, user scribble, and animal pose) by a single multitasking dense prediction algorithm under the task embedding guidance and 2) the image generation process for different conditioning types under the textual embedding guidance. OmniControlNet achieves significantly reduced model complexity and redundancy while capable of producing images of comparable quality for conditioned text-to-image generation.

Cite

Text

Wang et al. "OmniControlNet: Dual-Stage Integration for Conditional Image Generation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00739

Markdown

[Wang et al. "OmniControlNet: Dual-Stage Integration for Conditional Image Generation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/wang2024cvprw-omnicontrolnet/) doi:10.1109/CVPRW63382.2024.00739

BibTeX

@inproceedings{wang2024cvprw-omnicontrolnet,
  title     = {{OmniControlNet: Dual-Stage Integration for Conditional Image Generation}},
  author    = {Wang, Yilin and Xu, Haiyang and Zhang, Xiang and Chen, Zeyuan and Sha, Zhizhou and Wang, Zirui and Tu, Zhuowen},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {7436-7448},
  doi       = {10.1109/CVPRW63382.2024.00739},
  url       = {https://mlanthology.org/cvprw/2024/wang2024cvprw-omnicontrolnet/}
}