FSDR: Frequency Space Domain Randomization for Domain Generalization

Abstract

Domain generalization aims to learn a generalizable model from a "known" source domain for various "unknown" target domains. It has been widely studied through domain randomization, which transfers source images to different styles in spatial space to learn domain-agnostic features. However, most existing randomization methods rely on GANs, which often lack control and can even alter the semantic structures of images undesirably. Inspired by JPEG, which converts spatial images into multiple frequency components (FCs), we propose Frequency Space Domain Randomization (FSDR), which randomizes images in frequency space by keeping domain-invariant FCs (DIFs) and randomizing only domain-variant FCs (DVFs). FSDR has two unique features: 1) it decomposes images into DIFs and DVFs, which allows explicit access to and manipulation of them and hence more controllable randomization; 2) it has minimal effect on the semantic structures of images and on domain-invariant features. We examined the domain variance and invariance properties of FCs statistically and designed a network that can identify and fuse DIFs and DVFs dynamically through iterative learning. Extensive experiments over multiple domain-generalizable segmentation tasks show that FSDR achieves superior segmentation and that its performance is even on par with domain adaptation methods that access target data in training.
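The core idea of keeping some frequency components fixed while perturbing others can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes an orthonormal 2D DCT on a single square block (in the spirit of JPEG), and it uses a fixed, hand-picked mask to mark the domain-variant components, whereas the abstract describes a network that identifies them through learning. The names `dct2`, `idct2`, `randomize_dvfs`, `dvf_mask`, and `strength` are all hypothetical.

```python
import math
import random


def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n list of lists."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def dct2(block):
    """2D DCT of a square block: C @ X @ C^T."""
    c = dct_matrix(len(block))
    return matmul(matmul(c, block), transpose(c))


def idct2(coeffs):
    """Inverse 2D DCT: C^T @ F @ C (valid because C is orthonormal)."""
    c = dct_matrix(len(coeffs))
    return matmul(matmul(transpose(c), coeffs), c)


def randomize_dvfs(block, dvf_mask, rng, strength=0.5):
    """Scale only the masked (assumed domain-variant) coefficients by a
    random factor in [1 - strength, 1 + strength]; leave the rest untouched."""
    coeffs = dct2(block)
    for u, row in enumerate(coeffs):
        for v in range(len(row)):
            if dvf_mask[u][v]:
                row[v] *= rng.uniform(1.0 - strength, 1.0 + strength)
    return idct2(coeffs)


n = 8
rng = random.Random(0)
# Synthetic 8x8 "image" block; any square grid of floats would do.
block = [[float((3 * r + 5 * c) % 17) for c in range(n)] for r in range(n)]
# Hypothetical choice: treat the three lowest-frequency coefficients as
# domain-variant. Which FCs are actually variant is what FSDR learns.
dvf_mask = [[(u + v) <= 1 for v in range(n)] for u in range(n)]
augmented = randomize_dvfs(block, dvf_mask, rng)
```

Because the transform is orthonormal, every coefficient outside the mask is recovered unchanged after the round trip, which is the sense in which such a randomization leaves the remaining components of the image alone.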

Cite

Text

Huang et al. "FSDR: Frequency Space Domain Randomization for Domain Generalization." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00682

Markdown

[Huang et al. "FSDR: Frequency Space Domain Randomization for Domain Generalization." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/huang2021cvpr-fsdr/) doi:10.1109/CVPR46437.2021.00682

BibTeX

@inproceedings{huang2021cvpr-fsdr,
  title     = {{FSDR: Frequency Space Domain Randomization for Domain Generalization}},
  author    = {Huang, Jiaxing and Guan, Dayan and Xiao, Aoran and Lu, Shijian},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {6891-6902},
  doi       = {10.1109/CVPR46437.2021.00682},
  url       = {https://mlanthology.org/cvpr/2021/huang2021cvpr-fsdr/}
}