Beyond Loss Functions: Exploring Data-Centric Approaches with Diffusion Model for Domain Generalization
Abstract
There has been a huge effort to tackle the Domain Generalization (DG) problem with a focus on developing new loss functions. Inspired by the image generation capabilities of the diffusion models, we pose a pivotal question: Can diffusion models function as data augmentation tools to address DG from a data-centric perspective, rather than relying on the loss functions? Our findings reveal that trivial cross-domain data augmentation (CDGA) along with the vanilla ERM using readily available diffusion models without additional finetuning outperforms state-of-the-art (SOTA) training algorithms. This paper delves into the exploration of why and how this rudimentary data generation can outperform complicated DG algorithms. With the help of domain shift quantification tools, We empirically show that CDGA reduces the domain shift between domains. We empirically reveal connections between the loss landscape, adversarial robustness, and data generation, illustrating that CDGA reduces loss sharpness and improves robustness against adversarial shifts in data. Additionally, we discuss our intuitions that CDGA along with ERM can be considered as a way to replace the pointwise kernel estimates in ERM with new density estimates in the \textit{vicinity of domain pairs} which can diminish the true data estimation error of ERM under domain shift scenario. These insights advocate for further investigation into the potential of data-centric approaches in DG.
Cite
Text
Hemati et al. "Beyond Loss Functions: Exploring Data-Centric Approaches with Diffusion Model for Domain Generalization." Transactions on Machine Learning Research, 2024.Markdown
[Hemati et al. "Beyond Loss Functions: Exploring Data-Centric Approaches with Diffusion Model for Domain Generalization." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/hemati2024tmlr-beyond/)BibTeX
@article{hemati2024tmlr-beyond,
title = {{Beyond Loss Functions: Exploring Data-Centric Approaches with Diffusion Model for Domain Generalization}},
author = {Hemati, Sobhan and Beitollahi, Mahdi and Estiri, Amir Hossein and Al Omari, Bassel and Lamghari, Soufiane and Khalil, Yasser H. and Chen, Xi and Zhang, Guojun},
journal = {Transactions on Machine Learning Research},
year = {2024},
url = {https://mlanthology.org/tmlr/2024/hemati2024tmlr-beyond/}
}