Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments

Chen, Yining; Rosenfeld, Elan; Sellke, Mark; Ma, Tengyu; Risteski, Andrej

Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments

Yining Chen, Elan Rosenfeld, Mark Sellke, Tengyu Ma, Andrej Risteski

NeurIPS 2022

/neurips/2022/chen2022neurips-iterative/

Abstract

Domain generalization aims at performing well on unseen test environments with data from a limited number of training environments. Despite a proliferation of proposed algorithms for this task, assessing their performance both theoretically and empirically is still very challenging. Distributional matching algorithms such as (Conditional) Domain Adversarial Networks [Ganin et al., 2016, Long et al., 2018] are popular and enjoy empirical success, but they lack formal guarantees. Other approaches such as Invariant Risk Minimization (IRM) require a prohibitively large number of training environments---linear in the dimension of the spurious feature space $d_s$---even on simple data models like the one proposed by [Rosenfeld et al., 2021]. Under a variant of this model, we show that ERM and IRM can fail to find the optimal invariant predictor with $o(d_s)$ environments. We then present an iterative feature matching algorithm that is guaranteed with high probability to find the optimal invariant predictor after seeing only $O(\log d_s)$ environments. Our results provide the first theoretical justification for distribution-matching algorithms widely used in practice under a concrete nontrivial data model.

PDF NeurIPS OpenReview Semantic Scholar

Cite

Text

Chen et al. "Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments." Neural Information Processing Systems, 2022.

Markdown

[Chen et al. "Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/chen2022neurips-iterative/)

BibTeX

@inproceedings{chen2022neurips-iterative,
  title     = {{Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments}},
  author    = {Chen, Yining and Rosenfeld, Elan and Sellke, Mark and Ma, Tengyu and Risteski, Andrej},
  booktitle = {Neural Information Processing Systems},
  year      = {2022},
  url       = {https://mlanthology.org/neurips/2022/chen2022neurips-iterative/}
}