Dr-Fairness: Dynamic Data Ratio Adjustment for Fair Training on Real and Generated Data
Abstract
Fair visual recognition has become critical for preventing demographic disparity. A major cause of model unfairness is the imbalanced representation of different groups in training data. Recently, several works aim to alleviate this issue using generated data. However, these approaches often use generated data to obtain similar amounts of data across groups, which is not optimal for achieving high fairness due to different learning difficulties and generated data qualities across groups. To address this issue, we propose a novel adaptive sampling approach that leverages both real and generated data for fairness. We design a bilevel optimization that finds the optimal data sampling ratios among groups and between real and generated data while training a model. The ratios are dynamically adjusted considering both the model's accuracy as well as its fairness. To efficiently solve our non-convex bilevel optimization, we propose a simple approximation to the solution given by the implicit function theorem. Extensive experiments show that our framework achieves state-of-the-art fairness and accuracy on the CelebA and ImageNet People Subtree datasets. We also observe that our method adaptively relies less on the generated data when it has poor quality. Our work shows the importance of using generated data together with real data for improving model fairness.
Cite
Text
Roh et al. "Dr-Fairness: Dynamic Data Ratio Adjustment for Fair Training on Real and Generated Data." Transactions on Machine Learning Research, 2023.Markdown
[Roh et al. "Dr-Fairness: Dynamic Data Ratio Adjustment for Fair Training on Real and Generated Data." Transactions on Machine Learning Research, 2023.](https://mlanthology.org/tmlr/2023/roh2023tmlr-drfairness/)BibTeX
@article{roh2023tmlr-drfairness,
title = {{Dr-Fairness: Dynamic Data Ratio Adjustment for Fair Training on Real and Generated Data}},
author = {Roh, Yuji and Nie, Weili and Huang, De-An and Whang, Steven Euijong and Vahdat, Arash and Anandkumar, Anima},
journal = {Transactions on Machine Learning Research},
year = {2023},
url = {https://mlanthology.org/tmlr/2023/roh2023tmlr-drfairness/}
}