Post-Processing for Individual Fairness

Abstract

Post-processing in algorithmic fairness is a versatile approach for correcting bias in ML systems that are already used in production. The main appeal of post-processing is that it avoids expensive retraining. In this work, we propose general post-processing algorithms for individual fairness (IF). We consider a setting where the learner only has access to the predictions of the original model and a similarity graph between individuals, guiding the desired fairness constraints. We cast the IF post-processing problem as a graph smoothing problem corresponding to graph Laplacian regularization that preserves the desired "treat similar individuals similarly" interpretation. Our theoretical results demonstrate the connection of the new objective function to a local relaxation of the original individual fairness. Empirically, our post-processing algorithms correct individual biases in large-scale NLP models such as BERT, while preserving accuracy.

PDF NeurIPS OpenReview Code Semantic Scholar

Cite

Text

Petersen et al. "Post-Processing for Individual Fairness." Neural Information Processing Systems, 2021.

Markdown

[Petersen et al. "Post-Processing for Individual Fairness." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/petersen2021neurips-postprocessing/)

BibTeX

@inproceedings{petersen2021neurips-postprocessing,
  title     = {{Post-Processing for Individual Fairness}},
  author    = {Petersen, Felix and Mukherjee, Debarghya and Sun, Yuekai and Yurochkin, Mikhail},
  booktitle = {Neural Information Processing Systems},
  year      = {2021},
  url       = {https://mlanthology.org/neurips/2021/petersen2021neurips-postprocessing/}
}