Bias on Demand: Investigating Bias with a Synthetic Data Generator
Abstract
Machine Learning (ML) systems are increasingly being adopted to make decisions that might have a significant impact on people's lives. Because these decision-making systems rely on data-driven learning, the risk is that they will systematically propagate the bias embedded in the data. To prevent harmful consequences, it is essential to comprehend how and where bias is introduced and possibly how to mitigate it. We demonstrate Bias on Demand, a framework to generate synthetic datasets with different types of bias, which is available as an open-source toolkit and as a pip package. We include a demo of our proposed synthetic data generator, in which we illustrate experiments on different scenarios to showcase the interconnection between biases and their effect on performance and fairness evaluations. We encourage readers to explore the full paper for a more detailed analysis.
Cite
Text
Baumann et al. "Bias on Demand: Investigating Bias with a Synthetic Data Generator." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/828
Markdown
[Baumann et al. "Bias on Demand: Investigating Bias with a Synthetic Data Generator." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/baumann2023ijcai-bias/) doi:10.24963/IJCAI.2023/828
BibTeX
@inproceedings{baumann2023ijcai-bias,
title = {{Bias on Demand: Investigating Bias with a Synthetic Data Generator}},
author = {Baumann, Joachim and Castelnovo, Alessandro and Cosentini, Andrea and Crupi, Riccardo and Inverardi, Nicole and Regoli, Daniele},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2023},
pages = {7110--7114},
doi = {10.24963/IJCAI.2023/828},
url = {https://mlanthology.org/ijcai/2023/baumann2023ijcai-bias/}
}