Data Augmentation for Imbalanced Regression
Abstract
In this work, we consider the problem of imbalanced data in a regression framework when the imbalanced phenomenon concerns continuous or discrete covariates. Such a situation can lead to biases in the estimates. In this case, we propose a data augmentation algorithm that combines a weighted resampling (WR) and a data augmentation (DA) procedure. In a first step, the DA procedure permits exploring a wider support than the initial one. In a second step, the WR method drives the exogenous distribution to a target one. We discuss the choice of the DA procedure through a numerical study that illustrates the advantages of this approach. Finally, an actuarial application is studied.
Cite
Text
Stocksieker et al. "Data Augmentation for Imbalanced Regression." Artificial Intelligence and Statistics, 2023.Markdown
[Stocksieker et al. "Data Augmentation for Imbalanced Regression." Artificial Intelligence and Statistics, 2023.](https://mlanthology.org/aistats/2023/stocksieker2023aistats-data/)BibTeX
@inproceedings{stocksieker2023aistats-data,
title = {{Data Augmentation for Imbalanced Regression}},
author = {Stocksieker, Samuel and Pommeret, Denys and Charpentier, Arthur},
booktitle = {Artificial Intelligence and Statistics},
year = {2023},
pages = {7774-7799},
volume = {206},
url = {https://mlanthology.org/aistats/2023/stocksieker2023aistats-data/}
}