A Non-Parametric Bayesian Approach for Uplift Discretization and Feature Selection

Abstract

Uplift modeling aims to estimate the incremental impact of a treatment, such as a marketing campaign or a drug, on an individual’s outcome. Bank or Telecom uplift data often have hundreds to thousands of features. In such situations, detection of irrelevant features is an essential step to reduce computational time and increase model performance. We present a parameter-free feature selection method for uplift modeling founded on a Bayesian approach. We design an automatic feature discretization method for uplift based on a space of discretization models and a prior distribution. From this model space, we define a Bayes optimal evaluation criterion of a discretization model for uplift. We then propose an optimization algorithm that finds near-optimal discretization for estimating uplift in $O(n \log n)$ O ( n log n ) time. Experiments demonstrate the high performances obtained by this new discretization method. Then we describe a parameter-free feature selection method for uplift. Experiments show that the new method both removes irrelevant features and achieves better performances than state of the art methods.

Cite

Text

Rafla et al. "A Non-Parametric Bayesian Approach for Uplift Discretization and Feature Selection." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022. doi:10.1007/978-3-031-26419-1_15

Markdown

[Rafla et al. "A Non-Parametric Bayesian Approach for Uplift Discretization and Feature Selection." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022.](https://mlanthology.org/ecmlpkdd/2022/rafla2022ecmlpkdd-nonparametric/) doi:10.1007/978-3-031-26419-1_15

BibTeX

@inproceedings{rafla2022ecmlpkdd-nonparametric,
  title     = {{A Non-Parametric Bayesian Approach for Uplift Discretization and Feature Selection}},
  author    = {Rafla, Mina and Voisine, Nicolas and Crémilleux, Bruno and Boullé, Marc},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2022},
  pages     = {239-254},
  doi       = {10.1007/978-3-031-26419-1_15},
  url       = {https://mlanthology.org/ecmlpkdd/2022/rafla2022ecmlpkdd-nonparametric/}
}