Discretizing Continuous Attributes While Learning Bayesian Networks

Abstract

We introduce a method for learning Bayesian networks that handles the discretization of continuous variables as an integral part of the learning process. The main ingredient in this method is a new metric based on the Minimal Description Length principle for choosing the threshold values for the discretization while learning the Bayesian network structure. This score balances the complexity of the learned discretization and the learned network structure against how well they model the training data. This ensures that the discretization of each variable introduces just enough intervals to capture its interaction with adjacent variables in the network. We formally derive the new metric, study its main properties, and propose an iterative algorithm for learning a discretization policy. Finally, we illustrate its behavior in applications to supervised learning. 1 INTRODUCTION Bayesian networks provide efficient and effective representation of the joint probability distribution over a set ...

Cite

Text

Friedman and Goldszmidt. "Discretizing Continuous Attributes While Learning Bayesian Networks." International Conference on Machine Learning, 1996.

Markdown

[Friedman and Goldszmidt. "Discretizing Continuous Attributes While Learning Bayesian Networks." International Conference on Machine Learning, 1996.](https://mlanthology.org/icml/1996/friedman1996icml-discretizing/)

BibTeX

@inproceedings{friedman1996icml-discretizing,
  title     = {{Discretizing Continuous Attributes While Learning Bayesian Networks}},
  author    = {Friedman, Nir and Goldszmidt, Moisés},
  booktitle = {International Conference on Machine Learning},
  year      = {1996},
  pages     = {157-165},
  url       = {https://mlanthology.org/icml/1996/friedman1996icml-discretizing/}
}