Hierarchical Maximum Entropy Density Estimation

Abstract

We study the problem of simultaneously estimating several densities where the datasets are organized into overlapping groups, such as a hierarchy. For this problem, we propose a maximum entropy formulation, which systematically incorporates the groups and allows us to share the strength of prediction across similar datasets. We derive general performance guarantees, and show how some previous approaches, such as hierarchical shrinkage and hierarchical priors, can be derived as special cases. We demonstrate the proposed technique on synthetic data and in a realworld application to modeling the geographic distributions of species hierarchically grouped in a taxonomy. Specifically, we model the geographic distributions of species in the Australian wet tropics and Northeast New South Wales. In these regions, small numbers of samples per species significantly hinder effective prediction. Substantial benefits are obtained by combining information across taxonomic groups.

Cite

Text

Dudík et al. "Hierarchical Maximum Entropy Density Estimation." International Conference on Machine Learning, 2007. doi:10.1145/1273496.1273528

Markdown

[Dudík et al. "Hierarchical Maximum Entropy Density Estimation." International Conference on Machine Learning, 2007.](https://mlanthology.org/icml/2007/dudik2007icml-hierarchical/) doi:10.1145/1273496.1273528

BibTeX

@inproceedings{dudik2007icml-hierarchical,
  title     = {{Hierarchical Maximum Entropy Density Estimation}},
  author    = {Dudík, Miroslav and Blei, David M. and Schapire, Robert E.},
  booktitle = {International Conference on Machine Learning},
  year      = {2007},
  pages     = {249-256},
  doi       = {10.1145/1273496.1273528},
  url       = {https://mlanthology.org/icml/2007/dudik2007icml-hierarchical/}
}