Bayesian Models for Large-Scale Hierarchical Classification
Abstract
A challenging problem in hierarchical classification is to leverage the hierarchical relations among classes for improving classification performance. An even greater challenge is to do so in a manner that is computationally feasible for the large scale problems usually encountered in practice. This paper proposes a set of Bayesian methods to model hierarchical dependencies among class labels using multivari- ate logistic regression. Specifically, the parent-child relationships are modeled by placing a hierarchical prior over the children nodes centered around the parame- ters of their parents; thereby encouraging classes nearby in the hierarchy to share similar model parameters. We present new, efficient variational algorithms for tractable posterior inference in these models, and provide a parallel implementa- tion that can comfortably handle large-scale problems with hundreds of thousands of dimensions and tens of thousands of classes. We run a comparative evaluation on multiple large-scale benchmark datasets that highlights the scalability of our approach, and shows a significant performance advantage over the other state-of- the-art hierarchical methods.
Cite
Text
Gopal et al. "Bayesian Models for Large-Scale Hierarchical Classification." Neural Information Processing Systems, 2012.Markdown
[Gopal et al. "Bayesian Models for Large-Scale Hierarchical Classification." Neural Information Processing Systems, 2012.](https://mlanthology.org/neurips/2012/gopal2012neurips-bayesian/)BibTeX
@inproceedings{gopal2012neurips-bayesian,
title = {{Bayesian Models for Large-Scale Hierarchical Classification}},
author = {Gopal, Siddharth and Yang, Yiming and Bai, Bing and Niculescu-mizil, Alexandru},
booktitle = {Neural Information Processing Systems},
year = {2012},
pages = {2411-2419},
url = {https://mlanthology.org/neurips/2012/gopal2012neurips-bayesian/}
}