Graphical Dirichlet Process for Clustering Non-Exchangeable Grouped Data
Abstract
We consider the problem of clustering grouped data with possibly non-exchangeable groups whose dependencies can be characterized by a known directed acyclic graph. To allow the sharing of clusters among the non-exchangeable groups, we propose a Bayesian nonparametric approach, termed graphical Dirichlet process, that jointly models the dependent group-specific random measures by assuming each random measure to be distributed as a Dirichlet process whose concentration parameter and base probability measure depend on those of its parent groups. The resulting joint stochastic process respects the Markov property of the directed acyclic graph that links the groups. We characterize the graphical Dirichlet process using a novel hypergraph representation as well as the stick-breaking representation, the restaurant-type representation, and the representation as a limit of a finite mixture model. We develop an efficient posterior inference algorithm and illustrate our model with simulations and a real grouped single-cell data set.
Cite
Text
Chakrabarti et al. "Graphical Dirichlet Process for Clustering Non-Exchangeable Grouped Data." Journal of Machine Learning Research, 2024.Markdown
[Chakrabarti et al. "Graphical Dirichlet Process for Clustering Non-Exchangeable Grouped Data." Journal of Machine Learning Research, 2024.](https://mlanthology.org/jmlr/2024/chakrabarti2024jmlr-graphical/)BibTeX
@article{chakrabarti2024jmlr-graphical,
title = {{Graphical Dirichlet Process for Clustering Non-Exchangeable Grouped Data}},
author = {Chakrabarti, Arhit and Ni, Yang and Morris, Ellen Ruth A. and Salinas, Michael L. and Chapkin, Robert S. and Mallick, Bani K.},
journal = {Journal of Machine Learning Research},
year = {2024},
pages = {1-56},
volume = {25},
url = {https://mlanthology.org/jmlr/2024/chakrabarti2024jmlr-graphical/}
}