Scalable Bayesian Non-Negative Tensor Factorization for Massive Count Data

Abstract

We present a Bayesian non-negative tensor factorization model for count-valued tensor data, and develop scalable inference algorithms both batch and online for dealing with massive tensors. Our generative model can handle overdispersed counts as well as infer the rank of the decomposition. Moreover, leveraging a reparameterization of the Poisson distribution as a multinomial facilitates conjugacy in the model and enables simple and efficient Gibbs sampling and variational Bayes VB inference updates, with a computational cost that only depends on the number of nonzeros in the tensor. The model also provides a nice interpretability for the factors; in our model, each factor corresponds to a "topic". We develop a set of online inference algorithms that allow further scaling up the model to massive tensors, for which batch inference methods may be infeasible. We apply our framework on diverse real-world applications, such as multiway topic modeling on a scientific publications database, analyzing a political science data set, and analyzing a massive household transactions data set.

Cite

Text

Hu et al. "Scalable Bayesian Non-Negative Tensor Factorization for Massive Count Data." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015. doi:10.1007/978-3-319-23525-7_4

Markdown

[Hu et al. "Scalable Bayesian Non-Negative Tensor Factorization for Massive Count Data." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015.](https://mlanthology.org/ecmlpkdd/2015/hu2015ecmlpkdd-scalable/) doi:10.1007/978-3-319-23525-7_4

BibTeX

@inproceedings{hu2015ecmlpkdd-scalable,
  title     = {{Scalable Bayesian Non-Negative Tensor Factorization for Massive Count Data}},
  author    = {Hu, Changwei and Rai, Piyush and Chen, Changyou and Harding, Matthew and Carin, Lawrence},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2015},
  pages     = {53-70},
  doi       = {10.1007/978-3-319-23525-7_4},
  url       = {https://mlanthology.org/ecmlpkdd/2015/hu2015ecmlpkdd-scalable/}
}