Scalable Probabilistic Tensor Factorization for Binary and Count Data
Abstract
Tensor factorization methods provide a useful way to extract latent factors from complex multirelational data, and also for predicting missing data. Developing tensor factorization methods for massive tensors, especially when the data are binary- or count-valued (which is true of most real-world tensors), however, remains a challenge. We develop a scalable probabilistic tensor factorization framework that enables us to perform efficient factorization of massive binary and count tensor data. The framework is based on (i) the Polya-Gamma augmentation strategy which makes the model fully locally conjugate and allows closed-form parameter updates when data are binary- or count-valued; and (ii) an efficient online Expectation Maximization algorithm, which allows processing data in small mini-batches, and facilitates handling massive tensor data. Moreover, various types of constraints on the factor matrices (e.g., sparsity, non-negativity) can be incorporated under the proposed framework, providing good interpretability, which can be useful for qualitative analyses of the results. We apply the proposed framework on analyzing several binary- and count-valued real-world data sets.
Cite
Text
Rai et al. "Scalable Probabilistic Tensor Factorization for Binary and Count Data." International Joint Conference on Artificial Intelligence, 2015.Markdown
[Rai et al. "Scalable Probabilistic Tensor Factorization for Binary and Count Data." International Joint Conference on Artificial Intelligence, 2015.](https://mlanthology.org/ijcai/2015/rai2015ijcai-scalable/)BibTeX
@inproceedings{rai2015ijcai-scalable,
title = {{Scalable Probabilistic Tensor Factorization for Binary and Count Data}},
author = {Rai, Piyush and Hu, Changwei and Harding, Matthew and Carin, Lawrence},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2015},
pages = {3770-3776},
url = {https://mlanthology.org/ijcai/2015/rai2015ijcai-scalable/}
}