Latent Feature Regression for Multivariate Count Data
Abstract
We consider the problem of regression on multivariate count data and present a Gibbs sampler for a latent feature regression model suitable for both under- and overdispersed response variables. The model learns count-valued latent features conditional on arbitrary covariates, modeling them as negative binomial variables, and maps them into the dependent count-valued observations using a Dirichlet-multinomial distribution. From another viewpoint, the model can be seen as a generalization of a specific topic model for scenarios where we are interested in generating the actual counts of observations and not just their relative frequencies and co-occurrences. The model is demonstrated on a smart traffic application where the task is to predict public transportation volume for unknown locations based on a characterization of the close-by services and venues.
Cite
Text
Klami et al. "Latent Feature Regression for Multivariate Count Data." International Conference on Artificial Intelligence and Statistics, 2015.Markdown
[Klami et al. "Latent Feature Regression for Multivariate Count Data." International Conference on Artificial Intelligence and Statistics, 2015.](https://mlanthology.org/aistats/2015/klami2015aistats-latent/)BibTeX
@inproceedings{klami2015aistats-latent,
title = {{Latent Feature Regression for Multivariate Count Data}},
author = {Klami, Arto and Tripathi, Abhishek and Sirola, Johannes and Väre, Lauri and Roulland, Frédéric},
booktitle = {International Conference on Artificial Intelligence and Statistics},
year = {2015},
url = {https://mlanthology.org/aistats/2015/klami2015aistats-latent/}
}