Latent Feature Regression for Multivariate Count Data

Abstract

We consider the problem of regression on multivariate count data and present a Gibbs sampler for a latent feature regression model suitable for both under- and overdispersed response variables. The model learns count-valued latent features conditional on arbitrary covariates, modeling them as negative binomial variables, and maps them into the dependent count-valued observations using a Dirichlet-multinomial distribution. From another viewpoint, the model can be seen as a generalization of a specific topic model for scenarios where we are interested in generating the actual counts of observations and not just their relative frequencies and co-occurrences. The model is demonstrated on a smart traffic application where the task is to predict public transportation volume for unknown locations based on a characterization of the close-by services and venues.

Cite

Text

Klami et al. "Latent Feature Regression for Multivariate Count Data." International Conference on Artificial Intelligence and Statistics, 2015.

Markdown

[Klami et al. "Latent Feature Regression for Multivariate Count Data." International Conference on Artificial Intelligence and Statistics, 2015.](https://mlanthology.org/aistats/2015/klami2015aistats-latent/)

BibTeX

@inproceedings{klami2015aistats-latent,
  title     = {{Latent Feature Regression for Multivariate Count Data}},
  author    = {Klami, Arto and Tripathi, Abhishek and Sirola, Johannes and Väre, Lauri and Roulland, Frédéric},
  booktitle = {International Conference on Artificial Intelligence and Statistics},
  year      = {2015},
  url       = {https://mlanthology.org/aistats/2015/klami2015aistats-latent/}
}