Expectile Matrix Factorization for Skewed Data Analysis

Abstract

Matrix factorization is a popular approach to solving matrix estimation problems based on partial observations. Existing matrix factorization is based on least squares and aims to yield a low-rank matrix to interpret the conditional sample means given the observations. However, in many real applications with skewed and extreme data, least squares cannot explain their central tendency or tail distributions, yielding undesired estimates. In this paper, we propose expectile matrix factorization by introducing asymmetric least squares, a key concept in expectile regression analysis, into the matrix factorization framework. We propose an efficient algorithm to solve the new problem based on alternating minimization and quadratic programming. We prove that our algorithm converges to a global optimum and exactly recovers the true underlying low-rank matrices when noise is zero. For synthetic data with skewed noise and a real-world dataset containing web service response times, the proposed scheme achieves lower recovery errors than the existing matrix factorization method based on least squares in a wide range of settings.

Cite

Text

Zhu et al. "Expectile Matrix Factorization for Skewed Data Analysis." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.10502

Markdown

[Zhu et al. "Expectile Matrix Factorization for Skewed Data Analysis." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/zhu2017aaai-expectile/) doi:10.1609/AAAI.V31I1.10502

BibTeX

@inproceedings{zhu2017aaai-expectile,
  title     = {{Expectile Matrix Factorization for Skewed Data Analysis}},
  author    = {Zhu, Rui and Niu, Di and Kong, Linglong and Li, Zongpeng},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2017},
  pages     = {259-266},
  doi       = {10.1609/AAAI.V31I1.10502},
  url       = {https://mlanthology.org/aaai/2017/zhu2017aaai-expectile/}
}