Online Mixed Missing Value Imputation Using Gaussian Copula

Abstract

Many data science algorithms require complete observations, making missing value imputation an important step in many data processing pipelines. Imputation is also of independent interest for applications such as recommender systems. To address real-world big data problems, imputation algorithms must handle mixed data, containing ordinal, boolean, and continuous variables, and such algorithms must be highly scalable. In this work we develop a semi-parametric online algorithm for mixed missing value imputation using a Gaussian Copula. This online algorithm improves on the speed of its offline counterpart by an order of magnitude, with similar accuracy. The online method can also improve on the offline method by adapting to a changing data distribution.

Cite

Text

Landgrebe et al. "Online Mixed Missing Value Imputation Using Gaussian Copula." ICML 2020 Workshops: Artemiss, 2020.

Markdown

[Landgrebe et al. "Online Mixed Missing Value Imputation Using Gaussian Copula." ICML 2020 Workshops: Artemiss, 2020.](https://mlanthology.org/icmlw/2020/landgrebe2020icmlw-online/)

BibTeX

@inproceedings{landgrebe2020icmlw-online,
  title     = {{Online Mixed Missing Value Imputation Using Gaussian Copula}},
  author    = {Landgrebe, Eric and Zhao, Yuxuan and Udell, Madeleine},
  booktitle = {ICML 2020 Workshops: Artemiss},
  year      = {2020},
  url       = {https://mlanthology.org/icmlw/2020/landgrebe2020icmlw-online/}
}