Iterative Missing Data Imputation with Model Form Adaptation and Non-Missing Feature Supervision

Abstract

Iterative imputation is a prevalent method for missing data imputation, in which each feature is imputed in turn by treating it as a target variable estimated from all other features. However, iterative imputation suffers from two principal limitations: (1) it imposes a single parametric model form to impute all features, neglecting the potential for optimal models to vary among features, which risks model misspecification; and (2) it assumes every feature contains missing values, overlooking the potential presence of non-missing features, termed oracle features, which are informative for imputation. To address these limitations, we propose kernel point imputation (KPI), a bi-level optimization framework for iterative missing data imputation. At the inner level, KPI adaptively learns the optimal model form for each feature within a reproducing kernel Hilbert space, addressing limitation (1). At the outer level, KPI utilizes oracle features as supervisory signals to iteratively refine the imputations, addressing limitation (2). Experiments demonstrate that KPI outperforms competitive imputation methods. Code is available at https://github.com/FMLYD/kpi.git.
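
To make the round-robin scheme described in the abstract concrete, the following is a minimal sketch of generic iterative imputation, not the authors' KPI implementation. It assumes NumPy, marks missing entries with NaN, assumes every column has at least one observed value, and fits an RBF kernel ridge regressor per feature as a simple stand-in for a model living in a reproducing kernel Hilbert space. The names `rbf_kernel` and `iterative_impute` and the parameters `n_iters`, `gamma`, and `lam` are hypothetical. Fully observed (oracle) columns appear here only as predictors; the bi-level outer loop that uses them as supervisory signals is not reproduced.

```python
import numpy as np


def rbf_kernel(A, B, gamma):
    """RBF kernel matrix between the rows of A and the rows of B."""
    sq = (A ** 2).sum(axis=1)[:, None] + (B ** 2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-gamma * np.maximum(sq, 0.0))


def iterative_impute(X, n_iters=5, gamma=0.5, lam=1e-2):
    """Round-robin imputation with one kernel ridge model per incomplete feature.

    Assumption: NaN marks a missing entry and no column is entirely missing.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    filled = X.copy()

    # Initial guess: replace each missing entry with its column mean.
    col_means = np.nanmean(X, axis=0)
    filled[missing] = np.take(col_means, np.where(missing)[1])

    # Only columns with missing entries are targets; fully observed
    # columns are never overwritten and serve purely as predictors.
    targets = [j for j in range(X.shape[1]) if missing[:, j].any()]

    for _ in range(n_iters):
        for j in targets:
            obs = ~missing[:, j]
            others = np.delete(filled, j, axis=1)  # all features except j

            # Fit kernel ridge regression on rows where feature j is observed.
            y = filled[obs, j]
            mu = y.mean()
            K = rbf_kernel(others[obs], others[obs], gamma)
            alpha = np.linalg.solve(K + lam * np.eye(int(obs.sum())), y - mu)

            # Predict feature j on the rows where it is missing.
            K_new = rbf_kernel(others[~obs], others[obs], gamma)
            filled[~obs, j] = K_new @ alpha + mu

    return filled
```

Calling `iterative_impute(X)` on an array containing NaNs returns an array of the same shape in which observed entries are left untouched and only the missing ones are replaced. The fixed kernel and regularization strength are arbitrary illustrative choices; adapting the model form per feature is the part the paper assigns to KPI's inner level.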

Cite

Text

Wang et al. "Iterative Missing Data Imputation with Model Form Adaptation and Non-Missing Feature Supervision." Advances in Neural Information Processing Systems, 2025.

Markdown

[Wang et al. "Iterative Missing Data Imputation with Model Form Adaptation and Non-Missing Feature Supervision." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/wang2025neurips-iterative/)

BibTeX

@inproceedings{wang2025neurips-iterative,
  title     = {{Iterative Missing Data Imputation with Model Form Adaptation and Non-Missing Feature Supervision}},
  author    = {Wang, Hao and Li, Zhengnan and Chen, Zhichao and Chen, Xu and He, Shuting and Liu, Guangyi and Li, Haoxuan and Lin, Zhouchen},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/wang2025neurips-iterative/}
}